Benchmarking and comparing different evaluation awareness metrics | Manifund