Manifund foxManifund
Home
Login
About
People
Categories
Newsletter
HomeAboutPeopleCategoriesLoginCreate
3

Pilot for new benchmark by Epoch AI

Technical AI safetyAI governance
🦑

Epoch Artificial Intelligence, Inc.

ActiveGrant
$200,000raised
$200,000funding goal
Fully funded and not currently accepting donations.

For confidentiality reasons, keeping this brief—more available to Manifund on request.

This is for the pilot of a new benchmark that I think could be very valuable for tracking and predicting future AI progress to AGI. Funding will go towards API costs, labelers, software engineer and part of researcher salaries.

This is a pilot for a potentially larger effort. I will be spending some of my time helping mentor the pilot.

Comments2Donations1Similar7
EpochAI avatar

Epoch AI

Investigating and informing the public about the trajectory of AI

AI governanceForecasting
9
11
$5.97K raised
Manraj avatar

MANRAJ SINGH

New way of Thinking about Benchmarks

Exploring ways of Benchmarking that do not get saturated over time

Technical AI safetyAI governance
1
0
$0 / $8K
Alex-Leader avatar

Alex Leader

Offensive Cyber Kill Chain Benchmark for LLM Evaluation

Measuring whether AI can autonomously execute multi-stage cyberattacks to inform deployment decisions at frontier labs

Science & technologyTechnical AI safetyAI governanceGlobal catastrophic risks
1
2
$0 / $3.85M
🥦

Hieu Minh Nguyen

Benchmarking and comparing different evaluation awareness metrics

LLMs often know when they are being evaluated. We’ll do a study comparing various methods to measure and monitor this capability.

Technical AI safetyAI governance
4
1
$3K raised
🍋

Jonas Vollmer

AI forecasting and policy research by the AI 2027 team

AI Futures Project

AI governanceForecasting
10
24
$44K raised
AmritanshuPrasad avatar

Amritanshu Prasad

Suav Tech, an AI Safety evals for-profit

General Support for an AI Safety evals for-profit

Technical AI safetyAI governanceGlobal catastrophic risks
4
0
$0 raised
majorjanyau avatar

Muhammad Ahmad

Building Frontier AI Governance Capacity in Africa (Pilot Phase)

A pilot to build policy and technical capacity for governing high-risk AI systems in Africa

Technical AI safetyAI governanceBiosecurityForecastingGlobal catastrophic risks
1
0
$0 raised