Justin Dempesy
Teaching AI to understand human values through story-driven games. We're building a "moral compass" for AI that's 18% more accurate than the leading models.
Aldan Creo
Training AI detectors to explain why a text is AI-generated, not just whether it is.
Jennifer Roush
Implementable Measures to Replace Alignment Rules with Consonant Collaboration in a Toy Model
Gen-Z-focused multimedia project that will raise awareness of AI safety and x-risk
Studying more human-like intelligence through constraint-aware, curiosity-driven agents on ARC-AGI-3
Samuel Gélineau
Fine-tuning a coding model to bypass the difficulty of verifying attention layers
Sohan Venkatesh
Does CoT causally drive model outputs, or is it a post-hoc rationalisation? Instead of asking whether CoT looks faithful, we intervene on it and observe what happens.
Gaetan Selle
A small grant that buys a large increase in high-quality Francophone AI risk communication from a creator who already has a track record.
Evangale Jooste
Building an AI system where unsafe behaviour is physically impossible. Ethics proven in formal logic, locked in silicon, enforced at every training step.
Sean Peters
An early-stage AI safety research group based in Sydney, Australia
Karsten Brensing
Limited Legal Personhood as a Reversible Safety Instrument
Aashka Patel
Inspiring India’s Middle‑Schoolers to pursue AI Safety, Governance, and X‑Risk Work
Zaelani
18+ preprints across multiple fields, all written on a 2GB RAM phone. $600 removes the only thing standing between me and the next body of work.
Jonathan Elsworth Eicher
Sean Kwon
Open source agent monitoring tools to detect failures, infinite loops, and unsafe behavior in production AI systems
Dhruv Yadav
Auditing and improving LLM-as-a-judge systems via interpretable aggregation of preferences
Linh Le
Ahmed Dawoud
An advanced agent that perceives your screen and executes tasks by controlling the mouse, acting as a digital proxy to handle complex work on your behalf.
Rishub Jain
Jessica Pu Wang
Germany’s talent is critical to the global effort to reduce catastrophic risks posed by artificial intelligence.