I have seen some of amp's work, and it is pretty interesting, and novel in the grand scheme of things
@GarretteBaker
I'm an independent alignment researcher, self-taught in machine learning, convex optimization, and probability theory
https://github.com/GarretteBaker/$0 in pending offers
For approximately the past year, I’ve been doing alignment research full-time, working on a variety of approaches, and trying to understand the problem in-depth enough to invent new ones. If funded, I plan to continue doing approximately the same work as before, which has historically been scalable mechanistic interpretability, formal and prosaic corrigibility, reflective stability, and a bunch of value theory stuff. Along with lots of upskilling in convex optimization, machine learning, neuroscience, and economics.
My current project is an attempt to connect the tools & theory of singular learning theory with our knowledge of the inductive biases and loss landscapes of large language models.
Garrett Baker
2 months ago
I have seen some of amp's work, and it is pretty interesting, and novel in the grand scheme of things
For | Date | Type | Amount |
---|---|---|---|
AI Safety Reading Group at metauni [Retrospective] | about 1 month ago | project donation | 10 |
Act I: Exploring emergent behavior from multi-AI, multi-human interaction | about 2 months ago | project donation | 96 |
Act I: Exploring emergent behavior from multi-AI, multi-human interaction | about 2 months ago | project donation | 50 |
Lightcone Infrastructure | 2 months ago | project donation | 95 |
<176bd26d-9db4-4c7a-98c0-ba65570fb44c> | 2 months ago | tip | +1 |
Next Steps in Developmental Interpretability | 2 months ago | project donation | 200 |
Lightcone Infrastructure | 2 months ago | project donation | 50 |
Manifund Bank | 3 months ago | deposit | +500 |