Manifund foxManifund
Home
Login
About
People
Categories
Newsletter
HomeAboutPeopleCategoriesLoginCreate
Mtcpaa avatarMtcpaa avatar
A.Abby

@Mtcpaa

Independent AI safety researcher. Built MTCP — the only benchmark measuring post-correction constraint persistence in LLMs. 181,448 evaluations across 32 production models. Three published papers. DOI: 10.17605/OSF.IO/DXGK5

https://mtcp.live
$0total balance
$0charity balance
$0cash balance

$0 in pending offers

About Me

I research whether AI models maintain behavioural constraints after being corrected mid-conversation. No existing benchmark tested this. I built one, ran it 181,448 times across 32 models from 13 providers, and found that no model achieves reliable post correction persistence. The findings have direct implications for EU AI Act compliance and enterprise AI deployment assurance. Published on OSF and SSRN.

Projects

MTCP: Post Correction Persistence Benchmark for Frontier LLMs

pending admin approval