@Mtcpaa
Independent AI safety researcher. Built MTCP — the only benchmark measuring post-correction constraint persistence in LLMs. 181,448 evaluations across 32 production models. Three published papers. DOI: 10.17605/OSF.IO/DXGK5
https://mtcp.live$0 in pending offers
I research whether AI models maintain behavioural constraints after being corrected mid-conversation. No existing benchmark tested this. I built one, ran it 181,448 times across 32 models from 13 providers, and found that no model achieves reliable post correction persistence. The findings have direct implications for EU AI Act compliance and enterprise AI deployment assurance. Published on OSF and SSRN.
pending admin approval