Johannes Treutlein

Anthropic

Johannes completed the MATS Summer 2022 Cohort under the mentorship of Evan Hubinger (then a Research Fellow at MIRI). As a result of MATS, Johannes co-authored the paper Conditioning Predictive Models: Risks and Strategies with Evan as a lead author. He also published a follow-up paper on Incentivizing honest performative predictions with proper scoring rules at the UAI 2023 conference. After MATS, Johannes started a PhD in Computer Science at CHAI. Since 2024, he Johannes has been working at Anthropic on alignment stress-testing.

The Summer 2022 cohort was MATS's first full-scale program, with 31 scholars and 7 mentors from leading AI safety organizations including OpenAI, ARC, MIRI, EleutherAI, and Aligned AI. The program ran 5 weeks online followed by 8 weeks in-person in Berkeley, where scholars conducted independent research under expert mentorship, participated in educational seminars, and built community with peers in the Berkeley AI safety ecosystem.

MATS helped me get deeper into AI safety research by motivating me to get up to speed with current research and giving me access to mentorship from an expert in AI safety, as well as a smart and talented cohort and a large network of researchers. It also provided infrastructure such as office space in Berkeley and a generous stipend. SERI MATS worked as a matchmaker between Evan Hubinger and me and thus helped me get involved in his projects, which would have been harder to do otherwise. I feel like I have developed faster as a researcher since doing MATS.

Johannes Treutlein