Here are some examples of projects I have been interested in, but I may be interested in other projects by the time this cohort starts:
I am a researcher at METR.
I think the development of AI is going to be a confusing time for the world. I want to help provide good evidence and methodologies for tracking AI development and risk, so humanity can make sensible decisions.
I've had different roles at different times, including leading task development and our monitoring stream. I like prototyping new kinds of evaluations. I think it's healthy to read transcripts. I'm interested in what capabilities matter for being a competent agent, and why current AI agents fall short. I feel lucky that I get to spend time building an understanding of the models.
I've previously spent time at the Centre on Long-Term Risk and FHI. Before that I studied physics at university, where I did malaria diagnostics research.
I'll meet with scholars 2x/week each. I'll also be generally available async and potentially for code review.
Capabilities:
Monitoring:
Various profiles could be a good fit.
Wanted:
Some of the following would be great but not essential:
Can independently find collaboraters, but not required
I'll provide a list of possible projects to pick from, and talk through the options before making a decision.
Scholars can also suggest their own projects.