In this stream, I’m interested in developing concrete, actionable R&D agendas for post-AGI institutions and AI resilience. For post-AGI institutions, I’m especially interested in what infrastructure would be needed to make super-cooperative AGI or “Coasean bargaining at scale” possible. For AI resilience, I’m interested in follow-on work to airesilience.net that moves from high-level motivation to detailed proposals.
I’m open to fellows proposing concrete projects within either of two broad directions. I am especially interested in projects that turn high-level theses into tractable R&D agendas, prototypes, institutions, or field-building bets.
Direction 1: Cooperative AGI
The central question is: what would it take for humans and advanced AI systems to form stable, Pareto-improving coalitions, rather than ending up in unilateral domination, competitive erosion, or epistemic collapse?
AGI could radically lower the transaction costs of cooperation: better world models, better bargaining, better contract execution, better collective reflection. But it could also raise them: scalable manipulation, epistemic flooding, eroded ground truth, and AI systems whose moral reasoning is unstable, shallow, or strategically distorted.
I am especially (but not exclusively interested in working on)
Direction 2: AI Resilience.
Follow-on work to https://airesilience.net/, building civilisation’s ability to withstand, adapt to, and recover from AI-driven shocks. This could include:
I work on AI assurance and civilisational resilience: building the technical foundations for independently verifiable claims about the behaviour of AI systems and the infrastructure they run on — from secure silicon to software to multi-stakeholder coordination. Currently, I'm a Programme Director at the UK's Advanced Research and Invention Agency (ARIA). I run the Safeguarded AI programme, a ~£60M R&D programme building a mathematical assurance toolkit that lets fleets of AI agents produce formally verified artifacts at unprecedented speed and scale - from verified software, to microelectornics to a wide range of cyberphyiscal control systems. Before ARIA, I co-founded and led Principles of Intelligence (formerly PIBBSS), a research organisation facilitating knowledge transfer from interdisciplinary sciences into AI safety. I've also been a Research Affiliate with the Alignment of Complex Systems research group, and a Research Manager at the Future of Humanity Institute, University of Oxford.
60-minute weekly 1:1s + async written exchange (drafts, slack, etc.). Ad-hoc additional meetings as needed. The fellow is responsible for driving the project; my role is high-level steering, strategic input and sound boarding.
I am open to working with builder/entrepreneur, strategist or researcher types.
Essential:
Not a good fit:
Fellows are welcome to propose one or several concrete projects within the above directions. We will work together to refine the project, pressure-test its theory of change, and scope it into something tractable. The goal is to have converged on a concrete project within the first two weeks.