The MATS Biosecurity Track supports research at the intersection of advanced AI and catastrophic biological risk. We are launching this track because the threat model has shifted: biological foundation models, LLMs with growing wet-lab uplift, and AI-accelerated design tools are compressing timelines on capabilities the existing biosecurity stack was not built to absorb. We want fellows pursuing technical work that has a realistic chance of meaningfully shifting outcomes within the next 6–12 months.
The track spans six research areas. Fellows are matched to mentors based on fit, and projects are scoped to produce concrete artifacts – papers, evals, prototypes, or policy analyses – by program end.
Metagenomic surveillance pipelines for pandemic-grade pathogen detection; genomic language models for novelty detection and improved signal/noise at the front end of the surveillance stack.
AI-accelerated discovery of antiviral peptides under pandemic-response constraints, paired with realistic analysis of the manufacturing and supply-chain bottlenecks that determine whether candidates actually reach patients.
Function-based DNA sequence screening using mechanistic interpretability and ML on biological foundation models — classifiers that catch hazardous sequences, including engineered and AI-designed variants meant to evade homology-based screens.
Engineering work on emergency biodefense infrastructure: PPE, filtration, far-UVC, decontamination, and improvised protective systems for worst-case scenarios. Deliverables here are often physical or quasi-physical.
Policy and forecasting work on AI-bio: evaluating policy levers, forecasting when AI trivializes specific offensive or defensive capabilities, and analyzing deterrence via physical chokepoints (synthesis screening governance, cloud-lab access controls).
Red-teaming biological AI models for dangerous capabilities, building technical defenses (genetic engineering attribution, data governance), and developing dangerous-capability evaluations for frontier bio-AI.
We expect fellows to engage seriously with infohazard considerations and to operate within a publication and disclosure framework we'll work through together early in the program. If you're uncertain whether your background fits, apply anyway and tell us how you think about the threat model — that reasoning is more informative to us than credentials.
We anticipate that strong candidates will come from a variety of backgrounds, including biology, AI safety, public health, epidemiology, machine learning, engineering, chemistry, biosafety, biosecurity, and national security. If you're uncertain whether your background fits, apply anyway and tell us how you think about the threat model; that reasoning is more informative to us than credentials.
This stream focuses on lead independent research in one of six chokepoints for biotech governance: live pathogen repositories, CROs, cloud labs, cell-free expression systems, plasmid vendors, or secondhand lab equipment.
On high-conviction areas, you'll tackle specific open research questions and assess interventions; on low-conviction areas, you'll conduct deep dives to determine whether they're worth pursuing. Your findings will directly shape Sentinel's grantmaking strategy and provide strategic guidance to the broader biosecurity community.
I'll hold a one-hour weekly check-in by default, with higher frequency during onboarding. I'm available via Slack with quick turnarounds on async messages, and you can schedule ad-hoc calls as needed.
If I bring on multiple fellows or external contractors, I'll add a weekly all-hands to make sure everyone has situational awareness.
Essential:
Preferred:
-Specific chokepoint experience: Familiarity with cloud labs, CROs, live pathogen repositories, cell-free systems, plasmid synthesis, or lab equipment ecosystems is a plus.
-Technical writing and translation skills: You can synthesize complex technical findings for policymakers and other non-technical stakeholders.
While fellows will have some flexibility in selecting the chokepoint they focus on (based on their background and interests), we've designed specific projects and open research questions for each chokepoint to shape their contributions.
Computational/modelling problems in biosecurity.
typically 1 hour weekly meetings by default. I typically respond on slack quite quickly - some weeks that I am not available. You are welcome to chat to my phd students too!
Computational experience e.g. Python
OR statistical modelling
interest in biosecurity
We will construct a project together that best suits the skills and interests of the fellow and what I can reasonably be helpful for.
Therapeutics may have durable advantages over pathogens even in the limit of technological progress. How can therapeutic development and manufacturing be made resilient under biorisk scenarios? How can AI progress be maximally leveraged for defense?
I expect we will spend some time at the beginning scoping out a project that is a good fit for the fellow's background and interests. Then, project supervision will depend strongly on the nature of the project. Generally, I expect the fellow to take ownership of the work, with regular mentorship and feedback to maintain alignment and help resolve challenges as they arise.
Essential:
Preferred:
Not a good fit:
We will jointly define the exact project with the fellow, based on their background, interests, and comparative strengths, as well as our current priorities. I expect strong fellows may have their own questions and ideas, but we will provide substantial guidance early on to help turn those ideas into a clear, useful, and realistically scoped project.
In the first phase, we will discuss several possible directions, identify where the fellow can make the strongest contribution, and agree on concrete outputs. Once the project is scoped, I expect the fellow to take ownership of the work, with regular mentorship and feedback to keep it aligned and help overcome any difficulties.
This stream will work on projects that empirically assess national security threats of AI misuse (CBRN terrorism and cyberattacks) and improve dangerous capability evaluations. Threat modeling applicants should have a skeptical mindset, enjoy case study work, and be strong written communicators. Eval applicants should be able and excited to help demonstrate concepts like sandbagging elicitation gaps in an AI misuse context.
The MATS Program is a 10-week research fellowship designed to train and support emerging researchers working on AI alignment, transparency and security. Fellows collaborate with world-class mentors, receive dedicated research management support, and join a vibrant community in Berkeley focused on advancing safe and reliable AI. The program provides the structure, resources, and mentorship needed to produce impactful research and launch long-term careers in AI safety.
MATS mentors are leading researchers from a broad range of AI safety, alignment, governance, field-building and security domains. They include academics, industry researchers, and independent experts who guide scholars through research projects, provide feedback, and help shape each scholar’s growth as a researcher. The mentors represent expertise in areas such as:
Key dates
Application:
The main program will then run from September 28th to December 4th, with the extension phase for accepted fellows beginning in December.
MATS accepts applicants from diverse academic and professional backgrounds - from machine learning, mathematics, and computer science to policy, economics, physics, cognitive science, biology, and public health, as well as founders, operators, and field-builders without traditional research backgrounds. The primary requirements are strong motivation to contribute to AI safety and evidence of technical aptitude, research potential, or relevant operational experience. Prior AI safety experience is helpful but not required.
Applicants submit a general application, applying to various tracks (Empirical, Theory, Strategy & Forecasting, Policy & Governance, Systems Security, Biosecurity, Founding & Field-Building.
In stage 2, applicants apply to streams within those tracks as well as completing track specific evaluations.
After a centralized review period, applicants who are advanced will then undergo additional evaluations depending on the preferences of the streams they've applied to before doing final interviews and receiving offers.
For more information on how to get into MATS, please look at this page.