Jason Wolfe

OpenAI

Member of Technical Staff

Focus

Scalable Oversight, Control, Monitoring, Interpretability, Adversarial Robustness, Red-Teaming, Alignment Training, Security, Scheming and Deception, Multi-Agent Safety

Jason is a Member of Technical Staff at OpenAI working on alignment and model behavior.