
OpenAI
AI Alignment Research Engineer
Stream: OpenAI Safety Team
Juan is a researcher at OpenAI working on AI alignment and adversarial robustness. His public work includes contributions to model safety and refusal behavior for GPT-4, as well as instruction hierarchy training to improve robustness to jailbreaks and prompt injections.