Nicholas Carlini

Anthropic

—

Research Scientist

Links

Focus

Control, Model Organisms, Red-Teaming, Scheming and Deception

H-index

Stream

Anthropic and OpenAI Megastream

Nicholas is a researcher working at the intersection of machine learning and computer security. Currently he works at Anthropic studying what bad things you could do with, or do to, language models; he likes to break things.