
Google DeepMind
—
Research Scientist
Focus
Scheming and Deception, Dangerous Capability Evals, Control, Red-Teaming
H-index
18
Stream
Victoria Krakovna
I am a research scientist on the AGI Safety & Alignment team at Google DeepMind. I am currently focusing on deceptive alignment and AI control (recent work: https://arxiv.org/abs/2505.01420), particularly scheming propensity evaluations and honeypots. My past research includes power-seeking incentives, specification gaming, and avoiding side effects.