
Google DeepMind
—
Research Scientist
I am a research scientist on the AGI Safety & Alignment team at Google DeepMind. I currently focus on deceptive alignment and AI control (recent work: https://arxiv.org/abs/2505.01420), in particular scheming propensity evaluations and honeypots. My past research includes power-seeking incentives, specification gaming, and avoiding side effects.