
CHAI
—
Postdoc
I mainly work on developing solutions-in-theory to the control problem. What I mean by a solution-in-theory is:
1. Could do superhuman long-term planning
2. Ongoing receptiveness to feedback about its objectives
3. No reason to escape human control to accomplish its objectives
4. No impossible demands on human designers/operators
5. No TODOs when defining how we set up the AI’s setting
6. No TODOs when defining any programs that are involved, except how to modify them to be tractable
You can see introductions to my work on this topic at michael-k-cohen.com/blog.