AI alignment researcher leading the adversarial training team at Redwood Research.
Formerly on Alignment at OpenAI.
I helped found MIT Effective Altruism and ran it for a year before graduating in 2017.