Technology & Future September 16, 2020 Andrew Critch on AI Research Considerations for Human Existential Safety
Technology & Future April 15, 2020 AIAP: An Overview of Technical AI Alignment in 2018 and 2019 with Buck Shlegeris and Rohin Shah
Technology & Future February 18, 2020 AIAP: On the Long-term Importance of Current AI Policy with Nicolas Moës and Jared Brown
Technology & Future January 16, 2020 AIAP: Identity and the AI Revolution with David Pearce and Andrés Gómez Emilsson
Technology & Future December 16, 2019 AIAP: On DeepMind, AI Safety, and Recursive Reward Modeling with Jan Leike
Technology & Future October 8, 2019 AIAP: Human Compatible: Artificial Intelligence and the Problem of Control with Stuart Russell
Technology & Future September 17, 2019 AIAP: Synthesizing a human's preferences into a utility function with Stuart Armstrong
Technology & Future May 23, 2019 AIAP: On Consciousness, Qualia, and Meaning with Mike Johnson and Andrés Gómez Emilsson
Technology & Future April 25, 2019 AIAP: An Overview of Technical AI Alignment with Rohin Shah (Part 2)
Technology & Future April 11, 2019 AIAP: An Overview of Technical AI Alignment with Rohin Shah (Part 1)
Technology & Future February 21, 2019 AIAP: Human Cognition and the Nature of Intelligence with Joshua Greene
Technology & Future January 17, 2019 AIAP: Cooperative Inverse Reinforcement Learning with Dylan Hadfield-Menell (Beneficial AGI 2019)
Technology & Future December 18, 2018 AIAP: Inverse Reinforcement Learning and the State of AI Alignment with Rohin Shah
Technology & Future September 18, 2018 AIAP: Moral Uncertainty and the Path to AI Alignment with William MacAskill
Technology & Future August 16, 2018 The Metaethics of Joy, Suffering, and Artificial Intelligence with Brian Tomasik and David Pearce
Technology & Future July 16, 2018 AIAP: AI Safety, Possible Minds, and Simulated Worlds with Roman Yampolskiy
Technology & Future June 14, 2018 AIAP: Astronomical Future Suffering and Superintelligence with Kaj Sotala
Technology & Future April 25, 2018 AIAP: Inverse Reinforcement Learning and Inferring Human Preferences with Dylan Hadfield-Menell