Category

AI Alignment Podcast

23 episodes

Andrew Critch on AI Research Considerations for Human Existential Safety

Technology & Future September 16, 2020

AIAP: An Overview of Technical AI Alignment in 2018 and 2019 with Buck Shlegeris and Rohin Shah

Technology & Future April 15, 2020

AIAP: On Lethal Autonomous Weapons with Paul Scharre

Technology & Future March 16, 2020

AIAP: On the Long-term Importance of Current AI Policy with Nicolas Moës and Jared Brown

Technology & Future February 18, 2020

AIAP: Identity and the AI Revolution with David Pearce and Andrés Gómez Emilsson

Technology & Future January 16, 2020

AIAP: On DeepMind, AI Safety, and Recursive Reward Modeling with Jan Leike

Technology & Future December 16, 2019

AIAP: Machine Ethics and AI Governance with Wendell Wallach

Technology & Future November 15, 2019

AIAP: Human Compatible: Artificial Intelligence and the Problem of Control with Stuart Russell

Technology & Future October 8, 2019

AIAP: Synthesizing a human's preferences into a utility function with Stuart Armstrong

Technology & Future September 17, 2019

AIAP: China's AI Superpower Dream with Jeffrey Ding

Technology & Future August 16, 2019

AIAP: On Consciousness, Qualia, and Meaning with Mike Johnson and Andrés Gómez Emilsson

Technology & Future May 23, 2019

AIAP: An Overview of Technical AI Alignment with Rohin Shah (Part 2)

Technology & Future April 25, 2019

AIAP: An Overview of Technical AI Alignment with Rohin Shah (Part 1)

Technology & Future April 11, 2019

AIAP: AI Alignment through Debate with Geoffrey Irving

Technology & Future March 7, 2019

AIAP: Human Cognition and the Nature of Intelligence with Joshua Greene

Technology & Future February 21, 2019

AIAP: Cooperative Inverse Reinforcement Learning with Dylan Hadfield-Menell (Beneficial AGI 2019)

Technology & Future January 17, 2019

AIAP: Inverse Reinforcement Learning and the State of AI Alignment with Rohin Shah

Technology & Future December 18, 2018

AIAP: On Becoming a Moral Realist with Peter Singer

Technology & Future October 18, 2018

AIAP: Moral Uncertainty and the Path to AI Alignment with William MacAskill

Technology & Future September 18, 2018

The Metaethics of Joy, Suffering, and Artificial Intelligence with Brian Tomasik and David Pearce

Technology & Future August 16, 2018

AIAP: AI Safety, Possible Minds, and Simulated Worlds with Roman Yampolskiy

Technology & Future July 16, 2018

AIAP: Astronomical Future Suffering and Superintelligence with Kaj Sotala

Technology & Future June 14, 2018

AIAP: Inverse Reinforcement Learning and Inferring Human Preferences with Dylan Hadfield-Menell

Technology & Future April 25, 2018
