
The explicit goal of OpenAI, DeepMind and others is to create AGI. This is insanely risky. It keeps me up at night.

AIs smarter than us might:

🚨 Resist shutdown.
🚨 Resist us changing their goals.
🚨 Ruthlessly pursue goals, even if they know it's not what we want or intend.

Some people think I'm nuts for believing this. But they often come round once they hear the central arguments.

At the core of the AI doom argument are two big ideas:

💡 Instrumental Convergence: the idea that almost any final goal gives a capable AI reason to pursue subgoals like self-preservation and protecting its current goals.
💡 The Orthogonality Thesis: the idea that virtually any level of intelligence can be combined with virtually any final goal.

❌ If you don't understand these ideas, you won't truly understand why some AI researchers are so worried about AGI or superintelligence.

Oxford philosopher Rhys Southan joined me to explain the situation.

💡 Rhys Southan and his co-authors Helena Ward and Jen Semler argue that powerful AIs might NOT resist having their goals changed. Possibly a fatal flaw in the Instrumental Convergence Thesis.

This would be a BIG DEAL. It would mean we could modify powerful AIs if they go wrong.

While I don't fully agree with their argument, it radically changed how I understand the Instrumental Convergence Thesis and forced me to rethink what it means for AIs to have goals.

Check out the paper "A Timing Problem for Instrumental Convergence" here: https://link.springer.com/article/10.1007/s11098-025-02370-4