Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
History
Music
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/02/5f/d2/025fd23a-3ac1-782b-1a9c-b7c616c75e46/mza_9134582053710334182.jpg/600x600bb.jpg
80k After Hours
The 80,000 Hours team
109 episodes
3 months ago
Resources on how to do good with your career — and anything else we here at 80,000 Hours feel like releasing.
Show more...
Self-Improvement
Education,
Society & Culture,
Documentary
RSS
All content for 80k After Hours is the property of The 80,000 Hours team and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Resources on how to do good with your career — and anything else we here at 80,000 Hours feel like releasing.
Show more...
Self-Improvement
Education,
Society & Culture,
Documentary
https://img.transistor.fm/NBAk2_jEZ05nRV1Dg7kiRxEnNIe7ecYfusMuoUoLFgA/rs:fill:3000:3000:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS8wMDkx/NzJiMjllMzA0Nzlh/M2NjODgzOWQxMGRi/YzlmYS5qcGc.jpg
Highlights: #214 – Buck Shlegeris on controlling AI that wants to take over – so we can use it anyway
80k After Hours
41 minutes
6 months ago
Highlights: #214 – Buck Shlegeris on controlling AI that wants to take over – so we can use it anyway

Most AI safety conversations centre on alignment: ensuring AI systems share our values and goals. But despite progress, we’re unlikely to know we’ve solved the problem before the arrival of human-level and superhuman systems in as little as three years.

So some — including Buck Shlegeris, CEO of Redwood Research — are developing a backup plan to safely deploy models we fear are actively scheming to harm us: so-called “AI control.” While this may sound mad, given the reluctance of AI companies to delay deploying anything they train, not developing such techniques is probably even crazier.

These highlights are from episode #214 of The 80,000 Hours Podcast: Buck Shlegeris on controlling AI that wants to take over – so we can use it anyway, and include:

  • What is AI control? (00:00:15)
  • One way to catch AIs that are up to no good (00:07:00)
  • What do we do once we catch a model trying to escape? (00:13:39)
  • Team Human vs Team AI (00:18:24)
  • If an AI escapes, is it likely to be able to beat humanity from there? (00:24:59)
  • Is alignment still useful? (00:32:10)
  • Could 10 safety-focused people in an AGI company do anything useful? (00:35:34)

These aren't necessarily the most important or even most entertaining parts of the interview — so if you enjoy this, we strongly recommend checking out the full episode!

And if you're finding these highlights episodes valuable, please let us know by emailing podcast@80000hours.org.

Highlights put together by Ben Cordell, Milo McGuire, and Dominic Armstrong

80k After Hours
Resources on how to do good with your career — and anything else we here at 80,000 Hours feel like releasing.