80k After Hours
The 80,000 Hours team
109 episodes
3 months ago
Resources on how to do good with your career — and anything else we here at 80,000 Hours feel like releasing.
Categories: Self-Improvement, Education, Society & Culture, Documentary
All content for 80k After Hours is the property of The 80,000 Hours team and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Highlights: #217 – Beth Barnes on the most important graph in AI right now — and the 7-month rule that governs its progress
40 minutes
4 months ago
AI models today have a 50% chance of successfully completing a task that would take an expert human one hour. Seven months ago, that number was roughly 30 minutes — and seven months before that, 15 minutes.
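
The three figures above (15 minutes, then 30 minutes, then one hour) are consistent with a time horizon that doubles about every seven months. Below is a minimal sketch of that arithmetic. It assumes the trend is exactly exponential and uses only the numbers quoted above. The function name and its parameters are illustrative, not METR's methodology, and the forward-looking values are a naive extrapolation rather than a forecast from the episode.

```python
def time_horizon_minutes(months_from_now: float,
                         current_minutes: float = 60.0,
                         doubling_months: float = 7.0) -> float:
    """Length of task (in expert-human minutes) a model completes with 50% success.

    Assumes a clean exponential trend: the horizon doubles every
    `doubling_months` months, anchored at `current_minutes` today.
    """
    return current_minutes * 2 ** (months_from_now / doubling_months)


# Negative offsets look back in time; positive offsets extrapolate forward.
for months in (-14, -7, 0, 7, 14):
    print(f"{months:+3d} months: {time_horizon_minutes(months):.0f} minutes")
```

Looking back 7 and 14 months reproduces the 30-minute and 15-minute figures from the paragraph above. Whether the doubling continues at the same pace is an open question.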

These are substantial, multi-step tasks requiring sustained focus: building web applications, conducting machine learning research, or solving complex programming challenges.

Today’s guest, Beth Barnes, is CEO of METR (Model Evaluation & Threat Research) — the leading organisation measuring these capabilities.

These highlights are from episode #217 of The 80,000 Hours Podcast: Beth Barnes on the most important graph in AI right now — and the 7-month rule that governs its progress, and include:

  • Can we see AI scheming in the chain of thought? (00:00:34)
  • We have to test model honesty even before they're used inside AI companies (00:05:48)
  • It's essential to thoroughly test relevant real-world tasks (00:10:13)
  • Recursively self-improving AI might even be here in two years — which is alarming (00:16:09)
  • Do we need external auditors doing AI safety tests, not just the companies themselves? (00:21:55)
  • A case against safety-focused people working at frontier AI companies (00:29:30)
  • Open-weighting models is often good, and Beth has changed her attitude about it (00:34:57)

These aren't necessarily the most important or even most entertaining parts of the interview — so if you enjoy this, we strongly recommend checking out the full episode!

And if you're finding these highlights episodes valuable, please let us know by emailing podcast@80000hours.org.

Highlights put together by Ben Cordell, Milo McGuire, and Dominic Armstrong