
Why do GenAI systems confidently state incorrect medical facts instead of saying "I don't know"? Groundbreaking research from OpenAI and Georgia Tech reveals that AI hallucinations aren't bugs to be fixed; they're inevitable consequences of how these systems are trained. This episode explores the "singleton problem" (facts that appear only once in training data), which makes AI systematically unreliable on rare facts, connects it to our previous discussion of AI benchmark saturation (Episode 9), and explains why the same evaluation methods that produce impressive test scores actually reward confident guessing over appropriate uncertainty. For medical faculty evaluating AI tools, understanding these statistical realities is crucial for teaching students, conducting research, and developing institutional policies that account for AI's fundamental limitations.