The AI Research Deep Dive
36 episodes
6 days ago
From arXiv to insight: a daily tour of cutting-edge AI papers. The AI Research Deep Dive podcast dives into a new groundbreaking research paper every day. It combs through the most important details and results to give you a great idea of what the paper accomplishes and how it gets there.
Science
DeepSeek-OCR: Contexts Optical Compression
17 minutes 23 seconds
1 week ago

arXiv: https://www.arxiv.org/abs/2510.18234

This episode of "The AI Research Deep Dive" unpacks "DeepSeek-OCR," a paper that proposes an unconventional answer to one of AI's biggest bottlenecks: the long-context problem. The host explains how the cost of self-attention grows quadratically with sequence length, which makes processing long documents prohibitively expensive. Instead of tweaking the transformer, DeepSeek's "Contexts Optical Compression" reframes the problem: what if an image of text is treated as a highly compressed representation of that text? Listeners will learn about the specialized three-stage "DeepEncoder" that shrinks a high-resolution document image into a small set of vision tokens, achieving a 10:1 compression ratio (text tokens to vision tokens) with 97% accuracy. The episode explores how this method provides a state-of-the-art tool for document parsing and, more speculatively, offers a blueprint for a "biologically inspired memory" that could let AI systems retain far larger quantities of information.
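
To make the arithmetic behind the summary concrete, here is a minimal Python sketch of why a 10:1 token compression matters under quadratic attention. The token counts are illustrative assumptions, not figures from the paper, and the helper names `compression_ratio` and `attention_pairs` are hypothetical. The sketch counts only pairwise attention scores and ignores every other cost in a real model.

```python
def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """Ratio of original text tokens to the vision tokens that replace them."""
    if vision_tokens <= 0:
        raise ValueError("vision_tokens must be positive")
    return text_tokens / vision_tokens


def attention_pairs(sequence_length: int) -> int:
    """Number of pairwise attention scores for full self-attention: n * n."""
    return sequence_length * sequence_length


# Illustrative assumption: a page that would take 1000 text tokens
# is represented by 100 vision tokens instead.
text_tokens = 1000
vision_tokens = 100

ratio = compression_ratio(text_tokens, vision_tokens)
saving = attention_pairs(text_tokens) / attention_pairs(vision_tokens)

print(f"compression ratio: {ratio:.0f}:1")
print(f"attention scores reduced by a factor of {saving:.0f}")
```

Because attention cost is quadratic in sequence length, a 10:1 reduction in tokens cuts the number of pairwise attention scores by roughly a factor of 100 in this simplified count.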