🤏 DeepSeek-OCR: Contexts Optical Compression

Build Wiz AI Show

Build Wiz AI

149 episodes

5 days ago

> Building the future of products with AI-powered innovation. < Build Wiz AI Show is your go-to podcast for transforming the latest and most interesting papers, articles, and blogs about AI into an easy-to-digest audio format. Using NotebookLM, we break down complex ideas into engaging discussions, making AI knowledge more accessible. Have a resource you’d love to hear in podcast form? Send us the link, and we might feature it in an upcoming episode! 🚀🎙️

Technology

🤏 DeepSeek-OCR: Contexts Optical Compression

15 minutes 28 seconds

2 weeks ago

Welcome to the show, where we discuss DeepSeek-OCR and its investigation into optical 2D mapping as a way to compress long contexts, addressing the cost that Large Language Models pay as computation scales quadratically with sequence length. We explore the DeepEncoder, the core engine designed to achieve high compression ratios, which delivers near-lossless OCR precision (approximately 97%) even at a 10× token reduction, that is, roughly one vision token for every ten text tokens. The work also shows strong practical value: it achieves state-of-the-art document parsing performance on OmniDocBench while using the fewest vision tokens, offering a promising direction for future memory systems.
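
To make the numbers above concrete, here is a minimal Python sketch of the arithmetic. It assumes a hypothetical page of 1,000 text tokens and treats self-attention cost as proportional to the square of the sequence length; the function names and the page size are illustrative and are not taken from the paper or the episode.

```python
def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """Text tokens represented per vision token."""
    return text_tokens / vision_tokens


def relative_attention_cost(seq_len: int) -> int:
    """Pairwise attention interactions, assumed proportional to seq_len squared."""
    return seq_len * seq_len


# Hypothetical page: 1,000 text tokens rendered as an image
# and encoded into 100 vision tokens (an assumed example, not a measured one).
text_tokens = 1_000
vision_tokens = 100

ratio = compression_ratio(text_tokens, vision_tokens)
saving = relative_attention_cost(text_tokens) / relative_attention_cost(vision_tokens)

print(f"compression ratio: {ratio:.0f}x")
print(f"attention cost reduction under the quadratic assumption: {saving:.0f}x")
```

Under these assumptions, a 10× reduction in tokens corresponds to roughly a 100× reduction in pairwise attention work, which is why this kind of compression is attractive for long contexts.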