076 - Deepseek OCR

https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/e9/83/71/e9837137-75fd-d06b-8392-b1545a4a57e7/mza_7068534505453214339.jpg/600x600bb.jpg

Prompt und Antwort

KI-Gilde

80 episodes

2 days ago

Ein KI-generierter Podcasts rund um die Entwicklung von und mit KI. News, Updates und interessante Hintergrundinformationen für den professionellen Einsatz von KI hinaus. Ohne Hype und Buzzwords. Die KI-Gilde ist ein Angebot der YnotBetter UG.

Technology

RSS

All content for Prompt und Antwort is the property of KI-Gilde and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

Technology

https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_episode/43606809/43606809-1761509407550-bf7d1e47913f.jpg

076 - Deepseek OCR

Prompt und Antwort

8 minutes 7 seconds

1 week ago

076 - Deepseek OCR

Im KI Gilde Podcast testen wir Deepseek OCR, das momentan "ziemlich viel Furore macht".

Deepseek OCR ist mehr als nur eine Texterkennung: Es erfasst Dokumente visuell (fast wie ein Mensch), nutzt "Kontexts optical Compression" und erreicht eine Kompression um das 7- bis 20-fache.

Erfahre, warum das Modell ideal für die Verarbeitung komplexer Dokumente ist:

Es erkennt Layouts und Tabellenstrukturen erstaunlich gut (über 92 % Genauigkeit bei Tabellen) und liefert strukturierte Daten, z.B. als sauberes Markdown.

Wir klären, wie Deepseek OCR als maßgeschneiderte Basis für RAG-Pipelines dient und wo es Tesseract überlegen ist. Achtung: Das Modell ist zwar Open Source, benötigt aber zwingend eine dedizierte Nvidia Grafikkarte (GPU) und ist keine reine CPU-Lösung.