Teaching PyTorch To Read Your Worst PDFs With Docling - Mingxuan Zhao, Peter Staar & Carol Chen

PyTorch · Intermediate ·🛠️ AI Tools & Apps ·3w ago
Skills: RAG Basics90%
Teaching PyTorch To Read Your Worst PDFs With Docling - Mingxuan Zhao & Peter Staar, IBM & Carol Chen, Red Hat Building production RAG pipelines starts with a problem most teams underestimate: getting clean, structured data out of real-world documents. PDFs lose table structure, figures get separated from captions, and multi-column layouts become unreadable. Before your PyTorch models even see your data, crucial information is already lost. Docling is an open-source, MIT-licensed document parsing library that uses PyTorch-based deep learning models to understand documents the way humans read them. It preserves hierarchy, extracts structured data from tables and figures, and supports over ten common file formats through a consistent API. Because everything runs locally, it integrates cleanly into PyTorch-native workflows with low latency and no data leaving your infrastructure. In this talk, I'll walk through Docling's PyTorch-powered architecture and show how to build document processing pipelines for RAG and other GenAI applications. I'll also share the architecture of real-world applications of Docling and how it has improved workflows. You'll leave with practical patterns for connecting Docling to your own PyTorch-based GenAI stack.
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Mental Algorithms: How AI Changes the Cost of Thinking
Discover how AI impacts the cost of thinking by altering mental effort and algorithms, and why it matters for professionals
Dev.to AI
The AI Content System I Built to Generate Viral LinkedIn Posts Started Bringing Clients…
Learn how to build an AI content system to generate viral LinkedIn posts and attract clients
Medium · Programming
$5,000/Month AI Income: Local Business Review Translation Service
Learn how to create a $5,000/month AI-powered translation service for local business reviews and boost your income
Medium · ChatGPT
Gmail's New AI Features Are Live—And They're About to Change What You Actually See
Gmail's new AI features are live, changing what users see in their inboxes, and it's crucial to understand how AI is transforming email experiences
Medium · Programming
Up next
Claude Opus 4.7 + NotebookLM is INSANE!
Julian Goldie SEO
Watch →