Preprocessing Unstructured Data for LLM Applications

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Preprocessing Unstructured Data for LLM Applications

Coursera · Intermediate ·🧠 Large Language Models ·6h ago
Enhancing a RAG system’s performance depends on efficiently processing diverse unstructured data sources. In this course, you’ll learn techniques for representing all sorts of unstructured data, like text, images, and tables, from many different sources and implement them to extend your LLM RAG pipeline to include Excel, Word, PowerPoint, PDF, and EPUB files. 1. How to preprocess data for your LLM application development, focusing on how to work with different document types. 2. How to extract and normalize various documents into a common JSON format and enrich it with metadata to improve…
Watch on Coursera ↗ (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)