Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 13: Data (Sources, Datasets)

Stanford Online · Beginner ·🧠 Large Language Models ·2h ago
For more information about Stanford's online Artificial Intelligence programs, visit: https://stanford.io/ai To learn more about enrolling in this course, visit: https://online.stanford.edu/courses/cs336-language-modeling-scratch Follow along with the course schedule and syllabus, visit: https://cs336.stanford.edu/ Percy Liang Professor of Computer Science (and courtesy in Statistics) Tatsunori Hashimoto Assistant Professor of Computer Science View the course playlist: https://www.youtube.com/playlist?list=PLoROMvodv4rMqXOcazWaTUHhq-yembLCV
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

What Happens When Your Defense Hits a Hard Floor
Learn how prompt injection and converging failures impact LLMs and their security implications
Medium · LLM
LLMs are Functions, not Brains — aiHelpDesk perspective
Understand LLMs as functions, not brains, to unlock their full potential and avoid common pitfalls
Medium · LLM
Gemma 4 Didn't Just Get Smarter. It Became a Different Kind of Model. Here's What the Agentic Numbers Actually Mean.
Gemma 4's new architecture represents a significant shift in open-weight models, enabling more efficient and effective processing of complex data, which is crucial for AI advancements
Dev.to · Om Shree
90. Phase 8 Capstone: Build a Full AI Application
Build a full AI application, DocuMind, that enables intelligent conversations about uploaded documents
Dev.to AI
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →