Stanford CS25: Transformers United V6 I Distinct Modes of Generalization from Parameters and Context

Stanford Online · Advanced ·🧠 Large Language Models ·4h ago
For more information about Stanford’s graduate programs, visit: https://online.stanford.edu/graduate-education May 7, 2026 This seminar covers: • Two methods for teaching information to language models: training (updating parameters) or in-context learning (providing information in prompts) • Striking differences in the types of generalization that models make when they learn information via these two routes • Three different strategies that can help bridge the gap, based on data augmentation, retrieval, and RL Follow along with the seminar schedule. Visit: https://web.stanford.edu/class/cs25/ Guest Speaker: Andrew Lampinen (Anthropic) Instructors: • Steven Feng, Stanford Computer Science PhD student and NSERC PGS-D scholar • Karan P. Singh, Electrical Engineering PhD student and NSF Graduate Research Fellow in the Stanford Translational AI Lab • Michael C. Frank, Benjamin Scott Crocker Professor of Human Biology Director, Symbolic Systems Program • Christopher Manning, Thomas M. Siebel Professor in Machine Learning, Professor of Linguistics and of Computer Science, Co-Founder and Senior Fellow of the Stanford Institute for Human-Centered Artificial Intelligence (HAI)
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

The RAG tool that auto-generates Q&A pairs from your documents
Learn to auto-generate Q&A pairs from documents using RAG tool and improve your document management
Dev.to · retrovirusretro
How to Build Secure AI: Implementing Guardrails for Enterprise LLM
Learn to build secure AI by implementing guardrails for enterprise LLMs, going beyond prompt engineering safety for production-ready defense-in-depth architecture
Medium · LLM
5 Chinese AI tools with 100K+ stars that the West is ignoring
Discover 5 Chinese AI tools with 100K+ stars on GitHub that the Western world is overlooking, and learn how to explore and utilize them
Dev.to AI
OpenAI claims it solved an 80-year-old math problem — for real this time
OpenAI's reasoning model claims to have solved an 80-year-old math problem, with mathematicians verifying its solution
TechCrunch AI
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →