Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 8: Parallelism

Stanford Online · Beginner ·🧠 Large Language Models ·1w ago
For more information about Stanford's online Artificial Intelligence programs, visit: https://stanford.io/ai To learn more about enrolling in this course, visit: https://online.stanford.edu/courses/cs336-language-modeling-scratch Follow along with the course schedule and syllabus, visit: https://cs336.stanford.edu/ Percy Liang Professor of Computer Science (and courtesy in Statistics) Tatsunori Hashimoto Assistant Professor of Computer Science View the course playlist: https://www.youtube.com/playlist?list=PLoROMvodv4rMqXOcazWaTUHhq-yembLCV
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

What's new in Prompt Optimizer: latest features and improvements
Learn how to optimize prompts with the latest features and improvements in Prompt Optimizer, a crucial tool for effective LLM interactions
Dev.to AI
AI vs LLM vs AI Agents vs Automation — What’s the Real Difference?
Understand the differences between AI, LLM, AI Agents, and Automation to clarify their roles in technology
Dev.to AI
PagedAttention: vLLM’s Solution to GPU Memory Waste
Learn how PagedAttention solves GPU memory waste for large language models (LLMs) and improve your LLM serving efficiency
Medium · ChatGPT
From 30 to 60 Tokens/Second: How I Got vLLM Running on 2x RTX 3090
Learn how to install and run vLLM on 2x RTX 3090 to achieve 60 tokens/second, a significant performance boost for LLM applications
Medium · LLM
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →