Sholto Douglas & Trenton Bricken — How LLMs actually think

Dwarkesh Patel · Advanced ·📄 Research Papers Explained ·2y ago

Skills: LLM Foundations90%

Had so much fun chatting with my good friends Trenton Bricken and Sholto Douglas on the podcast. No way to summarize it, except: * This is the best context dump out there on how LLMs are trained, what capabilities they're likely to soon have, and what exactly is going on inside them. * You would be shocked how much of what I know about this field, I've learned just from talking with them. * To the extent that you've enjoyed my other AI interviews, now you know why. There's a transcript with links to all the papers the boys were throwing down - may help you follow along. 𝐄𝐏𝐈𝐒𝐎𝐃𝐄 𝐋𝐈𝐍𝐊𝐒 * Transcript: https://www.dwarkeshpatel.com/p/sholto-douglas-trenton-bricken * Spotify: https://open.spotify.com/episode/2dtDauiE4v8ldNRqPFq0uP?si=7S4n69QuTjeYz0lZwW4xIw * Apple Podcasts: https://podcasts.apple.com/us/podcast/sholto-douglas-trenton-bricken-how-to-build-understand/id1516093381?i=1000650748087 * Trenton Bricken's twitter: https://twitter.com/TrentonBricken * Sholto Douglas's twitter: https://twitter.com/_sholtodouglas 𝐓𝐈𝐌𝐄𝐒𝐓𝐀𝐌𝐏𝐒 00:00:00 - Long contexts 00:17:04 - Intelligence is just associations 00:33:27 - Intelligence explosion & great researchers 01:07:44 - Superposition & secret communication 01:23:26 - Agents & true reasoning 01:35:32 - How Sholto & Trenton got into AI research 02:08:08 - Are feature spaces the wrong way to think about intelligence? 02:22:04 - Will interp actually work on superhuman models 02:45:57 - Sholto's technical challenge for the audience 03:04:49 - Rapid fire

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Dwarkesh Patel · Dwarkesh Patel · 0 of 60

← Previous Next →

Rubik's Cube Encryption Demo

Rubik's Cube Encryption Demo

Bryan Caplan - Nurturing Orphaned Ideas, Education, and UBI

Bryan Caplan - Nurturing Orphaned Ideas, Education, and UBI

Matjaž Leonardis - Science, Identity and Probability

Matjaž Leonardis - Science, Identity and Probability

Robin Hanson - The Long View and The Elephant in the Brain

Robin Hanson - The Long View and The Elephant in the Brain

Caleb Watney - America's Innovation Engine

Caleb Watney - America's Innovation Engine

Alex Tabarrok - Prizes, Prices, and Public Goods

Alex Tabarrok - Prizes, Prices, and Public Goods

Scott Young - Ultralearning, The MIT Challenge

Scott Young - Ultralearning, The MIT Challenge

Scott Aaronson - Quantum Computing, Complexity, and Creativity

Scott Aaronson - Quantum Computing, Complexity, and Creativity

Uncle Bob - The Long Reach of Code, Automating Programming, and Developing Coding Talent

Uncle Bob - The Long Reach of Code, Automating Programming, and Developing Coding Talent

Michael Huemer - Anarchy, Capitalism, and Progress

Michael Huemer - Anarchy, Capitalism, and Progress

Sarah Fitz-Claridge - Taking Children Seriously | The Lunar Society #15

Sarah Fitz-Claridge - Taking Children Seriously | The Lunar Society #15

Byrne Hobart - Optionality, Stagnation, and Secret Societies

Byrne Hobart - Optionality, Stagnation, and Secret Societies

David Deutsch - AI, America, Fun, & Bayes

David Deutsch - AI, America, Fun, & Bayes

Bryan Caplan - Labor Econ, Poverty, & Mental Illness

Bryan Caplan - Labor Econ, Poverty, & Mental Illness

Jimmy Soni - Peter Thiel, Elon Musk, and the Paypal Mafia

Jimmy Soni - Peter Thiel, Elon Musk, and the Paypal Mafia

Razib Khan - Genomics, Intelligence, and The Church of Science

Razib Khan - Genomics, Intelligence, and The Church of Science

Pradyu Prasad - Imperial Japan, the God Emperor, and Militarization in the Modern World

Pradyu Prasad - Imperial Japan, the God Emperor, and Militarization in the Modern World

Manifold Markets Founder - Predictions Markets & Revolutionizing Governance

Manifold Markets Founder - Predictions Markets & Revolutionizing Governance

Ananyo Bhattacharya - John von Neumann, Jewish Genius, and Nuclear War

Ananyo Bhattacharya - John von Neumann, Jewish Genius, and Nuclear War

Agustin Lebron - Trading, Crypto, and Adverse Selection

Agustin Lebron - Trading, Crypto, and Adverse Selection

Sam Bankman-Fried - Crypto, FTX, Altruism, & Leadership

Sam Bankman-Fried - Crypto, FTX, Altruism, & Leadership

Alexander Mikaberidze - Napoleon, War, Progress, and Global Order

Alexander Mikaberidze - Napoleon, War, Progress, and Global Order

Sam Bankman-Fried On FOCUS

Sam Bankman-Fried On FOCUS

Sam Bankman-Fried on GREAT FOUNDERS

Sam Bankman-Fried on GREAT FOUNDERS

$30 BILLION Opportunity Ignored by Sam Bankman-Fried Competitors

$30 BILLION Opportunity Ignored by Sam Bankman-Fried Competitors

Fin Moorhouse - Longtermism, Space, & Entrepreneurship

Fin Moorhouse - Longtermism, Space, & Entrepreneurship

Joseph Carlsmith - Utopia, AI, & Infinite Ethics

Joseph Carlsmith - Utopia, AI, & Infinite Ethics

Will MacAskill - Longtermism, Effective Altruism, History, & Technology

Will MacAskill - Longtermism, Effective Altruism, History, & Technology

Steve Hsu - Intelligence, Embryo Selection, & The Future of Humanity

Steve Hsu - Intelligence, Embryo Selection, & The Future of Humanity

Austin Vernon - Energy Superabundance, Starship Missiles, & Finding Alpha

Austin Vernon - Energy Superabundance, Starship Missiles, & Finding Alpha

Charles C. Mann - Americas Before Columbus & Scientific Wizardry

Charles C. Mann - Americas Before Columbus & Scientific Wizardry

Tyler Cowen - Why Society Will Collapse & Why Sex is Pessimistic

Tyler Cowen - Why Society Will Collapse & Why Sex is Pessimistic

Bryan Caplan - Feminists, Billionaires, and Demagogues

Bryan Caplan - Feminists, Billionaires, and Demagogues

Brian Potter - Future of Construction, Ugly Modernism, & Environmental Review

Brian Potter - Future of Construction, Ugly Modernism, & Environmental Review

Kenneth T. Jackson - Robert Moses, Hero of New York?

Kenneth T. Jackson - Robert Moses, Hero of New York?

Edward Glaeser - Cities, Terrorism, Housing, & Remote Work

Edward Glaeser - Cities, Terrorism, Housing, & Remote Work

Byrne Hobart - FTX, Drugs, Twitter, Taiwan, & Monasticism

Byrne Hobart - FTX, Drugs, Twitter, Taiwan, & Monasticism

Nadia Asparouhova — Tech elites, democracy, open source, & philanthropy

Nadia Asparouhova — Tech elites, democracy, open source, & philanthropy

Bethany McLean — Enron, FTX, 2008, Musk, frauds, & visionaries

Bethany McLean — Enron, FTX, 2008, Musk, frauds, & visionaries

Holden Karnofsky — History's most important century

Holden Karnofsky — History's most important century

$30m Grant to OpenAI?

$30m Grant to OpenAI?

Does GPT Have Holden Worried?

Does GPT Have Holden Worried?

Lars Doucet — Progress, poverty, Georgism, & why rent is too damn high

Lars Doucet — Progress, poverty, Georgism, & why rent is too damn high

Deep Learning Changes Everything

Deep Learning Changes Everything

Garett Jones — Immigration, national IQ, & less democracy

Garett Jones — Immigration, national IQ, & less democracy

Marc Andreessen — AI, crypto, 1000 Elon Musks, regrets, vulnerabilities, & managerial revolution

Marc Andreessen — AI, crypto, 1000 Elon Musks, regrets, vulnerabilities, & managerial revolution

Why You Shouldn't Start A Startup

Why You Shouldn't Start A Startup

The Future Of Venture Capital

The Future Of Venture Capital

The Crucial Skill For A Startup Founder

The Crucial Skill For A Startup Founder

Brett Harrison — FTX US former president speaks out

Brett Harrison — FTX US former president speaks out

Nat Friedman (Github CEO) — Reading ancient scrolls, open source, & AI

Nat Friedman (Github CEO) — Reading ancient scrolls, open source, & AI

Ilya Sutskever (OpenAI Chief Scientist) — Why next-token prediction could surpass human intelligence

Ilya Sutskever (OpenAI Chief Scientist) — Why next-token prediction could surpass human intelligence

Impact of Taiwan Invasion on AI

Impact of Taiwan Invasion on AI

Reliability is Bottleneck on AI - OpenAI Founder

Reliability is Bottleneck on AI - OpenAI Founder

Next Token Prediction SOLVES AI Says OpenAI Founder

Next Token Prediction SOLVES AI Says OpenAI Founder

Harmful Uses of GPT - OpenAI Founder

Harmful Uses of GPT - OpenAI Founder

Why OpenAI Founder Thinks AI Is Near

Why OpenAI Founder Thinks AI Is Near

AI will help us achieve enlightenment - OpenAI Founder

AI will help us achieve enlightenment - OpenAI Founder

Eliezer Yudkowsky — Why AI will kill us, aligning LLMs, nature of intelligence, SciFi, & rationality

Eliezer Yudkowsky — Why AI will kill us, aligning LLMs, nature of intelligence, SciFi, & rationality

Richard Rhodes — The making of the atomic bomb

Richard Rhodes — The making of the atomic bomb

More on: LLM Foundations

View skill →

Getting Started with Vertex AI Gemini 1.5 Flash

I TRAINED AN AI TO SOLVE 2+2 (w/ Live Coding)

I TRAINED AN AI TO SOLVE 2+2 (w/ Live Coding)

How to use the ChatGPT API with Python!!

How to use the ChatGPT API with Python!!

Nicholas Renotte

Gemini 2.5: Create an interactive plot of economic data

Gemini 2.5: Create an interactive plot of economic data

Google DeepMind

LangChain Chatbots: Building a Personalized AI Assistant

LangChain Chatbots: Building a Personalized AI Assistant

Analytics Vidhya

Auto-generating meeting notes with Python

Auto-generating meeting notes with Python

Related AI Lessons

The ABCs of reading medical research and review papers these days

Learn to critically evaluate medical research papers by accepting nothing at face value, believing no one blindly, and checking everything

#1 DevLog Meta-research: I Got Tired of Tab Chaos While Reading Research Papers.

Learn to manage research paper tabs efficiently and apply meta-research techniques to improve productivity

How to Set Up a Karpathy-Style Wiki for Your Research Field

Learn to set up a Karpathy-style wiki for your research field to organize and share knowledge effectively

The Non-Optimality of Scientific Knowledge: Path Dependence, Lock-In, and The Local Minimum Trap

Scientific knowledge may be stuck in a local minimum, hindering optimal progress, and understanding this concept is crucial for advancing research

Chapters (10)

Long contexts

17:04 Intelligence is just associations

33:27 Intelligence explosion & great researchers

1:07:44 Superposition & secret communication

1:23:26 Agents & true reasoning

1:35:32 How Sholto & Trenton got into AI research

2:08:08 Are feature spaces the wrong way to think about intelligence?

2:22:04 Will interp actually work on superhuman models

2:45:57 Sholto's technical challenge for the audience

3:04:49 Rapid fire

Generating novel scientific hypotheses with Co-Scientist

Google DeepMind