Sholto Douglas & Trenton Bricken — How LLMs actually think
Skills:
LLM Foundations90%
Had so much fun chatting with my good friends Trenton Bricken and Sholto Douglas on the podcast. No way to summarize it, except:
* This is the best context dump out there on how LLMs are trained, what capabilities they're likely to soon have, and what exactly is going on inside them.
* You would be shocked how much of what I know about this field, I've learned just from talking with them.
* To the extent that you've enjoyed my other AI interviews, now you know why.
There's a transcript with links to all the papers the boys were throwing down - may help you follow along.
𝐄𝐏𝐈𝐒𝐎𝐃𝐄 𝐋𝐈𝐍𝐊𝐒
* Transcript: https://www.dwarkeshpatel.com/p/sholto-douglas-trenton-bricken
* Spotify: https://open.spotify.com/episode/2dtDauiE4v8ldNRqPFq0uP?si=7S4n69QuTjeYz0lZwW4xIw
* Apple Podcasts: https://podcasts.apple.com/us/podcast/sholto-douglas-trenton-bricken-how-to-build-understand/id1516093381?i=1000650748087
* Trenton Bricken's twitter: https://twitter.com/TrentonBricken
* Sholto Douglas's twitter: https://twitter.com/_sholtodouglas
𝐓𝐈𝐌𝐄𝐒𝐓𝐀𝐌𝐏𝐒
00:00:00 - Long contexts
00:17:04 - Intelligence is just associations
00:33:27 - Intelligence explosion & great researchers
01:07:44 - Superposition & secret communication
01:23:26 - Agents & true reasoning
01:35:32 - How Sholto & Trenton got into AI research
02:08:08 - Are feature spaces the wrong way to think about intelligence?
02:22:04 - Will interp actually work on superhuman models
02:45:57 - Sholto's technical challenge for the audience
03:04:49 - Rapid fire
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from Dwarkesh Patel · Dwarkesh Patel · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Rubik's Cube Encryption Demo
Dwarkesh Patel
Bryan Caplan - Nurturing Orphaned Ideas, Education, and UBI
Dwarkesh Patel
Matjaž Leonardis - Science, Identity and Probability
Dwarkesh Patel
Robin Hanson - The Long View and The Elephant in the Brain
Dwarkesh Patel
Caleb Watney - America's Innovation Engine
Dwarkesh Patel
Alex Tabarrok - Prizes, Prices, and Public Goods
Dwarkesh Patel
Scott Young - Ultralearning, The MIT Challenge
Dwarkesh Patel
Scott Aaronson - Quantum Computing, Complexity, and Creativity
Dwarkesh Patel
Uncle Bob - The Long Reach of Code, Automating Programming, and Developing Coding Talent
Dwarkesh Patel
Michael Huemer - Anarchy, Capitalism, and Progress
Dwarkesh Patel
Sarah Fitz-Claridge - Taking Children Seriously | The Lunar Society #15
Dwarkesh Patel
Byrne Hobart - Optionality, Stagnation, and Secret Societies
Dwarkesh Patel
David Deutsch - AI, America, Fun, & Bayes
Dwarkesh Patel
Bryan Caplan - Labor Econ, Poverty, & Mental Illness
Dwarkesh Patel
Jimmy Soni - Peter Thiel, Elon Musk, and the Paypal Mafia
Dwarkesh Patel
Razib Khan - Genomics, Intelligence, and The Church of Science
Dwarkesh Patel
Pradyu Prasad - Imperial Japan, the God Emperor, and Militarization in the Modern World
Dwarkesh Patel
Manifold Markets Founder - Predictions Markets & Revolutionizing Governance
Dwarkesh Patel
Ananyo Bhattacharya - John von Neumann, Jewish Genius, and Nuclear War
Dwarkesh Patel
Agustin Lebron - Trading, Crypto, and Adverse Selection
Dwarkesh Patel
Sam Bankman-Fried - Crypto, FTX, Altruism, & Leadership
Dwarkesh Patel
Alexander Mikaberidze - Napoleon, War, Progress, and Global Order
Dwarkesh Patel
Sam Bankman-Fried On FOCUS
Dwarkesh Patel
Sam Bankman-Fried on GREAT FOUNDERS
Dwarkesh Patel
$30 BILLION Opportunity Ignored by Sam Bankman-Fried Competitors
Dwarkesh Patel
Fin Moorhouse - Longtermism, Space, & Entrepreneurship
Dwarkesh Patel
Joseph Carlsmith - Utopia, AI, & Infinite Ethics
Dwarkesh Patel
Will MacAskill - Longtermism, Effective Altruism, History, & Technology
Dwarkesh Patel
Steve Hsu - Intelligence, Embryo Selection, & The Future of Humanity
Dwarkesh Patel
Austin Vernon - Energy Superabundance, Starship Missiles, & Finding Alpha
Dwarkesh Patel
Charles C. Mann - Americas Before Columbus & Scientific Wizardry
Dwarkesh Patel
Tyler Cowen - Why Society Will Collapse & Why Sex is Pessimistic
Dwarkesh Patel
Bryan Caplan - Feminists, Billionaires, and Demagogues
Dwarkesh Patel
Brian Potter - Future of Construction, Ugly Modernism, & Environmental Review
Dwarkesh Patel
Kenneth T. Jackson - Robert Moses, Hero of New York?
Dwarkesh Patel
Edward Glaeser - Cities, Terrorism, Housing, & Remote Work
Dwarkesh Patel
Byrne Hobart - FTX, Drugs, Twitter, Taiwan, & Monasticism
Dwarkesh Patel
Nadia Asparouhova — Tech elites, democracy, open source, & philanthropy
Dwarkesh Patel
Bethany McLean — Enron, FTX, 2008, Musk, frauds, & visionaries
Dwarkesh Patel
Holden Karnofsky — History's most important century
Dwarkesh Patel
$30m Grant to OpenAI?
Dwarkesh Patel
Does GPT Have Holden Worried?
Dwarkesh Patel
Lars Doucet — Progress, poverty, Georgism, & why rent is too damn high
Dwarkesh Patel
Deep Learning Changes Everything
Dwarkesh Patel
Garett Jones — Immigration, national IQ, & less democracy
Dwarkesh Patel
Marc Andreessen — AI, crypto, 1000 Elon Musks, regrets, vulnerabilities, & managerial revolution
Dwarkesh Patel
Why You Shouldn't Start A Startup
Dwarkesh Patel
The Future Of Venture Capital
Dwarkesh Patel
The Crucial Skill For A Startup Founder
Dwarkesh Patel
Brett Harrison — FTX US former president speaks out
Dwarkesh Patel
Nat Friedman (Github CEO) — Reading ancient scrolls, open source, & AI
Dwarkesh Patel
Ilya Sutskever (OpenAI Chief Scientist) — Why next-token prediction could surpass human intelligence
Dwarkesh Patel
Impact of Taiwan Invasion on AI
Dwarkesh Patel
Reliability is Bottleneck on AI - OpenAI Founder
Dwarkesh Patel
Next Token Prediction SOLVES AI Says OpenAI Founder
Dwarkesh Patel
Harmful Uses of GPT - OpenAI Founder
Dwarkesh Patel
Why OpenAI Founder Thinks AI Is Near
Dwarkesh Patel
AI will help us achieve enlightenment - OpenAI Founder
Dwarkesh Patel
Eliezer Yudkowsky — Why AI will kill us, aligning LLMs, nature of intelligence, SciFi, & rationality
Dwarkesh Patel
Richard Rhodes — The making of the atomic bomb
Dwarkesh Patel
More on: LLM Foundations
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
The ABCs of reading medical research and review papers these days
Medium · LLM
#1 DevLog Meta-research: I Got Tired of Tab Chaos While Reading Research Papers.
Dev.to AI
How to Set Up a Karpathy-Style Wiki for Your Research Field
Medium · AI
The Non-Optimality of Scientific Knowledge: Path Dependence, Lock-In, and The Local Minimum Trap
ArXiv cs.AI
Chapters (10)
Long contexts
17:04
Intelligence is just associations
33:27
Intelligence explosion & great researchers
1:07:44
Superposition & secret communication
1:23:26
Agents & true reasoning
1:35:32
How Sholto & Trenton got into AI research
2:08:08
Are feature spaces the wrong way to think about intelligence?
2:22:04
Will interp actually work on superhuman models
2:45:57
Sholto's technical challenge for the audience
3:04:49
Rapid fire
🎓
Tutor Explanation
DeepCamp AI