ChatGPT O1 Explained

Siraj Raval · Beginner ·🧠 Large Language Models ·1y ago
I reverse-engineered OpenAI's O1-Preview model using O1-Preview! I asked it to generate the full research paper with code and I gave it dozens of related research papers from the past few years as context. It recreated a working version of the O1 model to the best of it's ability and In this video, we'll go over all the details of the model, the code, and the research techniques that make the O1 model series state of art across so many benchmarks. LETS SPREAD THIS AI POWER, I can't wait to see what you think, enjoy! Code & paper for this video: https://github.com/llsourcell/o1-nano/ Deploy y…
Watch on YouTube ↗ (saves to browser)

Chapters (16)

Introduction: Reproducing OpenAI's o1 Model Series
1:30 Generating a Research Paper Using o1 Preview
2:30 Overview of 'o1-nano': An Open Source, Explainable Model
3:30 Understanding Chain-of-Thought Reasoning in o1 Models
4:30 How Reinforcement Learning is Used in Training and Inference
5:30 Exploring Reasoning Paths and Subtasks During Inference
6:30 Unpacking OpenAI's Reasoning Tokens
7:30 Overview of the Model Architecture
8:30 Core Components: Transformer, Chain-of-Thought Module, Reasoning Token Generat
9:30 Training the Model to Reason Better Using Reinforcement Learning
13:30 Historical Papers Leading to o1: Chain-of-Thought and 'Let's Verify Step by St
15:30 The New Scaling Law: Inference Time Scaling
16:30 The Usage of of Reinforcement Learning
17:30 Demo of the Code: Running the Test
18:30 Conclusion: Open Source Code and Research Paper as a Starting Point
19:00 Closing Remarks and Encouragement to Explore the GitHub Repository

Playlist

Uploads from Siraj Raval · Siraj Raval · 0 of 60

← Previous Next →
1 What is Bitcoin?
What is Bitcoin?
Siraj Raval
2 5 Ways to Use Bitcoin
5 Ways to Use Bitcoin
Siraj Raval
3 BTC Fever - Siraj [Music Video]
BTC Fever - Siraj [Music Video]
Siraj Raval
4 5 Reasons to Build Decentralized Apps
5 Reasons to Build Decentralized Apps
Siraj Raval
5 The Interplanetary File System
The Interplanetary File System
Siraj Raval
6 How to Build a Dapp in 3 min
How to Build a Dapp in 3 min
Siraj Raval
7 Life Before Smartphones
Life Before Smartphones
Siraj Raval
8 4 Ways to Use Smart Contracts
4 Ways to Use Smart Contracts
Siraj Raval
9 3 Dapps You HAVE to See
3 Dapps You HAVE to See
Siraj Raval
10 Char's Life as a BitTorrent Engineer
Char's Life as a BitTorrent Engineer
Siraj Raval
11 Gears - Siraj [Music Video]
Gears - Siraj [Music Video]
Siraj Raval
12 4 Reasons AlphaGo is a Huge Deal
4 Reasons AlphaGo is a Huge Deal
Siraj Raval
13 Build a Neural Net in 4 Minutes
Build a Neural Net in 4 Minutes
Siraj Raval
14 Sentiment Analysis in 4 Minutes
Sentiment Analysis in 4 Minutes
Siraj Raval
15 The Hackathon Life
The Hackathon Life
Siraj Raval
16 Your First ML App - Machine Learning for Hackers #1
Your First ML App - Machine Learning for Hackers #1
Siraj Raval
17 Build an AI Composer - Machine Learning for Hackers #2
Build an AI Composer - Machine Learning for Hackers #2
Siraj Raval
18 Build a Game AI - Machine Learning for Hackers #3
Build a Game AI - Machine Learning for Hackers #3
Siraj Raval
19 Build a Movie Recommender - Machine Learning for Hackers #4
Build a Movie Recommender - Machine Learning for Hackers #4
Siraj Raval
20 Build an AI Artist - Machine Learning for Hackers #5
Build an AI Artist - Machine Learning for Hackers #5
Siraj Raval
21 Build a Chatbot - ML for Hackers #6
Build a Chatbot - ML for Hackers #6
Siraj Raval
22 Build an AI Reader - Machine Learning for Hackers #7
Build an AI Reader - Machine Learning for Hackers #7
Siraj Raval
23 Build an AI Writer - Machine Learning for Hackers #8
Build an AI Writer - Machine Learning for Hackers #8
Siraj Raval
24 Build a Chatbot w/ an API - ML for Hackers #9
Build a Chatbot w/ an API - ML for Hackers #9
Siraj Raval
25 One-Shot Learning - Fresh Machine Learning #1
One-Shot Learning - Fresh Machine Learning #1
Siraj Raval
26 Generative Adversarial Nets - Fresh Machine Learning #2
Generative Adversarial Nets - Fresh Machine Learning #2
Siraj Raval
27 Tone Analysis - Fresh Machine Learning #3
Tone Analysis - Fresh Machine Learning #3
Siraj Raval
28 Generate Rap Lyrics - Fresh Machine Learning #4
Generate Rap Lyrics - Fresh Machine Learning #4
Siraj Raval
29 Build an Autoencoder in 5 Min - Fresh Machine Learning #5
Build an Autoencoder in 5 Min - Fresh Machine Learning #5
Siraj Raval
30 Build a Self Driving Car in 5 Min - Fresh Machine Learning #6
Build a Self Driving Car in 5 Min - Fresh Machine Learning #6
Siraj Raval
31 Build an Antivirus in 5 Min - Fresh Machine Learning #7
Build an Antivirus in 5 Min - Fresh Machine Learning #7
Siraj Raval
32 TensorFlow in 5 Minutes (tutorial)
TensorFlow in 5 Minutes (tutorial)
Siraj Raval
33 Build a Recurrent Neural Net in 5 Min
Build a Recurrent Neural Net in 5 Min
Siraj Raval
34 Build a Simulation in 5 Min
Build a Simulation in 5 Min
Siraj Raval
35 Build a TensorFlow Image Classifier in 5 Min
Build a TensorFlow Image Classifier in 5 Min
Siraj Raval
36 Tensorboard Explained in 5 Min
Tensorboard Explained in 5 Min
Siraj Raval
37 Generate Music in TensorFlow
Generate Music in TensorFlow
Siraj Raval
38 Build a Game Bot (LIVE)
Build a Game Bot (LIVE)
Siraj Raval
39 Deep Learning Frameworks Compared
Deep Learning Frameworks Compared
Siraj Raval
40 Introduction - Learn Python for Data Science #1
Introduction - Learn Python for Data Science #1
Siraj Raval
41 Build a Neural Network (LIVE)
Build a Neural Network (LIVE)
Siraj Raval
42 Twitter Sentiment Analysis - Learn Python for Data Science #2
Twitter Sentiment Analysis - Learn Python for Data Science #2
Siraj Raval
43 Recommendation Systems - Learn Python for Data Science #3
Recommendation Systems - Learn Python for Data Science #3
Siraj Raval
44 Predicting Stock Prices - Learn Python for Data Science #4
Predicting Stock Prices - Learn Python for Data Science #4
Siraj Raval
45 Pong Neural Network (LIVE)
Pong Neural Network (LIVE)
Siraj Raval
46 Deep Dream in TensorFlow - Learn Python for Data Science #5
Deep Dream in TensorFlow - Learn Python for Data Science #5
Siraj Raval
47 Visualizing Data with D3.js (LIVE)
Visualizing Data with D3.js (LIVE)
Siraj Raval
48 Genetic Algorithms - Learn Python for Data Science #6
Genetic Algorithms - Learn Python for Data Science #6
Siraj Raval
49 Enter Siraj [Music Video]
Enter Siraj [Music Video]
Siraj Raval
50 Build a Web Scraper (LIVE)
Build a Web Scraper (LIVE)
Siraj Raval
51 Why is P vs NP Important?
Why is P vs NP Important?
Siraj Raval
52 How to Make a Neural Network (LIVE)
How to Make a Neural Network (LIVE)
Siraj Raval
53 How to Make an Amazing Tensorflow Chatbot Easily
How to Make an Amazing Tensorflow Chatbot Easily
Siraj Raval
54 How to Make an Amazing Video Game Bot Easily
How to Make an Amazing Video Game Bot Easily
Siraj Raval
55 How to Make a Tensorflow Neural Network (LIVE)
How to Make a Tensorflow Neural Network (LIVE)
Siraj Raval
56 How to Make a Simple Tensorflow Speech Recognizer
How to Make a Simple Tensorflow Speech Recognizer
Siraj Raval
57 Joel Shor - Really Quick Questions with an Awesome Google Engineer
Joel Shor - Really Quick Questions with an Awesome Google Engineer
Siraj Raval
58 How to Make a Path Planning Algorithm Easily (LIVE)
How to Make a Path Planning Algorithm Easily (LIVE)
Siraj Raval
59 The Best Way to Prepare a Dataset Easily
The Best Way to Prepare a Dataset Easily
Siraj Raval
60 Catherine Olsson - Really Quick Questions with an OpenAI Engineer
Catherine Olsson - Really Quick Questions with an OpenAI Engineer
Siraj Raval
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)