Why Every Skyrim AI Becomes a Stealth Archer
I spent 500 hours training 3 AIs on different Skyrim builds. Warrior, Mage, and Thief, each with completely different optimization functions. By hour 200, all three had independently become stealth archers. The meme is math.
💻 Full code & models: https://github.com/llSourcell/Skryim_Reinforcement_Learning_AI/
🔗 Neo Browser (Early Access):
https://neobrowser.ai/
This isn't just about Skyrim, it's about emergent behavior in complex systems. Three independent AIs, three different optimization pressures, one inevitable outcome.
In the final boss gauntlet:
- Warrior: 8 minutes, died to Dragon Priest
- Mage: 22 minutes, died to Dragon
- Thief (Stealth Archer): 11 minutes, defeated all three
The math is undeniable. Every Skyrim player who becomes a stealth archer isn't unoriginal, they're rational. You're not lazy. You're optimizing. And now we have proof.
⏱️ TIMESTAMPS:
0:00 - The Convergence
0:53 - Why I Did This Experiment
2:38 - Meet The Three AIs
3:52 - Day 1: Pure Chaos
4:45 - Hour 100: Three Distinct Builds
5:45 - Neo Browser (How I Built This)
6:41 - Hour 127: The Turning Point
8:08 - The Intervention Experiment
8:54 - Final Boss Gauntlet
9:41 - The Mathematical Proof
10:49 - Full Code & Outro
🔬 TECHNICAL DETAILS:
- Algorithm: PPO (Proximal Policy Optimization)
- Training: 500 hours per AI (1,500 total)
- Compute Used: (Lambda Labs GPU cluster)
- Framework: OpenAI Gym + custom Skyrim environment
📊 CONVERGENCE TIMELINE:
Warrior → Stealth Archer (Hour 151)
Mage → Stealth Archer (Hour 160)
Thief → Stealth Archer (Hour 168)
Full convergence by Hour 200
All three abandoned their builds. Stealth archery isn't a meme, it's a Nash equilibrium. It's game theory. It's mathematics.
ABOUT THIS EXPERIMENT I'm Siraj, an AI researcher exploring what happens when you give reinforcement learning agents too much compute and drop them into video games. Currently balancing paternity leave with breaking Skyrim using math.
If you somehow train an AI that doesn't become
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from Siraj Raval · Siraj Raval · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
What is Bitcoin?
Siraj Raval
5 Ways to Use Bitcoin
Siraj Raval
BTC Fever - Siraj [Music Video]
Siraj Raval
5 Reasons to Build Decentralized Apps
Siraj Raval
The Interplanetary File System
Siraj Raval
How to Build a Dapp in 3 min
Siraj Raval
Life Before Smartphones
Siraj Raval
4 Ways to Use Smart Contracts
Siraj Raval
3 Dapps You HAVE to See
Siraj Raval
Char's Life as a BitTorrent Engineer
Siraj Raval
4 Reasons AlphaGo is a Huge Deal
Siraj Raval
Build a Neural Net in 4 Minutes
Siraj Raval
Sentiment Analysis in 4 Minutes
Siraj Raval
The Hackathon Life
Siraj Raval
Your First ML App - Machine Learning for Hackers #1
Siraj Raval
Build an AI Composer - Machine Learning for Hackers #2
Siraj Raval
Build a Game AI - Machine Learning for Hackers #3
Siraj Raval
Build a Movie Recommender - Machine Learning for Hackers #4
Siraj Raval
Build an AI Artist - Machine Learning for Hackers #5
Siraj Raval
Build a Chatbot - ML for Hackers #6
Siraj Raval
Build an AI Reader - Machine Learning for Hackers #7
Siraj Raval
Build an AI Writer - Machine Learning for Hackers #8
Siraj Raval
Build a Chatbot w/ an API - ML for Hackers #9
Siraj Raval
One-Shot Learning - Fresh Machine Learning #1
Siraj Raval
Generative Adversarial Nets - Fresh Machine Learning #2
Siraj Raval
Tone Analysis - Fresh Machine Learning #3
Siraj Raval
Generate Rap Lyrics - Fresh Machine Learning #4
Siraj Raval
Build an Autoencoder in 5 Min - Fresh Machine Learning #5
Siraj Raval
Build a Self Driving Car in 5 Min - Fresh Machine Learning #6
Siraj Raval
Build an Antivirus in 5 Min - Fresh Machine Learning #7
Siraj Raval
TensorFlow in 5 Minutes (tutorial)
Siraj Raval
Build a Recurrent Neural Net in 5 Min
Siraj Raval
Build a Simulation in 5 Min
Siraj Raval
Build a TensorFlow Image Classifier in 5 Min
Siraj Raval
Tensorboard Explained in 5 Min
Siraj Raval
Generate Music in TensorFlow
Siraj Raval
Build a Game Bot (LIVE)
Siraj Raval
Deep Learning Frameworks Compared
Siraj Raval
Introduction - Learn Python for Data Science #1
Siraj Raval
Build a Neural Network (LIVE)
Siraj Raval
Twitter Sentiment Analysis - Learn Python for Data Science #2
Siraj Raval
Recommendation Systems - Learn Python for Data Science #3
Siraj Raval
Predicting Stock Prices - Learn Python for Data Science #4
Siraj Raval
Pong Neural Network (LIVE)
Siraj Raval
Deep Dream in TensorFlow - Learn Python for Data Science #5
Siraj Raval
Visualizing Data with D3.js (LIVE)
Siraj Raval
Genetic Algorithms - Learn Python for Data Science #6
Siraj Raval
Enter Siraj [Music Video]
Siraj Raval
Build a Web Scraper (LIVE)
Siraj Raval
Why is P vs NP Important?
Siraj Raval
How to Make a Neural Network (LIVE)
Siraj Raval
How to Make an Amazing Tensorflow Chatbot Easily
Siraj Raval
How to Make an Amazing Video Game Bot Easily
Siraj Raval
How to Make a Tensorflow Neural Network (LIVE)
Siraj Raval
How to Make a Simple Tensorflow Speech Recognizer
Siraj Raval
Joel Shor - Really Quick Questions with an Awesome Google Engineer
Siraj Raval
How to Make a Path Planning Algorithm Easily (LIVE)
Siraj Raval
The Best Way to Prepare a Dataset Easily
Siraj Raval
Catherine Olsson - Really Quick Questions with an OpenAI Engineer
Siraj Raval
How to Make a Tic Tac Toe Neural Network Easily (LIVE)
Siraj Raval
More on: Agent Foundations
View skill →Related AI Lessons
Chapters (11)
The Convergence
0:53
Why I Did This Experiment
2:38
Meet The Three AIs
3:52
Day 1: Pure Chaos
4:45
Hour 100: Three Distinct Builds
5:45
Neo Browser (How I Built This)
6:41
Hour 127: The Turning Point
8:08
The Intervention Experiment
8:54
Final Boss Gauntlet
9:41
The Mathematical Proof
10:49
Full Code & Outro
🎓
Tutor Explanation
DeepCamp AI