Pong AI with Policy Gradients
Trained for ~8000 episodes, each episode = ~30 games. Updates were done in batches of 10 episodes, so ~800 updates total. Policy network is a 2-layer neural net connected to raw pixels, with 200 hidden units. Trained with RMSProp and learning rate 1e-4. The final agent does not beat the hard-coded AI consistently, but holds its own. Should be trained longer, with ConvNets, and on GPU.
This is ATARI 2600 Pong version, using OpenAI Gym.
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from Andrej Karpathy · Andrej Karpathy · 19 of 19
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
▶
Large-scale Video Classification with Convolutional Neural Networks, CVPR 2014
Andrej Karpathy
ConvNet forward pass demo
Andrej Karpathy
CS231n Winter 2016: Lecture1: Introduction and Historical Context
Andrej Karpathy
CS231n Winter 2016: Lecture 2: Data-driven approach, kNN, Linear Classification 1
Andrej Karpathy
CS231n Winter 2016: Lecture 3: Linear Classification 2, Optimization
Andrej Karpathy
CS231n Winter 2016: Lecture 4: Backpropagation, Neural Networks 1
Andrej Karpathy
CS231n Winter 2016: Lecture 5: Neural Networks Part 2
Andrej Karpathy
CS231n Winter 2016: Lecture 6: Neural Networks Part 3 / Intro to ConvNets
Andrej Karpathy
CS231n Winter 2016: Lecture 7: Convolutional Neural Networks
Andrej Karpathy
CS231n Winter 2016: Lecture 8: Localization and Detection
Andrej Karpathy
CS231n Winter 2016: Lecture 9: Visualization, Deep Dream, Neural Style, Adversarial Examples
Andrej Karpathy
CS231n Winter 2016: Lecture 10: Recurrent Neural Networks, Image Captioning, LSTM
Andrej Karpathy
CS231n Winter 2016: Lecture 11: ConvNets in practice
Andrej Karpathy
CS231n Winter 2016: Lecture 12: Deep Learning libraries
Andrej Karpathy
CS231n Winter 2016: Lecture 13: Segmentation, soft attention, spatial transformers
Andrej Karpathy
CS231n Winter 2016: Lecture 14: Videos and Unsupervised Learning
Andrej Karpathy
CS231n Winter 2016: Lecture 15: Invited Talk by Jeff Dean
Andrej Karpathy
Introducing arxiv-sanity
Andrej Karpathy
Pong AI with Policy Gradients
Andrej Karpathy
Related AI Lessons
⚡
⚡
⚡
⚡
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to AI
GM, Ford, and Stellantis have cut 20,000 white-collar jobs. AI is about to accelerate the trend.
The Next Web AI
The 3 Tests That Tell You If Your Job Is AI-Proof
Medium · AI
I Asked AI to Predict My Death Age — And It Scared Me
Medium · AI
🎓
Tutor Explanation
DeepCamp AI