Pong AI with Policy Gradients

Andrej Karpathy · Beginner ·📰 AI News & Updates ·9y ago

Trained for ~8000 episodes, each episode = ~30 games. Updates were done in batches of 10 episodes, so ~800 updates total. Policy network is a 2-layer neural net connected to raw pixels, with 200 hidden units. Trained with RMSProp and learning rate 1e-4. The final agent does not beat the hard-coded AI consistently, but holds its own. Should be trained longer, with ConvNets, and on GPU. This is ATARI 2600 Pong version, using OpenAI Gym.

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Andrej Karpathy · Andrej Karpathy · 19 of 19

← Previous Next →

Large-scale Video Classification with Convolutional Neural Networks, CVPR 2014

Large-scale Video Classification with Convolutional Neural Networks, CVPR 2014

Andrej Karpathy

ConvNet forward pass demo

ConvNet forward pass demo

Andrej Karpathy

CS231n Winter 2016: Lecture1: Introduction and Historical Context

CS231n Winter 2016: Lecture1: Introduction and Historical Context

Andrej Karpathy

CS231n Winter 2016: Lecture 2: Data-driven approach, kNN, Linear Classification 1

CS231n Winter 2016: Lecture 2: Data-driven approach, kNN, Linear Classification 1

Andrej Karpathy

CS231n Winter 2016: Lecture 3: Linear Classification 2, Optimization

CS231n Winter 2016: Lecture 3: Linear Classification 2, Optimization

Andrej Karpathy

CS231n Winter 2016: Lecture 4: Backpropagation, Neural Networks 1

CS231n Winter 2016: Lecture 4: Backpropagation, Neural Networks 1

Andrej Karpathy

CS231n Winter 2016: Lecture 5: Neural Networks Part 2

CS231n Winter 2016: Lecture 5: Neural Networks Part 2

Andrej Karpathy

CS231n Winter 2016: Lecture 6: Neural Networks Part 3 / Intro to ConvNets

CS231n Winter 2016: Lecture 6: Neural Networks Part 3 / Intro to ConvNets

Andrej Karpathy

CS231n Winter 2016: Lecture 7: Convolutional Neural Networks

CS231n Winter 2016: Lecture 7: Convolutional Neural Networks

Andrej Karpathy

CS231n Winter 2016: Lecture 8: Localization and Detection

CS231n Winter 2016: Lecture 8: Localization and Detection

Andrej Karpathy

CS231n Winter 2016: Lecture 9: Visualization, Deep Dream, Neural Style, Adversarial Examples

CS231n Winter 2016: Lecture 9: Visualization, Deep Dream, Neural Style, Adversarial Examples

Andrej Karpathy

CS231n Winter 2016: Lecture 10: Recurrent Neural Networks, Image Captioning, LSTM

CS231n Winter 2016: Lecture 10: Recurrent Neural Networks, Image Captioning, LSTM

Andrej Karpathy

CS231n Winter 2016: Lecture 11: ConvNets in practice

CS231n Winter 2016: Lecture 11: ConvNets in practice

Andrej Karpathy

CS231n Winter 2016: Lecture 12: Deep Learning libraries

CS231n Winter 2016: Lecture 12: Deep Learning libraries

Andrej Karpathy

CS231n Winter 2016: Lecture 13: Segmentation, soft attention, spatial transformers

CS231n Winter 2016: Lecture 13: Segmentation, soft attention, spatial transformers

Andrej Karpathy

CS231n Winter 2016: Lecture 14: Videos and Unsupervised Learning

CS231n Winter 2016: Lecture 14: Videos and Unsupervised Learning

Andrej Karpathy

CS231n Winter 2016: Lecture 15: Invited Talk by Jeff Dean

CS231n Winter 2016: Lecture 15: Invited Talk by Jeff Dean

Andrej Karpathy

Introducing arxiv-sanity

Introducing arxiv-sanity

Andrej Karpathy

Pong AI with Policy Gradients

Pong AI with Policy Gradients

Andrej Karpathy

Related AI Lessons

Only 1 in 50 AI Projects Delivers Real Value — Here’s How to Be in That 2%

Only 2% of AI projects deliver real value, learn how to be in that 2% by focusing on transformative returns

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Big Tech firms are investing heavily in AI, driving growth and transformation, while prioritizing safety and responsible adoption

From Ericsson intern to AI consultant: How this Cameroonian engineer built a career around data

Learn how Jehpte Ioudom, a Cameroonian AI engineer, built a career around data and AI, and gain insights into his journey from intern to consultant

Techpoint Africa

From a Cold War Spy Bug to My Drawing

Learn how military tech evolved into everyday AI, and the double-edged sword of innovation, to understand the complexities of AI development and its applications

Medium · Deep Learning

Elon Musk Just Supercharged Claude

Mayank Aggarwal