DQN Breakout

Google DeepMind · Advanced · 🧠 Large Language Models · 10y ago

Skills: RL Foundations (85%)

Key Takeaways

Demonstrates how DQN's performance on the game Breakout improves over training, shown after 100, 200, 400 and 600 episodes

Original Description

This video illustrates the improvement in the performance of DQN over training (i.e. after 100, 200, 400 and 600 episodes). After 600 episodes DQN finds and exploits the optimal strategy in this game, which is to make a tunnel around the side, and then allow the ball to hit blocks by bouncing behind the wall. Note: the score is displayed at the top left of the screen (maximum for clearing one screen is 448 points), number of lives remaining is shown in the middle (starting with 5 lives), and the “1” on the top right indicates this is a 1-player game.

Playlist

Uploads from Google DeepMind · Google DeepMind · 22 of 60

RL Course by David Silver - Lecture 8: Integrating Learning and Planning

RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning

RL Course by David Silver - Lecture 2: Markov Decision Process

RL Course by David Silver - Lecture 5: Model Free Control

RL Course by David Silver - Lecture 6: Value Function Approximation

RL Course by David Silver - Lecture 4: Model-Free Prediction

RL Course by David Silver - Lecture 3: Planning by Dynamic Programming

RL Course by David Silver - Lecture 10: Classic Games

RL Course by David Silver - Lecture 7: Policy Gradient Methods

Google DeepMind: Ground-breaking AlphaGo masters the game of Go

Match 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 2 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 1 15 min Summary - Google DeepMind Challenge Match

Match 3 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 2 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 3 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 4 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 4 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 5 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 5 15 Minute Summary - Google DeepMind Challenge Match 2016

DQN SPACE INVADERS

Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

Asynchronous Methods for Deep Reinforcement Learning: MuJoCo

Asynchronous Methods for Deep Reinforcement Learning: TORCS

Differentiable neural computer family tree inference task

StarCraft II DeepMind feature layer API

DeepMind Health – Partnership with the Royal Free London NHS Foundation Trust

DeepMind Health – Michael Wise – a patient's journey

Streams – a platform for a digital NHS

DeepMind Lab - Nav Maze Level 1

DeepMind Lab - Stairway to Melon Level

DeepMind Lab - Laser Tag Space Bounce Level (Hard)

Exploring the mysteries of Go with AlphaGo and China's top players

Demis Hassabis on AlphaGo: its legacy and the 'Future of Go Summit' in Wuzhen, China

The Future of Go Summit: AlphaGo & Ke Jie match 1 moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 2 moves analysis

The Future of Go Summit: Pair Go moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 3 moves analysis

Emergence of Locomotion Behaviours in Rich Environments

StarCraft II 'mini games' for AI research

Trained and untrained agents play StarCraft II full 1vs1 game

DeepMind open source PySC2 toolset for Starcraft II

ICML 2017: Test of Time Award (Sylvain Gelly & David Silver)

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 3rd AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 1st AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 2nd AlphaGo vs Ke Jie game

AlphaGo Zero: Discovering new knowledge

AlphaGo Zero: Starting from scratch

Defining principles for tech companies in the NHS: DeepMind Health's Collaborative Listening Summit

A systems neuroscience approach to building AGI - Demis Hassabis, Singularity Summit 2010

Retour de Rémi Munos en France et ouverture de DeepMind Paris

Grid cells - Caswell Barry, UCL

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: What our research shows

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: A Patient's Story

Deep Learning 3: Neural Networks Foundations

Deep Learning 5: Optimization for Machine Learning

Deep Learning 8: Unsupervised learning and generative models

Reinforcement Learning 1: Introduction to Reinforcement Learning

Deep Learning 2: Introduction to TensorFlow

More on: RL Foundations

Build a Doom AI Model with Python | Gaming Reinforcement Learning Full Course · Nicholas Renotte

Deep Reinforcement Learning for Atari Games Python Tutorial | AI Plays Space Invaders · Nicholas Renotte

Training & Testing Deep reinforcement learning (DQN) Agent - Reinforcement Learning p.6

Build a Game Bot (LIVE)

How to Win Slot Machines - Intro to Deep Learning #13

Build an Mario AI Model with Python | Gaming Reinforcement Learning · Nicholas Renotte
