How to Make an Amazing Video Game Bot Easily

Siraj Raval · Beginner ·📰 AI News & Updates ·9y ago

Skills: Agent Foundations80%LLM Engineering70%LLM Foundations60%

Key Takeaways

The video demonstrates how to create a video game bot using OpenAI's Universe platform and reinforcement learning, with tools such as Deep learning, Deep Mind, and TensorFlow.

Full Transcript

yes beat you again sucker one day I'm going to make a bot that beats you in any game keep telling yourself that SJ hello world it's SJ and let's make an amazing video game bot in just 10 lines of code that can play a huge variety of games video games have been around since the 50s when Joseph Kates publicly demoed tic-tac-toe at the Canadian national exhibition that bought you simple scripted actions that ran the same way every time regardless of whatever move the player made his demo got people hyped though because no one had ever seen a computer play a game before and they were lining up off the block to check it out the game Bots that were invented afterwards for games like Nim and space War were similar but Along Came pully I mean pong the pong Bots paddle had to make decisions based on the human player's actions and that made it feel more realistic pong marked the beginning of using cisic to create game Bots heuristics are educated guesses and pretty much every single video game bot since pongs has used them a bot will map out a possible set of decisions as a tree of possibilities then use one of many techniques to pick the best one but as cool as that sounds it still always boiled down to a bunch of if then statements if Pac-Man moves this way then the blue go should move this way if Master Chief sees a grunt then it should run in circles like my Facebook Newsfeed if Captain Falcon is being annoying AF then your team bot should help you pone him Squad goals but yeah video game Bots have pretty much always sucked because there are only so many edge cases that a programmer can predict like if the human in Fallout 3 has a pistol and isn't moving and there are no enemies nearby run into each other we need to think about this problem differently when you or I start playing a game we don't know anything about its environment beforehand the Hallmark of intelligence is our ability to generalize but can we make artificial intelligence that can generalize to solve any task a team of researchers at Deep mind recently got close by creating one bot that could beat almost any Atari game knowing literally nothing about the game beforehand no game specific hardcoded rules at all it was just fed the raw pixels of the game and its controls using those two things it learned how to be almost any Atari game it was given it did this using a technology called Deep learning if you take a deep neural network and feed it lots of data and compute it can learn to do a whole lot of incredible things the field of deep learning right now is where physics was in the early 1900s the state-of-the-art in a huge number of subfields like vision and speech is being broken almost every other day it's a very exciting time right now the Marie curries and Albert Einstein of computer science are all alive right now and newcomers are coming in every day Deep Mind is awesome and they keep a good chunk of their code private since Google uses it to outperform its competitors but then Elon Musk came along and it was all like I think it's important if we have this incredible power of AI that it not be concentrated in the hands of a few and so he co-founded a nonprofit called open AI whose goal is to democratize AI so anyone can use it and just today they released something called universe universe is a platform that lets you build a bot and test it out in thousands of different environments from games as simple as Space Invaders to Grand Theft Auto to protein folding simulations that could cure cancer you can create a bot and the better you make it the more games it'll learn to become amazing at you can compete with other bot developers to see whose bot beats the most games and universe has other environments too for web interface tasks like managing emails and booking flights if you create a bot that's able to defeat any environment you're not only the dopest coder of all time you just solved intelligence we could then use your Bot to solve literally everything from global warming to Poverty to all known diseases so with that let's create our first simple bot in just 10 lines of python code in our first two lines of code code will import gy and universe gy is open ai's original code base that Universe Builds on and extends to include way more environments and features those are the only two dependencies we'll need now we can select our environment we'll Define an environment variable called EnV and use jy's make method to Define our environment parameter there's so many to choose from it's hard to pick but let's go ahead and pick the popular Flash game coaster racer Universe lets us run as many environments at the same time as we want but for now let's just use one our next step is to initialize our environment with the reset method it'll return a list of what we call observations for every environment we've initialized an observation is an environment specific object that represents what the agent observes like pixel data of what it sees and the state of the game initially we'll just have an empty set of observations since the game hasn't started yet now that we've initialized our environment let's go ahead and create a while statement so our agent will just keep running inde Ely we're just going to have our bot do one simple thing it's going to hit the up Arrow this is formatted by first specifying the type of event the key then true which means press it and we'll do this for each environment's observation we'll call this an action and store it in our action variable now we'll call our environment step method to move forward one time step and use the action as a parameter this is our implementation of reinforcement learning our bot will take an action in our case pushing the up Arrow then it'll observe the result and may or may not receive a reward if that action was beneficial to its goal which in our case is increasing the game score open AI uses a custom image recognition module here to read the game score in order to return a reward this module is included in the environment so we don't need to worry about it if it does receive a reward we could update our bot to do similar actions in the future so it gets better over time through trial and error so the step method returns four variables an observation of the environment a reward a yes or no value if the game is done and some info like performance timings and latencies for debugging and it'll do this for all the environments you train your Bot in simultaneously lastly we'll render the environment so it's visible to us let's demo this baby I'll run the code in terminal and it'll connect to our VNC server in our local Docker container running a flash enabled Chrome browser the prescripted mouse will click through the necessary screens to get the game started then our bot will start programmatically controlling the game remotely yeah our bot really sucks but how dope is this we can do this for as many games as we'd like and to make it better we can try different strategies like random search or hill climbing or just replicate what Deep Mind did they fed the observations that their bot received into a neural network that updated its connections to get better if it received a reward open aai already has a starter bot that uses deep reinforcement learning via tensorflow that I'll put a link to in the description and so to break it down open AI universe is a platform that lets you train and test Bots for thousands of games and other environments reinforcement learning is the process of using trial and error similar to how we learn to improve a bot and if you create one bot that can succeed in any environment it's given you just solved intelligence the coding challenge for this video is to create a bot for just coaster racer that is better than this video's demo code post your GitHub Link in the comments and I'll give a shout out to the winner in my video one week from today and I'll do a one-on-one Google Hangout with them just to say hi and talk about whatever for for now I've got to make a laundry folding robot so thanks for watching

Original Description

In this video, we first go over the history of video game AI, then I introduce OpenAI's Universe, which lets you build a bot that can play thousands of different video games. It has environments for all sorts of games, from Space Invaders, to Grand Theft Auto, to Protein folding simulations. CODING CHALLENGE DUE DATE: Thursday, December 15th. (which is 2 weeks, not 1 week from now like usual) The coding challenge for this video is to make a bot that's better than this video's demo code. Post your Github link in the comments! For your README, just include a 1-3 sentence description of your strategy and instructions on how to run the code.The demo code can be found in the README of the Universe repo. : https://github.com/openai/universe And a Tensorflow based starter bot can be found here: https://github.com/openai/universe-starter-agent Some great learning resources: https://www.nervanasys.com/openai/ http://karpathy.github.io/2016/05/31/rl/ http://kvfrans.com/simple-algoritms-for-solving-cartpole/ https://kofzor.github.io/Reinforcement_Learning_101/ Join other Wizards on our Slack channel: https://wizards.herokuapp.com/ OpenAI asked me to make this video and I gladly said yes. They are awesome!! Please subscribe! And like and comment. That's what keeps me going. And please support me on Patreon: https://www.patreon.com/user?u=3191693 Follow me: Twitter: https://twitter.com/sirajraval Facebook: https://www.facebook.com/sirajology Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/ Signup for my newsletter for exciting updates in the field of AI: https://goo.gl/FZzJ5w Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: http://chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available): https://www.wagergpt.xyz

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Siraj Raval · Siraj Raval · 53 of 60

← Previous Next →

What is Bitcoin?

What is Bitcoin?

5 Ways to Use Bitcoin

5 Ways to Use Bitcoin

BTC Fever - Siraj [Music Video]

BTC Fever - Siraj [Music Video]

5 Reasons to Build Decentralized Apps

5 Reasons to Build Decentralized Apps

The Interplanetary File System

The Interplanetary File System

How to Build a Dapp in 3 min

How to Build a Dapp in 3 min

Life Before Smartphones

Life Before Smartphones

4 Ways to Use Smart Contracts

4 Ways to Use Smart Contracts

3 Dapps You HAVE to See

3 Dapps You HAVE to See

Char's Life as a BitTorrent Engineer

Char's Life as a BitTorrent Engineer

4 Reasons AlphaGo is a Huge Deal

4 Reasons AlphaGo is a Huge Deal

Build a Neural Net in 4 Minutes

Build a Neural Net in 4 Minutes

Sentiment Analysis in 4 Minutes

Sentiment Analysis in 4 Minutes

The Hackathon Life

The Hackathon Life

Your First ML App - Machine Learning for Hackers #1

Your First ML App - Machine Learning for Hackers #1

Build an AI Composer - Machine Learning for Hackers #2

Build an AI Composer - Machine Learning for Hackers #2

Build a Game AI - Machine Learning for Hackers #3

Build a Game AI - Machine Learning for Hackers #3

Build a Movie Recommender - Machine Learning for Hackers #4

Build a Movie Recommender - Machine Learning for Hackers #4

Build an AI Artist - Machine Learning for Hackers #5

Build an AI Artist - Machine Learning for Hackers #5

Build a Chatbot - ML for Hackers #6

Build a Chatbot - ML for Hackers #6

Build an AI Reader - Machine Learning for Hackers #7

Build an AI Reader - Machine Learning for Hackers #7

Build an AI Writer - Machine Learning for Hackers #8

Build an AI Writer - Machine Learning for Hackers #8

Build a Chatbot w/ an API - ML for Hackers #9

Build a Chatbot w/ an API - ML for Hackers #9

One-Shot Learning - Fresh Machine Learning #1

One-Shot Learning - Fresh Machine Learning #1

Generative Adversarial Nets - Fresh Machine Learning #2

Generative Adversarial Nets - Fresh Machine Learning #2

Tone Analysis - Fresh Machine Learning #3

Tone Analysis - Fresh Machine Learning #3

Generate Rap Lyrics - Fresh Machine Learning #4

Generate Rap Lyrics - Fresh Machine Learning #4

Build an Autoencoder in 5 Min - Fresh Machine Learning #5

Build an Autoencoder in 5 Min - Fresh Machine Learning #5

Build a Self Driving Car in 5 Min - Fresh Machine Learning #6

Build a Self Driving Car in 5 Min - Fresh Machine Learning #6

Build an Antivirus in 5 Min - Fresh Machine Learning #7

Build an Antivirus in 5 Min - Fresh Machine Learning #7

TensorFlow in 5 Minutes (tutorial)

TensorFlow in 5 Minutes (tutorial)

Build a Recurrent Neural Net in 5 Min

Build a Recurrent Neural Net in 5 Min

Build a Simulation in 5 Min

Build a Simulation in 5 Min

Build a TensorFlow Image Classifier in 5 Min

Build a TensorFlow Image Classifier in 5 Min

Tensorboard Explained in 5 Min

Tensorboard Explained in 5 Min

Generate Music in TensorFlow

Generate Music in TensorFlow

Build a Game Bot (LIVE)

Build a Game Bot (LIVE)

Deep Learning Frameworks Compared

Deep Learning Frameworks Compared

Introduction - Learn Python for Data Science #1

Introduction - Learn Python for Data Science #1

Build a Neural Network (LIVE)

Build a Neural Network (LIVE)

Twitter Sentiment Analysis - Learn Python for Data Science #2

Twitter Sentiment Analysis - Learn Python for Data Science #2

Recommendation Systems - Learn Python for Data Science #3

Recommendation Systems - Learn Python for Data Science #3

Predicting Stock Prices - Learn Python for Data Science #4

Predicting Stock Prices - Learn Python for Data Science #4

Pong Neural Network (LIVE)

Pong Neural Network (LIVE)

Deep Dream in TensorFlow - Learn Python for Data Science #5

Deep Dream in TensorFlow - Learn Python for Data Science #5

Visualizing Data with D3.js (LIVE)

Visualizing Data with D3.js (LIVE)

Genetic Algorithms - Learn Python for Data Science #6

Genetic Algorithms - Learn Python for Data Science #6

Enter Siraj [Music Video]

Enter Siraj [Music Video]

Build a Web Scraper (LIVE)

Build a Web Scraper (LIVE)

Why is P vs NP Important?

Why is P vs NP Important?

How to Make a Neural Network (LIVE)

How to Make a Neural Network (LIVE)

How to Make an Amazing Tensorflow Chatbot Easily

How to Make an Amazing Tensorflow Chatbot Easily

How to Make an Amazing Video Game Bot Easily

How to Make an Amazing Video Game Bot Easily

How to Make a Tensorflow Neural Network (LIVE)

How to Make a Tensorflow Neural Network (LIVE)

How to Make a Simple Tensorflow Speech Recognizer

How to Make a Simple Tensorflow Speech Recognizer

Joel Shor - Really Quick Questions with an Awesome Google Engineer

Joel Shor - Really Quick Questions with an Awesome Google Engineer

How to Make a Path Planning Algorithm Easily (LIVE)

How to Make a Path Planning Algorithm Easily (LIVE)

The Best Way to Prepare a Dataset Easily

The Best Way to Prepare a Dataset Easily

Catherine Olsson - Really Quick Questions with an OpenAI Engineer

Catherine Olsson - Really Quick Questions with an OpenAI Engineer

How to Make a Tic Tac Toe Neural Network Easily (LIVE)

How to Make a Tic Tac Toe Neural Network Easily (LIVE)

This video teaches how to create a video game bot using OpenAI's Universe platform and reinforcement learning, allowing the bot to play thousands of different games. The bot can be improved through trial and error, and can even be used to solve various tasks such as managing emails and booking flights.

Key Takeaways

Initialize environment with reset method
Create a while statement to keep the agent running indefinitely
Specify the type of event (key press) and press the up arrow for each environment's observation
Call the environment step method to move forward one time step and use the action as a parameter
Render the environment so it's visible to us
Use OpenAI's Universe platform to train and test the bot
Use reinforcement learning to improve the bot through trial and error

💡 The key insight is that OpenAI's Universe platform provides a wide range of environments for training and testing bots, allowing for the creation of highly generalizable bots that can play multiple games and solve various tasks.

🔒 Pro feature: Ask AI to explain this lesson →

More on: Agent Foundations

View skill →

Build and Deploy an Agent with Reasoning Engine in Vertex AI

Adding a Phone Gateway to a Virtual Agent

From Zero to Working AI Agent in 60 Seconds

From Zero to Working AI Agent in 60 Seconds

Create An AI Agent With Replit That Automates Your Sales

Create An AI Agent With Replit That Automates Your Sales

Capstone: Autonomous Runway Detection for IoT

Capstone: Autonomous Runway Detection for IoT

AI Agents with Model Context Protocol & Typescript

AI Agents with Model Context Protocol & Typescript

Related Reads

Can Artificial Intelligence Beat the Inventions That Built Civilization?

Explore how AI's ability to learn and adapt can surpass traditional inventions in impact and innovation

Why AI Skills Matter More Than College Degrees in 2026

In 2026, AI skills surpass college degrees in importance for career success, highlighting a shift in hiring priorities

Hyundai and Kia built a UV system that kills bacteria inside a car while you are sitting in it

Hyundai and Kia develop an in-vehicle UV system to kill bacteria and viruses while passengers are present, using far-ultraviolet light technology

The Next Web AI

The latest AI news we announced in June 2026

Get the latest AI news from Google's June 2026 updates and stay current with industry developments

FABLE 5 IS BACK