AI Powers PAC-MAN - The Game Engine-Free Revolution

What's AI by Louis-François Bouchard · Beginner ·📄 Research Papers Explained ·6y ago

Key Takeaways

The video discusses the GameGAN model, a neural network that can recreate games like PAC-MAN without a game engine, using adversarial training and a new memory module to ensure long-term consistency. The model is trained on screen recordings and agent keystrokes from past gameplay, allowing it to learn the rules of the game and generate new frames.

Full Transcript

and video managed to recreate the whole pacman game with an AI trained on the game itself without any game engine and only using games the best thing from this paper is that this pac-man game copy is even playable this is a first in the field and here's how it's made this is what's AI and I share artificial intelligence news every week if you are new to the channel and want to stay up-to-date please consider subscribing to not miss any further news NVIDIA recently published a paper called learning to simulate dynamic environments with game gam where they visually imitated the pac-man game only by ingesting screen play and keyboard actions during training they made that happen without any underlying game engine they called this model game game it leverages adversarial training to learn to simulate games it is trained by observing screen play along with user's actions and does not require access to the game's logic or engine itself in fact it does not even require a game engine at all in this case they trained the neural networks on the pac-man episodes a few million frames in total paired with data on the keystrokes of an AI agent playing the game the main difference with this new model is that game gained features a new memory module to ensure long-term consistency and is trying to separate static and dynamic elements and yes before you ask what you are currently seeing was a hundred percent made with neural networks with no game engine at all to simplify the problem they framed this as a 2d image generation problem given sequences of observed image frames and the corresponding actions the agent took their goal was to emulate image creation as if it was rendered from a real dynamic environment that is reacting to the agents actions so game can ingest screen play and keyboard actions during training and aims to predict the next frame like conditioning and the action in this pac-man example an action will be a key pressed by the agent game gained is composed of three main modules first there's the dynamics engine which enables game Gann to learn how various aspects of an environment change with respect to the given user action for instance it needs to learn that certain actions are not possible like walking through a wall and how other objects behave as a consequence of the action this permit component is able to learn such transitions by implementing it as an action conditioned lsdm the engine maintains the standard state variables for LS TM HT and city which contain information about every aspect of the current environment at time T then it computes the state variables given a t ZT m t minus 1 and XT to communicate with the other modules and itself as you can see in this illustration the next module is optionally applied for environments that require a long-term consistency for example it's useful if you have an agent that needs to navigate through an environment this environment shall not change when the agent comes back to the same location a few moments later it's an external memory module which uses the neural Turing machine that allows their model to remember every scene it generates in the hidden State and design Ellis that enforces such long-term consistency which is a challenging task for typical models such as Aaron ins this module has a memory block and the attendant location at time T as you can see in this picture at all time the model knows the current location that the agent is located at and their previous t minus 1 location as well as the action taken during this previous step to get to where it is currently in short this new memory module encourages the model to build an internal map of the environment allowing the agent to return to previously visited locations with high visual consistency the last module is a rendering engine Theory Clee it can be simply implemented with standard transpose convolution layers however they decided to introduce a specialized rendering engine architecture for answering long-term consistency by learning to produce these entangled scenes I will not dive deeper into the architecture of this module in this video but I invite you to read their paper if you are interested in this part basically it is responsible for rendering the next stimulated image T plus 1 given a state at a certain time frame T using a purposely designed decoder that learns to disentangle static and dynamic components within the image this makes the behavior of the model more interpretable and if further allows us to modify existing games by swapping out different components to sum up everything the model learns key rules of the game both simple and complex just like in the original game pac-man can't walk through the maze walls he eats up dots as he moves around and when he consumes a power palette the ghosts turned blue and flee when pac-man exists the maze from the one side he is teleported to the opposite end if he runs into a ghost the screen flashes and the game ends the game can addition relies on neural networks instead of a traditional game engine to generate pac-man's environment the AI keeps track of the virtual world remembering what's already been generated to maintain visual consistency from frame to frame no matter the game the gang can learn its rules simply by ingesting screen recordings and agent keystrokes from past gameplay since the model can disentangle the background from the moving character it's possible to recast the game to take place in an outdoor edge maze or swap out pac-man for your favorite character game developers could use this capability to experiment with new character IDs or game themes similarities are used to develop autonomous machines of all kinds such as warehouse robots learning how to grab and move an object around or even delivery robots that must navigate side walls to transport food or medicine game game introduces the possibility that the work of writing a simulator for tests like these could one day be replaced by simply training a neural network suppose you install a camera on a car it can record what the environment looks like or what the driver is doing like turning the steering wheel are hitting the accelerator this data could then be used to train a deep learning model that can predict what will happen in the real world if a human driver Namaskar took an action like slamming the brakes of course this was just a simple overview of the game gain network I strongly recommend to read the paper and the interesting post and Nvidia's blog both linked in the description for more information leave a like if you went this far in the video and since they are over 90% of you guys watching that are not subscribed yet please consider subscribing to the channel to not miss any further news clearly explained [Music]

Original Description

This week my interest was directed towards the new paper: GameGAN. Their AI recreated the PACMAN game! Ask any questions or remarks you have in the comments, I will gladly answer to everything! Subscribe to not miss any AI news and terms clearly vulgarized! Share this to someone who needs to learn more about Artificial Intelligence! Spread knowledge, not germs! NVIDIA's blog post: https://blogs.nvidia.com/blog/2020/05/22/gamegan-research-pacman-anniversary/ The Paper: https://arxiv.org/pdf/2005.12126.pdf The game will be available later this year on: https://www.nvidia.com/en-us/research/ai-playground/ Follow me for more AI content: Instagram: https://www.instagram.com/whats_ai/ LinkedIn: www.linkedin.com/in/whats-ai Twitter: https://twitter.com/Whats_AI Facebook: https://www.facebook.com/whats.artificial.intelligence/ The best courses to start and progress in AI: https://www.omologapps.com/whats-ai Join Our Discord channel, Learn AI Together: https://discord.gg/SVse4Sr Chapters: 0:00 Don't forget to like the video if you enjoyed it, and subscribe to the channel, you won,t regret it, I promise! 0:32 Paper explanation - GameGAN 6:41 Conclusion Song credit: https://soundcloud.com/mattis-rodrigue/sans-titre #NVIDIA #GameGAN#PACMAN
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from What's AI by Louis-François Bouchard · What's AI by Louis-François Bouchard · 36 of 60

1 What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1
What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1
What's AI by Louis-François Bouchard
2 What is Machine Learning? | Introduction to ML for beginners in a minute 2
What is Machine Learning? | Introduction to ML for beginners in a minute 2
What's AI by Louis-François Bouchard
3 What is Deep Learning | Introduction to DL for beginners in a minute 3
What is Deep Learning | Introduction to DL for beginners in a minute 3
What's AI by Louis-François Bouchard
4 What is Supervised Learning | Machine Learning basics explained for beginners 4
What is Supervised Learning | Machine Learning basics explained for beginners 4
What's AI by Louis-François Bouchard
5 What is Unsupervised Learning | Machine Learning basics explained for beginners 5
What is Unsupervised Learning | Machine Learning basics explained for beginners 5
What's AI by Louis-François Bouchard
6 What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6
What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6
What's AI by Louis-François Bouchard
7 What is Reinforcement Learning | Machine Learning basics explained for beginners 7
What is Reinforcement Learning | Machine Learning basics explained for beginners 7
What's AI by Louis-François Bouchard
8 What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8
What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8
What's AI by Louis-François Bouchard
9 What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9
What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9
What's AI by Louis-François Bouchard
10 What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10
What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10
What's AI by Louis-François Bouchard
11 What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11
What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11
What's AI by Louis-François Bouchard
12 What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12
What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12
What's AI by Louis-François Bouchard
13 Comparing AGI and Traditional AI: Now and Beyond
Comparing AGI and Traditional AI: Now and Beyond
What's AI by Louis-François Bouchard
14 Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals
Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals
What's AI by Louis-François Bouchard
15 Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV
Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV
What's AI by Louis-François Bouchard
16 Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications
Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications
What's AI by Louis-François Bouchard
17 What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17
What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17
What's AI by Louis-François Bouchard
18 Object Detection Clearly Explained for Everyone
Object Detection Clearly Explained for Everyone
What's AI by Louis-François Bouchard
19 What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19
What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19
What's AI by Louis-François Bouchard
20 What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20
What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20
What's AI by Louis-François Bouchard
21 Data Science Demystified - An Essential Introduction
Data Science Demystified - An Essential Introduction
What's AI by Louis-François Bouchard
22 Demystifying Data Mining - A Clear and Concise Explanation
Demystifying Data Mining - A Clear and Concise Explanation
What's AI by Louis-François Bouchard
23 Decoding Logistic Regression - A Simple and Comprehensive Explanation
Decoding Logistic Regression - A Simple and Comprehensive Explanation
What's AI by Louis-François Bouchard
24 What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24
What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24
What's AI by Louis-François Bouchard
25 AI or Human? What is the Turing Test
AI or Human? What is the Turing Test
What's AI by Louis-François Bouchard
26 Genetic Algorithms Demystified - How Algorithms Evolve
Genetic Algorithms Demystified - How Algorithms Evolve
What's AI by Louis-François Bouchard
27 What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27
What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27
What's AI by Louis-François Bouchard
28 Human Pose Estimation in Machine Learning Explained (2D & 3D)
Human Pose Estimation in Machine Learning Explained (2D & 3D)
What's AI by Louis-François Bouchard
29 What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29
What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29
What's AI by Louis-François Bouchard
30 What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30
What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30
What's AI by Louis-François Bouchard
31 Introduction to Energy-Based Learning | Yann LeCun Paper
Introduction to Energy-Based Learning | Yann LeCun Paper
What's AI by Louis-François Bouchard
32 The Science Behind Google Translate: Understanding Transformers
The Science Behind Google Translate: Understanding Transformers
What's AI by Louis-François Bouchard
33 Mastering CNNs in 5 Minutes | ConvNets Explained
Mastering CNNs in 5 Minutes | ConvNets Explained
What's AI by Louis-François Bouchard
34 Discover the Power of YOLOv4 - Real-Time Object Detection Simplified
Discover the Power of YOLOv4 - Real-Time Object Detection Simplified
What's AI by Louis-François Bouchard
35 Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation
Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation
What's AI by Louis-François Bouchard
AI Powers PAC-MAN - The Game Engine-Free Revolution
AI Powers PAC-MAN - The Game Engine-Free Revolution
What's AI by Louis-François Bouchard
37 This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling
This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling
What's AI by Louis-François Bouchard
38 Facebook's TransCoder: Converting Programming Languages with AI
Facebook's TransCoder: Converting Programming Languages with AI
What's AI by Louis-François Bouchard
39 Transforming Images to 3D Models with AI - Discover PIFuHD
Transforming Images to 3D Models with AI - Discover PIFuHD
What's AI by Louis-François Bouchard
40 Optimize Your ML Models - Avoid Underfitting and Overfitting
Optimize Your ML Models - Avoid Underfitting and Overfitting
What's AI by Louis-François Bouchard
41 Behind the Scenes - Disney's Secrets to High-Res Face Swaps
Behind the Scenes - Disney's Secrets to High-Res Face Swaps
What's AI by Louis-François Bouchard
42 Linear Regression in Machine Learning Explained in 5 Minutes
Linear Regression in Machine Learning Explained in 5 Minutes
What's AI by Louis-François Bouchard
43 Style Transfer Better Than GANs! Swapping Autoencoder Explained
Style Transfer Better Than GANs! Swapping Autoencoder Explained
What's AI by Louis-François Bouchard
44 Use AI to Remove Objects from Videos
Use AI to Remove Objects from Videos
What's AI by Louis-François Bouchard
45 OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...
OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...
What's AI by Louis-François Bouchard
46 Autocomplete Images With AI: image-GPT explained
Autocomplete Images With AI: image-GPT explained
What's AI by Louis-François Bouchard
47 Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos
Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos
What's AI by Louis-François Bouchard
48 From Portrait to Cartoon - Discover the Power of FreezeG
From Portrait to Cartoon - Discover the Power of FreezeG
What's AI by Louis-François Bouchard
49 Transfer clothes between photos using AI. From a single image!
Transfer clothes between photos using AI. From a single image!
What's AI by Louis-François Bouchard
50 Precise 3D Human Pose and Mesh Estimation from a Single RGB Image
Precise 3D Human Pose and Mesh Estimation from a Single RGB Image
What's AI by Louis-François Bouchard
51 Smart Navigation - How AI Robots Understand and Explore Environments
Smart Navigation - How AI Robots Understand and Explore Environments
What's AI by Louis-François Bouchard
52 Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities
Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities
What's AI by Louis-François Bouchard
53 ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE
ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE
What's AI by Louis-François Bouchard
54 Maximize Business Efficiency with AI / GPT Technology!
Maximize Business Efficiency with AI / GPT Technology!
What's AI by Louis-François Bouchard
55 AI Transforms Google Photos into Real-Life Scenes
AI Transforms Google Photos into Real-Life Scenes
What's AI by Louis-François Bouchard
56 Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results
Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results
What's AI by Louis-François Bouchard
57 This computer vision algorithm removes the water from underwater images !
This computer vision algorithm removes the water from underwater images !
What's AI by Louis-François Bouchard
58 DeepFakes in 5 minutes | Understand how deepfakes work and create your own!
DeepFakes in 5 minutes | Understand how deepfakes work and create your own!
What's AI by Louis-François Bouchard
59 A new brain-inspired intelligent system can drive a car using only 19 control neurons!
A new brain-inspired intelligent system can drive a car using only 19 control neurons!
What's AI by Louis-François Bouchard
60 Toonify: Turn Real Faces into Animated Disney Characters
Toonify: Turn Real Faces into Animated Disney Characters
What's AI by Louis-François Bouchard

The GameGAN model uses neural networks to recreate games like PAC-MAN without a game engine. It is trained on screen recordings and agent keystrokes from past gameplay, allowing it to learn the rules of the game and generate new frames. This technology has potential applications in game development, autonomous machines, and other fields.

Key Takeaways
  1. Read the research paper on GameGAN
  2. Understand the architecture of the GameGAN model
  3. Learn about adversarial training and its applications
  4. Explore the potential applications of GameGAN in game development and autonomous machines
💡 The GameGAN model can learn the rules of a game and generate new frames without a game engine, using adversarial training and a new memory module to ensure long-term consistency.

Related AI Lessons

I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way
Learn how to effectively find research gaps by changing your approach, a crucial skill for AI researchers and academics
Medium · AI
ICMI 2026 Reviews [D]
Learn how to interpret ICMI 2026 reviews and improve your paper's acceptance chances
Reddit r/MachineLearning
Workshop submission for main conference paper under review [D]
Learn how to navigate submitting a paper to a non-archival workshop before the final decision of a main conference like ECCV
Reddit r/MachineLearning
Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]
Streamline your research with a new Chrome extension and website that integrates 3M papers from arxiv, OpenReview, GitHub, and HuggingFace, including citation graphs and SPECTER2 neighbors, and provide feedback to improve it
Reddit r/MachineLearning

Chapters (3)

Don't forget to like the video if you enjoyed it, and subscribe to the channel,
0:32 Paper explanation - GameGAN
6:41 Conclusion
Up next
Beyond Big Vendors: ERP Systems Explained #shorts
Digital Transformation with Eric Kimberling
Watch →