AI Powers PAC-MAN - The Game Engine-Free Revolution
Key Takeaways
The video discusses the GameGAN model, a neural network that can recreate games like PAC-MAN without a game engine, using adversarial training and a new memory module to ensure long-term consistency. The model is trained on screen recordings and agent keystrokes from past gameplay, allowing it to learn the rules of the game and generate new frames.
Full Transcript
and video managed to recreate the whole pacman game with an AI trained on the game itself without any game engine and only using games the best thing from this paper is that this pac-man game copy is even playable this is a first in the field and here's how it's made this is what's AI and I share artificial intelligence news every week if you are new to the channel and want to stay up-to-date please consider subscribing to not miss any further news NVIDIA recently published a paper called learning to simulate dynamic environments with game gam where they visually imitated the pac-man game only by ingesting screen play and keyboard actions during training they made that happen without any underlying game engine they called this model game game it leverages adversarial training to learn to simulate games it is trained by observing screen play along with user's actions and does not require access to the game's logic or engine itself in fact it does not even require a game engine at all in this case they trained the neural networks on the pac-man episodes a few million frames in total paired with data on the keystrokes of an AI agent playing the game the main difference with this new model is that game gained features a new memory module to ensure long-term consistency and is trying to separate static and dynamic elements and yes before you ask what you are currently seeing was a hundred percent made with neural networks with no game engine at all to simplify the problem they framed this as a 2d image generation problem given sequences of observed image frames and the corresponding actions the agent took their goal was to emulate image creation as if it was rendered from a real dynamic environment that is reacting to the agents actions so game can ingest screen play and keyboard actions during training and aims to predict the next frame like conditioning and the action in this pac-man example an action will be a key pressed by the agent game gained is composed of three main modules first there's the dynamics engine which enables game Gann to learn how various aspects of an environment change with respect to the given user action for instance it needs to learn that certain actions are not possible like walking through a wall and how other objects behave as a consequence of the action this permit component is able to learn such transitions by implementing it as an action conditioned lsdm the engine maintains the standard state variables for LS TM HT and city which contain information about every aspect of the current environment at time T then it computes the state variables given a t ZT m t minus 1 and XT to communicate with the other modules and itself as you can see in this illustration the next module is optionally applied for environments that require a long-term consistency for example it's useful if you have an agent that needs to navigate through an environment this environment shall not change when the agent comes back to the same location a few moments later it's an external memory module which uses the neural Turing machine that allows their model to remember every scene it generates in the hidden State and design Ellis that enforces such long-term consistency which is a challenging task for typical models such as Aaron ins this module has a memory block and the attendant location at time T as you can see in this picture at all time the model knows the current location that the agent is located at and their previous t minus 1 location as well as the action taken during this previous step to get to where it is currently in short this new memory module encourages the model to build an internal map of the environment allowing the agent to return to previously visited locations with high visual consistency the last module is a rendering engine Theory Clee it can be simply implemented with standard transpose convolution layers however they decided to introduce a specialized rendering engine architecture for answering long-term consistency by learning to produce these entangled scenes I will not dive deeper into the architecture of this module in this video but I invite you to read their paper if you are interested in this part basically it is responsible for rendering the next stimulated image T plus 1 given a state at a certain time frame T using a purposely designed decoder that learns to disentangle static and dynamic components within the image this makes the behavior of the model more interpretable and if further allows us to modify existing games by swapping out different components to sum up everything the model learns key rules of the game both simple and complex just like in the original game pac-man can't walk through the maze walls he eats up dots as he moves around and when he consumes a power palette the ghosts turned blue and flee when pac-man exists the maze from the one side he is teleported to the opposite end if he runs into a ghost the screen flashes and the game ends the game can addition relies on neural networks instead of a traditional game engine to generate pac-man's environment the AI keeps track of the virtual world remembering what's already been generated to maintain visual consistency from frame to frame no matter the game the gang can learn its rules simply by ingesting screen recordings and agent keystrokes from past gameplay since the model can disentangle the background from the moving character it's possible to recast the game to take place in an outdoor edge maze or swap out pac-man for your favorite character game developers could use this capability to experiment with new character IDs or game themes similarities are used to develop autonomous machines of all kinds such as warehouse robots learning how to grab and move an object around or even delivery robots that must navigate side walls to transport food or medicine game game introduces the possibility that the work of writing a simulator for tests like these could one day be replaced by simply training a neural network suppose you install a camera on a car it can record what the environment looks like or what the driver is doing like turning the steering wheel are hitting the accelerator this data could then be used to train a deep learning model that can predict what will happen in the real world if a human driver Namaskar took an action like slamming the brakes of course this was just a simple overview of the game gain network I strongly recommend to read the paper and the interesting post and Nvidia's blog both linked in the description for more information leave a like if you went this far in the video and since they are over 90% of you guys watching that are not subscribed yet please consider subscribing to the channel to not miss any further news clearly explained [Music]
Original Description
This week my interest was directed towards the new paper: GameGAN. Their AI recreated the PACMAN game! Ask any questions or remarks you have in the comments, I will gladly answer to everything!
Subscribe to not miss any AI news and terms clearly vulgarized! Share this to someone who needs to learn more about Artificial Intelligence! Spread knowledge, not germs!
NVIDIA's blog post: https://blogs.nvidia.com/blog/2020/05/22/gamegan-research-pacman-anniversary/
The Paper: https://arxiv.org/pdf/2005.12126.pdf
The game will be available later this year on: https://www.nvidia.com/en-us/research/ai-playground/
Follow me for more AI content:
Instagram: https://www.instagram.com/whats_ai/
LinkedIn: www.linkedin.com/in/whats-ai
Twitter: https://twitter.com/Whats_AI
Facebook: https://www.facebook.com/whats.artificial.intelligence/
The best courses to start and progress in AI:
https://www.omologapps.com/whats-ai
Join Our Discord channel, Learn AI Together:
https://discord.gg/SVse4Sr
Chapters:
0:00 Don't forget to like the video if you enjoyed it, and subscribe to the channel, you won,t regret it, I promise!
0:32 Paper explanation - GameGAN
6:41 Conclusion
Song credit: https://soundcloud.com/mattis-rodrigue/sans-titre
#NVIDIA #GameGAN#PACMAN
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from What's AI by Louis-François Bouchard · What's AI by Louis-François Bouchard · 36 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
▶
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1
What's AI by Louis-François Bouchard
What is Machine Learning? | Introduction to ML for beginners in a minute 2
What's AI by Louis-François Bouchard
What is Deep Learning | Introduction to DL for beginners in a minute 3
What's AI by Louis-François Bouchard
What is Supervised Learning | Machine Learning basics explained for beginners 4
What's AI by Louis-François Bouchard
What is Unsupervised Learning | Machine Learning basics explained for beginners 5
What's AI by Louis-François Bouchard
What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6
What's AI by Louis-François Bouchard
What is Reinforcement Learning | Machine Learning basics explained for beginners 7
What's AI by Louis-François Bouchard
What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8
What's AI by Louis-François Bouchard
What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9
What's AI by Louis-François Bouchard
What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10
What's AI by Louis-François Bouchard
What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11
What's AI by Louis-François Bouchard
What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12
What's AI by Louis-François Bouchard
Comparing AGI and Traditional AI: Now and Beyond
What's AI by Louis-François Bouchard
Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals
What's AI by Louis-François Bouchard
Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV
What's AI by Louis-François Bouchard
Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications
What's AI by Louis-François Bouchard
What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17
What's AI by Louis-François Bouchard
Object Detection Clearly Explained for Everyone
What's AI by Louis-François Bouchard
What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19
What's AI by Louis-François Bouchard
What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20
What's AI by Louis-François Bouchard
Data Science Demystified - An Essential Introduction
What's AI by Louis-François Bouchard
Demystifying Data Mining - A Clear and Concise Explanation
What's AI by Louis-François Bouchard
Decoding Logistic Regression - A Simple and Comprehensive Explanation
What's AI by Louis-François Bouchard
What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24
What's AI by Louis-François Bouchard
AI or Human? What is the Turing Test
What's AI by Louis-François Bouchard
Genetic Algorithms Demystified - How Algorithms Evolve
What's AI by Louis-François Bouchard
What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27
What's AI by Louis-François Bouchard
Human Pose Estimation in Machine Learning Explained (2D & 3D)
What's AI by Louis-François Bouchard
What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29
What's AI by Louis-François Bouchard
What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30
What's AI by Louis-François Bouchard
Introduction to Energy-Based Learning | Yann LeCun Paper
What's AI by Louis-François Bouchard
The Science Behind Google Translate: Understanding Transformers
What's AI by Louis-François Bouchard
Mastering CNNs in 5 Minutes | ConvNets Explained
What's AI by Louis-François Bouchard
Discover the Power of YOLOv4 - Real-Time Object Detection Simplified
What's AI by Louis-François Bouchard
Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation
What's AI by Louis-François Bouchard
AI Powers PAC-MAN - The Game Engine-Free Revolution
What's AI by Louis-François Bouchard
This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling
What's AI by Louis-François Bouchard
Facebook's TransCoder: Converting Programming Languages with AI
What's AI by Louis-François Bouchard
Transforming Images to 3D Models with AI - Discover PIFuHD
What's AI by Louis-François Bouchard
Optimize Your ML Models - Avoid Underfitting and Overfitting
What's AI by Louis-François Bouchard
Behind the Scenes - Disney's Secrets to High-Res Face Swaps
What's AI by Louis-François Bouchard
Linear Regression in Machine Learning Explained in 5 Minutes
What's AI by Louis-François Bouchard
Style Transfer Better Than GANs! Swapping Autoencoder Explained
What's AI by Louis-François Bouchard
Use AI to Remove Objects from Videos
What's AI by Louis-François Bouchard
OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...
What's AI by Louis-François Bouchard
Autocomplete Images With AI: image-GPT explained
What's AI by Louis-François Bouchard
Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos
What's AI by Louis-François Bouchard
From Portrait to Cartoon - Discover the Power of FreezeG
What's AI by Louis-François Bouchard
Transfer clothes between photos using AI. From a single image!
What's AI by Louis-François Bouchard
Precise 3D Human Pose and Mesh Estimation from a Single RGB Image
What's AI by Louis-François Bouchard
Smart Navigation - How AI Robots Understand and Explore Environments
What's AI by Louis-François Bouchard
Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities
What's AI by Louis-François Bouchard
ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE
What's AI by Louis-François Bouchard
Maximize Business Efficiency with AI / GPT Technology!
What's AI by Louis-François Bouchard
AI Transforms Google Photos into Real-Life Scenes
What's AI by Louis-François Bouchard
Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results
What's AI by Louis-François Bouchard
This computer vision algorithm removes the water from underwater images !
What's AI by Louis-François Bouchard
DeepFakes in 5 minutes | Understand how deepfakes work and create your own!
What's AI by Louis-François Bouchard
A new brain-inspired intelligent system can drive a car using only 19 control neurons!
What's AI by Louis-François Bouchard
Toonify: Turn Real Faces into Animated Disney Characters
What's AI by Louis-François Bouchard
More on: Reading ML Papers
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way
Medium · AI
ICMI 2026 Reviews [D]
Reddit r/MachineLearning
Workshop submission for main conference paper under review [D]
Reddit r/MachineLearning
Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]
Reddit r/MachineLearning
Chapters (3)
Don't forget to like the video if you enjoyed it, and subscribe to the channel,
0:32
Paper explanation - GameGAN
6:41
Conclusion
🎓
Tutor Explanation
DeepCamp AI