AI Powers PAC-MAN - The Game Engine-Free Revolution

What's AI by Louis-François Bouchard · Beginner ·📄 Research Papers Explained ·6y ago

Skills: Reading ML Papers90%LLM Foundations80%ML Pipelines70%

Key Takeaways

The video discusses the GameGAN model, a neural network that can recreate games like PAC-MAN without a game engine, using adversarial training and a new memory module to ensure long-term consistency. The model is trained on screen recordings and agent keystrokes from past gameplay, allowing it to learn the rules of the game and generate new frames.

Full Transcript

and video managed to recreate the whole pacman game with an AI trained on the game itself without any game engine and only using games the best thing from this paper is that this pac-man game copy is even playable this is a first in the field and here's how it's made this is what's AI and I share artificial intelligence news every week if you are new to the channel and want to stay up-to-date please consider subscribing to not miss any further news NVIDIA recently published a paper called learning to simulate dynamic environments with game gam where they visually imitated the pac-man game only by ingesting screen play and keyboard actions during training they made that happen without any underlying game engine they called this model game game it leverages adversarial training to learn to simulate games it is trained by observing screen play along with user's actions and does not require access to the game's logic or engine itself in fact it does not even require a game engine at all in this case they trained the neural networks on the pac-man episodes a few million frames in total paired with data on the keystrokes of an AI agent playing the game the main difference with this new model is that game gained features a new memory module to ensure long-term consistency and is trying to separate static and dynamic elements and yes before you ask what you are currently seeing was a hundred percent made with neural networks with no game engine at all to simplify the problem they framed this as a 2d image generation problem given sequences of observed image frames and the corresponding actions the agent took their goal was to emulate image creation as if it was rendered from a real dynamic environment that is reacting to the agents actions so game can ingest screen play and keyboard actions during training and aims to predict the next frame like conditioning and the action in this pac-man example an action will be a key pressed by the agent game gained is composed of three main modules first there's the dynamics engine which enables game Gann to learn how various aspects of an environment change with respect to the given user action for instance it needs to learn that certain actions are not possible like walking through a wall and how other objects behave as a consequence of the action this permit component is able to learn such transitions by implementing it as an action conditioned lsdm the engine maintains the standard state variables for LS TM HT and city which contain information about every aspect of the current environment at time T then it computes the state variables given a t ZT m t minus 1 and XT to communicate with the other modules and itself as you can see in this illustration the next module is optionally applied for environments that require a long-term consistency for example it's useful if you have an agent that needs to navigate through an environment this environment shall not change when the agent comes back to the same location a few moments later it's an external memory module which uses the neural Turing machine that allows their model to remember every scene it generates in the hidden State and design Ellis that enforces such long-term consistency which is a challenging task for typical models such as Aaron ins this module has a memory block and the attendant location at time T as you can see in this picture at all time the model knows the current location that the agent is located at and their previous t minus 1 location as well as the action taken during this previous step to get to where it is currently in short this new memory module encourages the model to build an internal map of the environment allowing the agent to return to previously visited locations with high visual consistency the last module is a rendering engine Theory Clee it can be simply implemented with standard transpose convolution layers however they decided to introduce a specialized rendering engine architecture for answering long-term consistency by learning to produce these entangled scenes I will not dive deeper into the architecture of this module in this video but I invite you to read their paper if you are interested in this part basically it is responsible for rendering the next stimulated image T plus 1 given a state at a certain time frame T using a purposely designed decoder that learns to disentangle static and dynamic components within the image this makes the behavior of the model more interpretable and if further allows us to modify existing games by swapping out different components to sum up everything the model learns key rules of the game both simple and complex just like in the original game pac-man can't walk through the maze walls he eats up dots as he moves around and when he consumes a power palette the ghosts turned blue and flee when pac-man exists the maze from the one side he is teleported to the opposite end if he runs into a ghost the screen flashes and the game ends the game can addition relies on neural networks instead of a traditional game engine to generate pac-man's environment the AI keeps track of the virtual world remembering what's already been generated to maintain visual consistency from frame to frame no matter the game the gang can learn its rules simply by ingesting screen recordings and agent keystrokes from past gameplay since the model can disentangle the background from the moving character it's possible to recast the game to take place in an outdoor edge maze or swap out pac-man for your favorite character game developers could use this capability to experiment with new character IDs or game themes similarities are used to develop autonomous machines of all kinds such as warehouse robots learning how to grab and move an object around or even delivery robots that must navigate side walls to transport food or medicine game game introduces the possibility that the work of writing a simulator for tests like these could one day be replaced by simply training a neural network suppose you install a camera on a car it can record what the environment looks like or what the driver is doing like turning the steering wheel are hitting the accelerator this data could then be used to train a deep learning model that can predict what will happen in the real world if a human driver Namaskar took an action like slamming the brakes of course this was just a simple overview of the game gain network I strongly recommend to read the paper and the interesting post and Nvidia's blog both linked in the description for more information leave a like if you went this far in the video and since they are over 90% of you guys watching that are not subscribed yet please consider subscribing to the channel to not miss any further news clearly explained [Music]

Original Description

This week my interest was directed towards the new paper: GameGAN. Their AI recreated the PACMAN game! Ask any questions or remarks you have in the comments, I will gladly answer to everything! Subscribe to not miss any AI news and terms clearly vulgarized! Share this to someone who needs to learn more about Artificial Intelligence! Spread knowledge, not germs! NVIDIA's blog post: https://blogs.nvidia.com/blog/2020/05/22/gamegan-research-pacman-anniversary/ The Paper: https://arxiv.org/pdf/2005.12126.pdf The game will be available later this year on: https://www.nvidia.com/en-us/research/ai-playground/ Follow me for more AI content: Instagram: https://www.instagram.com/whats_ai/ LinkedIn: www.linkedin.com/in/whats-ai Twitter: https://twitter.com/Whats_AI Facebook: https://www.facebook.com/whats.artificial.intelligence/ The best courses to start and progress in AI: https://www.omologapps.com/whats-ai Join Our Discord channel, Learn AI Together: https://discord.gg/SVse4Sr Chapters: 0:00 Don't forget to like the video if you enjoyed it, and subscribe to the channel, you won,t regret it, I promise! 0:32 Paper explanation - GameGAN 6:41 Conclusion Song credit: https://soundcloud.com/mattis-rodrigue/sans-titre #NVIDIA #GameGAN#PACMAN

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from What's AI by Louis-François Bouchard · What's AI by Louis-François Bouchard · 36 of 60

← Previous Next →

What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1

What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1

What's AI by Louis-François Bouchard

What is Machine Learning? | Introduction to ML for beginners in a minute 2

What is Machine Learning? | Introduction to ML for beginners in a minute 2

What's AI by Louis-François Bouchard

What is Deep Learning | Introduction to DL for beginners in a minute 3

What is Deep Learning | Introduction to DL for beginners in a minute 3

What's AI by Louis-François Bouchard

What is Supervised Learning | Machine Learning basics explained for beginners 4

What is Supervised Learning | Machine Learning basics explained for beginners 4

What's AI by Louis-François Bouchard

What is Unsupervised Learning | Machine Learning basics explained for beginners 5

What is Unsupervised Learning | Machine Learning basics explained for beginners 5

What's AI by Louis-François Bouchard

What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6

What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6

What's AI by Louis-François Bouchard

What is Reinforcement Learning | Machine Learning basics explained for beginners 7

What is Reinforcement Learning | Machine Learning basics explained for beginners 7

What's AI by Louis-François Bouchard

What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8

What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8

What's AI by Louis-François Bouchard

What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9

What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9

What's AI by Louis-François Bouchard

What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10

What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10

What's AI by Louis-François Bouchard

What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11

What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11

What's AI by Louis-François Bouchard

What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12

What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12

What's AI by Louis-François Bouchard

Comparing AGI and Traditional AI: Now and Beyond

Comparing AGI and Traditional AI: Now and Beyond

What's AI by Louis-François Bouchard

Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals

Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals

What's AI by Louis-François Bouchard

Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV

Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV

What's AI by Louis-François Bouchard

Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications

Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications

What's AI by Louis-François Bouchard

What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17

What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17

What's AI by Louis-François Bouchard

Object Detection Clearly Explained for Everyone

Object Detection Clearly Explained for Everyone

What's AI by Louis-François Bouchard

What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19

What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19

What's AI by Louis-François Bouchard

What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20

What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20

What's AI by Louis-François Bouchard

Data Science Demystified - An Essential Introduction

Data Science Demystified - An Essential Introduction

What's AI by Louis-François Bouchard

Demystifying Data Mining - A Clear and Concise Explanation

Demystifying Data Mining - A Clear and Concise Explanation

What's AI by Louis-François Bouchard

Decoding Logistic Regression - A Simple and Comprehensive Explanation

Decoding Logistic Regression - A Simple and Comprehensive Explanation

What's AI by Louis-François Bouchard

What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24

What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24

What's AI by Louis-François Bouchard

AI or Human? What is the Turing Test

AI or Human? What is the Turing Test

What's AI by Louis-François Bouchard

Genetic Algorithms Demystified - How Algorithms Evolve

Genetic Algorithms Demystified - How Algorithms Evolve

What's AI by Louis-François Bouchard

What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27

What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27

What's AI by Louis-François Bouchard

Human Pose Estimation in Machine Learning Explained (2D & 3D)

Human Pose Estimation in Machine Learning Explained (2D & 3D)

What's AI by Louis-François Bouchard

What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29

What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29

What's AI by Louis-François Bouchard

What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30

What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30

What's AI by Louis-François Bouchard

Introduction to Energy-Based Learning | Yann LeCun Paper

Introduction to Energy-Based Learning | Yann LeCun Paper

What's AI by Louis-François Bouchard

The Science Behind Google Translate: Understanding Transformers

The Science Behind Google Translate: Understanding Transformers

What's AI by Louis-François Bouchard

Mastering CNNs in 5 Minutes | ConvNets Explained

Mastering CNNs in 5 Minutes | ConvNets Explained

What's AI by Louis-François Bouchard

Discover the Power of YOLOv4 - Real-Time Object Detection Simplified

Discover the Power of YOLOv4 - Real-Time Object Detection Simplified

What's AI by Louis-François Bouchard

Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation

Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation

What's AI by Louis-François Bouchard

AI Powers PAC-MAN - The Game Engine-Free Revolution

AI Powers PAC-MAN - The Game Engine-Free Revolution

What's AI by Louis-François Bouchard

This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling

This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling

What's AI by Louis-François Bouchard

Facebook's TransCoder: Converting Programming Languages with AI

Facebook's TransCoder: Converting Programming Languages with AI

What's AI by Louis-François Bouchard

Transforming Images to 3D Models with AI - Discover PIFuHD

Transforming Images to 3D Models with AI - Discover PIFuHD

What's AI by Louis-François Bouchard

Optimize Your ML Models - Avoid Underfitting and Overfitting

Optimize Your ML Models - Avoid Underfitting and Overfitting

What's AI by Louis-François Bouchard

Behind the Scenes - Disney's Secrets to High-Res Face Swaps

Behind the Scenes - Disney's Secrets to High-Res Face Swaps

What's AI by Louis-François Bouchard

Linear Regression in Machine Learning Explained in 5 Minutes

Linear Regression in Machine Learning Explained in 5 Minutes

What's AI by Louis-François Bouchard

Style Transfer Better Than GANs! Swapping Autoencoder Explained

Style Transfer Better Than GANs! Swapping Autoencoder Explained

What's AI by Louis-François Bouchard

Use AI to Remove Objects from Videos

Use AI to Remove Objects from Videos

What's AI by Louis-François Bouchard

OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...

OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...

What's AI by Louis-François Bouchard

Autocomplete Images With AI: image-GPT explained

Autocomplete Images With AI: image-GPT explained

What's AI by Louis-François Bouchard

Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos

Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos

What's AI by Louis-François Bouchard

From Portrait to Cartoon - Discover the Power of FreezeG

From Portrait to Cartoon - Discover the Power of FreezeG

What's AI by Louis-François Bouchard

Transfer clothes between photos using AI. From a single image!

Transfer clothes between photos using AI. From a single image!

What's AI by Louis-François Bouchard

Precise 3D Human Pose and Mesh Estimation from a Single RGB Image

Precise 3D Human Pose and Mesh Estimation from a Single RGB Image

What's AI by Louis-François Bouchard

Smart Navigation - How AI Robots Understand and Explore Environments

Smart Navigation - How AI Robots Understand and Explore Environments

What's AI by Louis-François Bouchard

Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities

Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities

What's AI by Louis-François Bouchard

ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE

ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE

What's AI by Louis-François Bouchard

Maximize Business Efficiency with AI / GPT Technology!

Maximize Business Efficiency with AI / GPT Technology!

What's AI by Louis-François Bouchard

AI Transforms Google Photos into Real-Life Scenes

AI Transforms Google Photos into Real-Life Scenes

What's AI by Louis-François Bouchard

Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results

Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results

What's AI by Louis-François Bouchard

This computer vision algorithm removes the water from underwater images !

This computer vision algorithm removes the water from underwater images !

What's AI by Louis-François Bouchard

DeepFakes in 5 minutes | Understand how deepfakes work and create your own!

DeepFakes in 5 minutes | Understand how deepfakes work and create your own!

What's AI by Louis-François Bouchard

A new brain-inspired intelligent system can drive a car using only 19 control neurons!

A new brain-inspired intelligent system can drive a car using only 19 control neurons!

What's AI by Louis-François Bouchard

Toonify: Turn Real Faces into Animated Disney Characters

Toonify: Turn Real Faces into Animated Disney Characters

What's AI by Louis-François Bouchard

The GameGAN model uses neural networks to recreate games like PAC-MAN without a game engine. It is trained on screen recordings and agent keystrokes from past gameplay, allowing it to learn the rules of the game and generate new frames. This technology has potential applications in game development, autonomous machines, and other fields.

Key Takeaways

Read the research paper on GameGAN
Understand the architecture of the GameGAN model
Learn about adversarial training and its applications
Explore the potential applications of GameGAN in game development and autonomous machines

💡 The GameGAN model can learn the rules of a game and generate new frames without a game engine, using adversarial training and a new memory module to ensure long-term consistency.

🔒 Pro feature: Ask AI to explain this lesson →

More on: Reading ML Papers

View skill →

Automatic Literature Review with GPT-3 - I embedded and indexed all of arXiv into a search engine!

Automatic Literature Review with GPT-3 - I embedded and indexed all of arXiv into a search engine!

Marcos Lopez Caniego - ESASky's JupyterLab widget| JupyterCon 2020

Marcos Lopez Caniego - ESASky's JupyterLab widget| JupyterCon 2020

Obsidian Zotero Integration Plugin | Streamline Your Research Paper Workflow 📝️

Obsidian Zotero Integration Plugin | Streamline Your Research Paper Workflow 📝️

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

Claude 3.7 Sonnet API | Build a Research Assistant

Claude 3.7 Sonnet API | Build a Research Assistant

I Built An Obsidian AI Research Assistant with Oz...

I Built An Obsidian AI Research Assistant with Oz...

Related AI Lessons

I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way

Learn how to effectively find research gaps by changing your approach, a crucial skill for AI researchers and academics

ICMI 2026 Reviews [D]

Learn how to interpret ICMI 2026 reviews and improve your paper's acceptance chances

Reddit r/MachineLearning

Workshop submission for main conference paper under review [D]

Learn how to navigate submitting a paper to a non-archival workshop before the final decision of a main conference like ECCV

Reddit r/MachineLearning

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Streamline your research with a new Chrome extension and website that integrates 3M papers from arxiv, OpenReview, GitHub, and HuggingFace, including citation graphs and SPECTER2 neighbors, and provide feedback to improve it

Reddit r/MachineLearning

Chapters (3)

Don't forget to like the video if you enjoyed it, and subscribe to the channel,

0:32 Paper explanation - GameGAN

6:41 Conclusion

Beyond Big Vendors: ERP Systems Explained #shorts

Digital Transformation with Eric Kimberling