Teaching Robots to Play | AI and Games #12

AI and Games · Advanced ·📄 Research Papers Explained ·10y ago

Skills: Research Methods90%Reading ML Papers80%RAG Basics70%Vector Stores60%RAG Evaluation60%

Key Takeaways

The video discusses AI research in games, focusing on procedural content generation, general intelligence, and evaluation of generated content, using tools like Angelina and Mario AI competition.

Full Transcript

[Music] this is AI games and welcome to a new lecture entitled teaching robots to play why science loves gaming quick heads up that the original article as well as the video as part of this is a reproduction of a talk originally given during the res sessions of the egx 2015 event at the NEC in Birmingham UK in September mber 2015 one of the most interesting aspects of human behavior and by extension applications of intelligence as a species is the relationship that we humans have with games in principle games are tasks or challenges designed to stimulate the brain however while our brains are often forced to deal with demanding and often stressful mental tasks as part of our daily lives we still have time for games and the Notions of play games are designed to keep us challenged But ultimately do not prove so taxing that they prove too demanding or stressful for our regular consumption it's certainly interesting to observe the relationship between humans and the notion of play as a means of relaxation there are a number of reasons why games prove to be an interesting domain to explore scientific problems and more importantly AI challenges perhaps unsurprisingly these are largely reasons that humans already embrace them one major reason that good games prove so effective is that they act as Frameworks for reward through structured activ ity games Define Loops of behavior that have completed and repeated successfully will reward the player through a number of means some of these rewards are mild in nature and maybe purely cosmetic whereas others allow for a sense of progression to be conceptualized by the player these interactions continue to increase in scale but help maintain a player's interest and momentum until the long-term and explicit reward is achieved the Super Mario Brothers series is one of the finest examples of how reward Frameworks can be used to drive and maintain player interest reward interactions in the loops of behavior required to release them are often referred to as compulsion Loops whereby we maintain a player's interest by insuring a reward within an abstract and relative time frame short-term Loops are often the result of simple interactions that may be largely cosmetic but help maintain a player's engagement the interaction and response from the collection of coins and super mar Brothers may seem simple but the use of counters and sound effects provide positive reinforcement to users that their actions not only make sense but also work to BS their long-term goals this subsequently scales to medium-term compulsion Loops conceptualized through levels given every level of Super Mario games celebrate the fact the player has completed that activity returning back to coin collection continued adoption of the shortterm loop rewards you with extra lives this medium-term Loop is now reinforcing a player's continued adoption of the short-term Loop and gives not only context but a real quantifiable reason to continue doing it this scales farther into long-term Loops of activity as levels are grouped into Worlds with the requirement to defeat a boss enemy with closure achieved through defeat of Bowser in the eighth and final world ultimately the point here is that we have this confined system within which Intelligent Decisions can be made we can quantify their value as well as identify their position within the road map of future actions you can take in order to win that game conversely good games are able to quickly point out bad interactions you're making and thus reinforce to you that doing certain things is bad while Mario does a good job of this it's arguably his competitor son at the Hedgehog who signifies this even better with significantly exaggerated behavior in the event rings are lost thus losing all progress on that medium-term Loop but also when losing lives the next major factor that helps build a game as a valid scientific problem is that it has to be fun while fun is a largely subjective notion there's evidence to suggest that the level of challenge involved must meet a certain threshold in order for it to be interesting in the eyes of when we consider this from a computer science perspective we would actually classify that games must be at least non-deterministic polinomial time hard or NP hyen hard for short in short this means it's something of a non-trivial problem in computer science NP problems are typically ones that require some intelligent algorithmic process in order for them to be solved in a reasonable amount of time if we were to look at the range of games out there from online FPS games such as Call of Duty to racing games like Forza and even smaller and functionally simpler games such as Flappy Bird we can begin to recognize a large range of problems that are not only sufficiently difficult video games but carry a range of equally interesting decision problems in their own right despite this assertion of a certain level of computationally defined difficulty we would not paint all games as the same games can carry a variety of problem traits that make them interesting for autonomous systems to try and solve these traits can change between genres of games and even releases within the same game series these traits not only result in games exhibiting particular artificial intelligence problems but also begin to necessitate the use of certain AI techniques and methodologies given they are useful for that particular type of problem contrary to the popular opinion AI is not some black box design that will work in any and all circumstances AI systems and the techniques used to build them are typically specialist in nature focusing on very particular types of problems only now after over 50 years of research in this area are we seriously looking at the challenge es of building general intelligence systems which we'll discuss later there are several properties of a game that we will typically consider when trying to figure out how best to approach the problem there are three that prove to be rather important number one accessible knowledge just how much do we know about the game we're playing at any point in time this can be both a blessing and a curse depending on how much we actually know in some games we may not actually know everything about the current state of the game at this point in time this typically the case case in card games ranging from Texas holdom to Hearthstone we don't know what cards the other player is holding but can make some educated guesses that ultimately guide our decision- making conversely we exploit this imperfect information given that the opponent does not know what hands we might play but this doesn't mean that knowing everything about the world will help us that much either one of the best examples of this can be found in fighting games such as Street Fighter Mortal Combat and Killer Instinct in each case we can see the whole state of the world where the player is how much health or energy bar they have and the time remaining in that round despite this the number of possible actions that can be executed in that state leads to a large number of possible future States also known as successors this large number of actions in future States implies the branching factor of a given State meaning that even if we start thinking three or four moves ahead we need to start filtering out decisions that we don't think the opposing player will make given the number of possibilities is massive number two predicting the unpredictable one vital aspect of gaming has been able to see things before they actually happen allowing us to make quick decisions and react to changes in the world when playing platforming games we quickly learn the Mantia of the movement mechanics meaning we can predict whether we can make certain jumps in particular circumstances and quickly adapt to survive predictability can also come in really handy for dealing with enemies learning the behavior patterns of bosses in games such as as Dark Souls is key to knowing when to attack and when to fall back and defend but sometimes our model of that predictability is broken and that makes things so much harder for us one of the best examples of this can be found in Pac-Man where the original ghosts are deterministic in nature meaning we can learn in time what an enemy will do at any point however in the sequel Miss Pac-Man the ghosts are able to make random moves at Junctions if they so wish this results in a non-deterministic system meaning that we can no longer predict safely making the game significantly more difficult number three the players the enemies and the actors just how many characters are in this game and making semi-intelligent decisions this ties back to not only the complexity issue but also the branching Factor discussed earlier the branching factors influenced not just on how many actions you can make at any given state or frame of that game but also the actions that any other character can make in that world the number of unique configurations of the game world can explode at an exponential rate once you have multiple characters that can all do different things at once we need to figure out a which information is useful to us B what we can ignore and C how do we ensure that the space of all potential game configurations is tractable meaning that an AI can actually search it to find answers despite all this forboding and Gloom there's still an awful lot to celebrate AI research and games kicked off in full swing in the mid 2000s with a number of big projects bringing the community together as well as solving some interesting challenges the second major problem area to gather attention was AI that complained Mario which actually turned out to be a lot easier than we originally envisaged in fact the more interesting problems were not whether AI could play Mario but whether it could build Mario levels leading to a sudden Surge and research in procedural content generation be sure to check out the rather large overview of the gam plan level generation tracks we have here at AI games something of an interesting challenge to create AI that can play Unreal Tournament 2000 4 however unlike most challenges this was not about trying to beat the game or be the most effective at it but whether you could fool other humans into believing that the bot was not an AI this is an example of the churing test in which you build AI that can tackle tasks we would expect of a human but design it such that it cannot be distinguished from humans when observed this competition ran for several years until a winner was found that was able to fool judges into believing it was actually a human player despite this there's still a lot of work to do and many challenges yet to be solved we break down some of the bigger talking points here as well as point out to some interesting reading material for you to check out if you're interested PCG has become a big talking point in the academic Community for a number of reasons it's perhaps not considered AI as such in The Wider discussion but generative systems are making Intelligent Decisions to craft artifacts what makes this an even bigger task is that how to evaluate this content is highly subjective unlike many other AI problems such as robotics shed eding and even playing games we cannot wholly evaluate the quality of the final output in a robotics problem we can evaluate against the expected Behavior or even how well the robot Works in specific circumstances however with generated content while we can evaluate whether aderes to specific functional aspects we might struggle to identify more aesthetic and subjective aspects of that content so while we can quantify whether a gun can actually hurt an enemy or of a level is playable it's much harder to establish whether that gun was interesting to use or if that level was fun to play some links to check out include Angelina which is the AI system built to create entire games by itself the aforementioned Mario AI competition piece here on AI and games.com as well as checking out proc Jam a game Jam all about procedural content generation one of the most exciting Fields happening in AI right now is the notion of general intelligence the reason for this is in actuality AI systems are typically specialist in nature another they're very good at one thing and one thing alone this is contrary to a lot of Science Fiction in that for example Skynet and the Terminator or showdown in the system shock are systems that are largely omnipotent and can solve any problem placed in front of them this can be seen When developing AI that can play a particular game while we can write an AI that can play Pac-Man it cannot play Super Mario Brothers and vice versa this is an issue that spreads far beyond games and into larger real world problems general intelligence is the challenge of building AI that can solve any problem you give it which is far more in line with the original aspirations of AI from the early 20th century this is now a big problem with research departments at universities as well as big tech companies attempting to solve it some links to check out include the three-part series The Challenge of general intelligence in games hosted on AI and games as well as looking at the general video game AI competition hosted at the University of Essex in the UK as well as Google deep minds work in exactly the same problem area to conclude as games become more increasingly complex so do the artificially intelligent systems that seek to learn from them we are fortunate in that gaming is such a vibrant and creative field given it provides a continuous body of complex and interesting problem spaces to be working within in our own way science loves gaming for our own selfish reasons with complex problem spaces that require reactive and long-term decision-making systems to handle some of the most dynamic and multifaceted domains outside of the real world itself though to be honest science is into games were pretty much the same reasons as everyone else we're here to have fun this has been teaching robots to play why science loves gaming on AI and games thanks for listening and be sure to check out more over on aiam.com

Original Description

www.aiandgames.com www.patreon.com/ai_and_games In September 2015 we gave a talk at the Rezzed sessions of the EGX 2015 event in the NEC in Birmingham, UK. In this talk, we aimed at establishing 'what' AI research is, why people do it and what are the big issues we are tackling in this modern day. A write-up of the lecture can be found over on AIandGames at: http://aiandgames.com/why-science-loves-gaming/

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from AI and Games · AI and Games · 5 of 60

← Previous Next →

Evolving Particle Weapons in Galactic Arms Race | AI and Games #04

Evolving Particle Weapons in Galactic Arms Race | AI and Games #04

Pac-Man AI Research and Competitions | AI and Games #06

Pac-Man AI Research and Competitions | AI and Games #06

The Behaviour Tree AI of Halo 2 | AI and Games #09

The Behaviour Tree AI of Halo 2 | AI and Games #09

Researching Super Mario Bros. Level Design | AI and Games #10

Researching Super Mario Bros. Level Design | AI and Games #10

Teaching Robots to Play | AI and Games #12

Teaching Robots to Play | AI and Games #12

The Quest for AI Game Designers | AI and Games #13

The Quest for AI Game Designers | AI and Games #13

HTN Planning in Transformers: Fall of Cybertron | AI and Games #14

HTN Planning in Transformers: Fall of Cybertron | AI and Games #14

The AI of Alien: Isolation | AI and Games #15

The AI of Alien: Isolation | AI and Games #15

Status Performance Analysis in Team Fortress 2 | AI and Games #17

Status Performance Analysis in Team Fortress 2 | AI and Games #17

Training the Shadow AI of Killer Instinct (2013) | AI and Games #18

Training the Shadow AI of Killer Instinct (2013) | AI and Games #18

Resurrection & Reverence: The Return of DOOM | Design Dive

Resurrection & Reverence: The Return of DOOM | Design Dive

Left Behind on LV-426 - The Design of Aliens: Colonial Marines | Design Dive

Left Behind on LV-426 - The Design of Aliens: Colonial Marines | Design Dive

Games By ANGELINA, the AI Game Designer | AI and Games #20

Games By ANGELINA, the AI Game Designer | AI and Games #20

Prepare to Die by Simple AI - Dark Souls and Difficulty | Design Dive

Prepare to Die by Simple AI - Dark Souls and Difficulty | Design Dive

Looking for Love on Pandora: PCG and Borderlands 2 | Design Dive

Looking for Love on Pandora: PCG and Borderlands 2 | Design Dive

The AI of Shogun: Total War | AI and Games #21

The AI of Shogun: Total War | AI and Games #21

The Campaign AI of Total War: Rome II | AI and Games #23

The Campaign AI of Total War: Rome II | AI and Games #23

AI 101: Monte Carlo Tree Search

AI 101: Monte Carlo Tree Search

The Diplomacy AI in Total War: Attila | AI and Games #24

The Diplomacy AI in Total War: Attila | AI and Games #24

A History of AI Research in StarCraft | AI and Games #26

A History of AI Research in StarCraft | AI and Games #26

Dota 2, MOBA's and the Future of AI Research | AI and Games #27

Dota 2, MOBA's and the Future of AI Research | AI and Games #27

Procedural Level Generation in Sure Footing | AI and Games #28

Procedural Level Generation in Sure Footing | AI and Games #28

Behind the AI and Storytelling of Spec Ops: The Line | AI and Games #29

Behind the AI and Storytelling of Spec Ops: The Line | AI and Games #29

The AI of DOOM (2016) | AI and Games #30

The AI of DOOM (2016) | AI and Games #30

Machine Learning Analysis of Player Behaviour in Tomb Raider: Underworld | AI and Games #31

Machine Learning Analysis of Player Behaviour in Tomb Raider: Underworld | AI and Games #31

How A Navigation Mesh Works in 3D Games | AI 101

How A Navigation Mesh Works in 3D Games | AI 101

How Halo 3 Builds Large-Scale AI Battles | AI and Games #33

How Halo 3 Builds Large-Scale AI Battles | AI and Games #33

Enemy AI Design in Tom Clancy's The Division (Part 1 of 2) | AI and Games #34

Enemy AI Design in Tom Clancy's The Division (Part 1 of 2) | AI and Games #34

Building the Online World of Tom Clancy's The Division (Part 2 of 2) | AI and Games #35

Building the Online World of Tom Clancy's The Division (Part 2 of 2) | AI and Games #35

Behaviour Trees: The Cornerstone of Modern Game AI | AI 101

Behaviour Trees: The Cornerstone of Modern Game AI | AI 101

Why Friendly AI Cheat in Ghost Recon Wildlands | AI and Games #36

Why Friendly AI Cheat in Ghost Recon Wildlands | AI and Games #36

The AI of Horizon Zero Dawn | Part 1: Rise of the Machines | AI and Games #37

The AI of Horizon Zero Dawn | Part 1: Rise of the Machines | AI and Games #37

The AI of Horizon Zero Dawn | Part 2: Metal Militia | AI and Games #38

The AI of Horizon Zero Dawn | Part 2: Metal Militia | AI and Games #38

Augmented Reaction: Vanquish - 9 Years Later | Design Dive

Augmented Reaction: Vanquish - 9 Years Later | Design Dive

Building Mario Levels with Machine Learning | AI and Games #39

Building Mario Levels with Machine Learning | AI and Games #39

The AI of Half-Life: Finite State Machines | AI 101

The AI of Half-Life: Finite State Machines | AI 101

Building a Pirate's Paradise in Sea of Thieves | AI and Games #40

Building a Pirate's Paradise in Sea of Thieves | AI and Games #40

The Secrets of Skeleton and Shark AI in Sea of Thieves | AI and Games #41

The Secrets of Skeleton and Shark AI in Sea of Thieves | AI and Games #41

How Megalodon, Kraken and Skeleton Ships Haunt the Sea of Thieves | AI and Games #42

How Megalodon, Kraken and Skeleton Ships Haunt the Sea of Thieves | AI and Games #42

How Rare Tests Sea of Thieves to Stop Bugs Reaching Players | AI and Games #43

How Rare Tests Sea of Thieves to Stop Bugs Reaching Players | AI and Games #43

The Legacy of GoldenEye 007 | Design Dive

The Legacy of GoldenEye 007 | Design Dive

The Secrets of GoldenEye's AI Revealed | AI and Games #44

The Secrets of GoldenEye's AI Revealed | AI and Games #44

Sandbox Assassin: The AI of Hitman (2016) | AI and Games #45

Sandbox Assassin: The AI of Hitman (2016) | AI and Games #45

The Dangers of AI, Microtransactions & Lootboxes | Design Dive

The Dangers of AI, Microtransactions & Lootboxes | Design Dive

Minecraft Villages Built by AI - The Generative Design in Minecraft Competition | AI and Games #46

Minecraft Villages Built by AI - The Generative Design in Minecraft Competition | AI and Games #46

The Secret Reward Systems of Dark Souls II | Design Dive

The Secret Reward Systems of Dark Souls II | Design Dive

Why Adding Bots to Fortnite Was a Great Idea | Design Dive

Why Adding Bots to Fortnite Was a Great Idea | Design Dive

The Best Games Engines for AI (2019) | AI 101

The Best Games Engines for AI (2019) | AI 101

How Atriox Can Beat You in Halo Wars 2 Without Cheating | AI and Games #47

How Atriox Can Beat You in Halo Wars 2 Without Cheating | AI and Games #47

How AlphaStar Became a StarCraft Grandmaster | AI and Games #48

How AlphaStar Became a StarCraft Grandmaster | AI and Games #48

Why AlphaStar Does Not Solve Gaming's AI Problems | Design Dive

Why AlphaStar Does Not Solve Gaming's AI Problems | Design Dive

Designing the Enemy AI of Tom Clancy's The Division 2 | AI and Games

Designing the Enemy AI of Tom Clancy's The Division 2 | AI and Games

Bringing Washington D.C. to Life: The AI of Tom Clancy's The Division 2 | AI and Games

Bringing Washington D.C. to Life: The AI of Tom Clancy's The Division 2 | AI and Games

The Secret AI Testers Inside Tom Clancy's The Division 2 | AI and Games

The Secret AI Testers Inside Tom Clancy's The Division 2 | AI and Games

DOOM 64 Revisited | Design Dive

DOOM 64 Revisited | Design Dive

The Story of Facade: The AI-Powered Interactive Drama | AI and Games #49

The Story of Facade: The AI-Powered Interactive Drama | AI and Games #49

Building the AI of F.E.A.R. with Goal Oriented Action Planning | AI 101

Building the AI of F.E.A.R. with Goal Oriented Action Planning | AI 101

Revisiting the AI of Alien: Isolation | AI and Games #50

Revisiting the AI of Alien: Isolation | AI and Games #50

How Splinter Cell: Blacklist Builds Balance for Stealth | AI and Games #51

How Splinter Cell: Blacklist Builds Balance for Stealth | AI and Games #51

Endure and Survive: the AI of The Last of Us | AI and Games #52

Endure and Survive: the AI of The Last of Us | AI and Games #52

The video teaches about AI research in games, focusing on procedural content generation, general intelligence, and evaluation of generated content. It provides an overview of the challenges and opportunities in this field, highlighting the importance of games as a complex and interesting problem space for AI to learn from.

Key Takeaways

Understand the basics of procedural content generation
Learn about general intelligence and its challenges
Evaluate generated content using subjective and quantitative methods
Apply research methods to game development
Use tools like Angelina and Mario AI competition to develop AI systems

💡 Games provide a complex and interesting problem space for AI to learn from, making them an ideal domain for AI research and development.

🔒 Pro feature: Ask AI to explain this lesson →

More on: Research Methods

View skill →

Mechanics of Materials III: Beam Bending

Mechanics of Materials III: Beam Bending

Inaugural Lecture: Juliane Reinecke

Inaugural Lecture: Juliane Reinecke

Saïd Business School, University of Oxford

Hands-On Learning: How and Why You Should Build a Home Lab

Hands-On Learning: How and Why You Should Build a Home Lab

SANS Live Online Interactive Remote Lab and Range Demo – SEC599: Defeating Advanced Adversaries

SANS Live Online Interactive Remote Lab and Range Demo – SEC599: Defeating Advanced Adversaries

Does Water Swirl the Other Way in the Southern Hemisphere?

Does Water Swirl the Other Way in the Southern Hemisphere?

Undergraduate Research Forum 2026

Undergraduate Research Forum 2026

Related Reads

I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way

Learn how to effectively find research gaps by changing your approach, a crucial skill for AI researchers and academics

ICMI 2026 Reviews [D]

Learn how to interpret ICMI 2026 reviews and improve your paper's acceptance chances

Reddit r/MachineLearning

Workshop submission for main conference paper under review [D]

Learn how to navigate submitting a paper to a non-archival workshop before the final decision of a main conference like ECCV

Reddit r/MachineLearning

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Streamline your research with a new Chrome extension and website that integrates 3M papers from arxiv, OpenReview, GitHub, and HuggingFace, including citation graphs and SPECTER2 neighbors, and provide feedback to improve it

Reddit r/MachineLearning

Indians Under House Arrest in America? 😱 Immigration Crisis Explained | SumanTV Classroom

SumanTV Classroom