How DeepMind Conquered Go With Deep Learning (AlphaGo) | Two Minute Papers #42

Two Minute Papers · Beginner ·📐 ML Fundamentals ·10y ago

Key Takeaways

The video discusses how Google DeepMind's AlphaGo used deep learning to master the game of Go, defeating professional player and European champion Fan Hui in all five games and surpassing the skill level of previous Go programs. It also recalls DeepMind's earlier Atari work, in which an algorithm learned to play from raw pixel input.

Full Transcript

Dear Fellow Scholars, this is Two Minute Papers with Károly Zsolnai-Fehér. In 1997, the news took the world by storm: Garry Kasparov, world champion and grandmaster chess player, was defeated by an artificial intelligence program by the name Deep Blue. In 2011, IBM Watson won first place in the famous American quiz show Jeopardy. In 2014, Google DeepMind created an algorithm that mastered a number of Atari games by working on raw pixel input. This algorithm learned in a similar way as a human would.

This time around, Google DeepMind embarked on a journey to write an algorithm that plays Go. Go is an ancient Chinese board game where the opposing players try to capture each other's stones on the board. Behind the veil of this deceptively simple rule set lies an enormous layer of depth and complexity. As scientists like to say, the search space of this problem is significantly larger than that of chess. So large that one often has to rely on human intuition to find a suitable next move. Therefore, it is not surprising that playing Go on a high level is, or maybe was, widely believed to be intractable for machines.

This chart shows the skill level of previous artificial intelligence programs. The green bar shows the skill level of a professional player used as a reference. The red bars mean that these older techniques required a significant starting advantage to be able to contend with human opponents. As you can see, DeepMind's new program's skill level is well beyond most professional players. An elite pro player and European champion, Fan Hui, was challenged to play AlphaGo, Google DeepMind's newest invention, and got defeated in all five matches they played together. During these games, each turn it took approximately 2 seconds for the algorithm to come up with the next move.

An interesting detail is that these strange black bars show confidence intervals, which means that the smaller they are, the more confident one can be in the validity of the measurements. As one can see, these confidence intervals are much shorter for the artificial intelligence programs than for the human player, likely because one can fire up a machine, let it play a million games, and get a great estimation of its skill level, while the human player can only play a very limited number of matches.

There is still a lot left to be excited for: in March, the algorithm will play a world champion. The rate of improvement in artificial intelligence research is accelerating at a staggering pace. The only question that remains is not if something is possible, but when it will become possible. I wake up every day excited to read the newest breakthroughs in the field, and of course, trying to add some leaves to the tree of knowledge with my own projects. I feel privileged to be alive in such an amazing time.

As always, there's lots of references in the description box, make sure to check them out. Thanks for watching and for your generous support, and I'll see you next time!
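The remark about confidence intervals can be made concrete with a small calculation. The sketch below is not taken from the video or the paper: it assumes, for illustration only, that skill is measured as a simple win rate (the chart in the video uses a rating scale instead) and uses the textbook normal-approximation interval. The function name `win_rate_interval` and the 60% win rate are hypothetical.

```python
import math

def win_rate_interval(wins, games, z=1.96):
    """Approximate 95% confidence interval for a win rate.

    Uses the normal (Wald) approximation, which is rough for very
    small numbers of games but is enough to show the trend.
    """
    p = wins / games
    half_width = z * math.sqrt(p * (1 - p) / games)
    return max(0.0, p - half_width), min(1.0, p + half_width)

# Assumed scenario: a player who wins 60% of the time, observed over
# a handful of games (a human) versus a million games (a machine).
for games in (10, 1_000, 1_000_000):
    wins = games * 6 // 10
    low, high = win_rate_interval(wins, games)
    print(f"{games:>9} games: win rate between {low:.4f} and {high:.4f}")
```

The width of the interval shrinks roughly with the square root of the number of games, which is why a program that can play a very large number of games ends up with a much tighter estimate than a human who plays only a few.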

Original Description

This time around, Google DeepMind embarked on a journey to write an algorithm that plays Go. Go is an ancient Chinese board game where the opposing players try to capture each other's stones on the board. Behind the veil of this deceptively simple ruleset lies an enormous layer of depth and complexity. As scientists like to say, the search space of this problem is significantly larger than that of chess. So large that one often has to rely on human intuition to find a suitable next move. Therefore, it is not surprising that playing Go on a high level is, or maybe was, widely believed to be intractable for machines. The result is Google DeepMind's AlphaGo, the deep learning technique that defeated a professional player and European champion, Fan Hui.
__________________
The paper "Mastering the Game of Go with Deep Neural Networks and Tree Search" is available here:
https://storage.googleapis.com/deepmind-data/assets/papers/deepmind-mastering-go.pdf
http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html
Wired's coverage of AlphaGo:
http://www.wired.com/2016/01/in-a-huge-breakthrough-googles-ai-beats-a-top-player-at-the-game-of-go/
Video coverage from DeepMind and Nature:
https://www.youtube.com/watch?v=g-dKXOlsf98
https://www.youtube.com/watch?v=SUbqykXVx0A
Myungwan Kim analysis:
https://www.youtube.com/watch?v=NHRHUHW6HQE
Photo credits:
Watson - AP Photo/Jeopardy Productions, Inc.
Fan Hui match photo - Google DeepMind - https://www.youtube.com/watch?v=SUbqykXVx0A
Go board image credits (all CC BY 2.0):
Renato Ganoza - https://flic.kr/p/7nX4kK
Jaro Larnos (changes were applied, mostly recoloring) - https://flic.kr/p/dDeQU9
Luis de Bethencourt - https://flic.kr/p/4c5RaR
Detailed analysis of the games against Fan Hui and some more speculation:
https://www.reddit.com/r/MachineLearning/comments/43fl90/synopsis_of_top_go_professionals_analysis_of/
Subscribe if you would like to see more of these! - http://www.youtube.com/subscription_center?add_user

Playlist

Uploads from Two Minute Papers · Two Minute Papers · 48 of 60

1 Fluid Simulations with Blender and Wavelet Turbulence | Two Minute Papers #1
2 Capturing Waves of Light With Femto-photography | Two Minute Papers #2
3 Artificial Neural Networks and Deep Learning | Two Minute Papers #3
4 Blender Rendering - Top 7 LuxRender Features
5 Simulating Breaking Glass | Two Minute Papers #4
6 Time Lapse Videos From Community Photos | Two Minute Papers #5
7 AI Learns Van Gogh's Art
8 Hydrographic Printing | Two Minute Papers #7
9 Announcing LuxRender 1.5
10 Digital Creatures Learn To Walk | Two Minute Papers #8
11 Manipulating Photorealistic Renderings | Two Minute Papers #9
12 Adaptive Fluid Simulations | Two Minute Papers #10
13 Building Bridges With Flying Machines | Two Minute Papers #11
14 Reconstructing Sound From Vibrations | Two Minute Papers #12
15 Creating Photographs Using Deep Learning | Two Minute Papers #13
16 Adaptive Cloth Simulations | Two Minute Papers #14
17 Synthesizing Sound From Collisions | Two Minute Papers #15
18 Metropolis Light Transport | Two Minute Papers #16
19 3D Printing a Glockenspiel | Two Minute Papers #17
20 Modeling Colliding and Merging Fluids | Two Minute Papers #18
21 Recurrent Neural Network Writes Music and Shakespeare Novels | Two Minute Papers #19
22 Gradients, Poisson's Equation and Light Transport | Two Minute Papers #20
23 Real-Time Facial Expression Transfer | Two Minute Papers #21
24 Automatic Lecture Notes From Videos | Two Minute Papers #22
25 Be a Part of Two Minute Papers on Patreon!
26 Recurrent Neural Network Writes Sentences About Images | Two Minute Papers #23
27 How Does Deep Learning Work? | Two Minute Papers #24
28 Cryptography, Perfect Secrecy and One Time Pads | Two Minute Papers #25
29 Terrain Traversal with Reinforcement Learning | Two Minute Papers #26
30 Multiple-Scattering Microfacet BSDFs with the Smith Model
31 Google DeepMind's Deep Q-Learning & Superhuman Atari Gameplays | Two Minute Papers #27
32 Are We Living In a Computer Simulation? | Two Minute Papers #28
33 Artificial Superintelligence [Audio only] | Two Minute Papers #29
34 Automatic Parameter Control for Metropolis Light Transport | Two Minute Papers #30
35 Randomness and Bell's Inequality [Audio only] | Two Minute Papers #31
36 OpenAI - Non-profit AI company by Elon Musk and Sam Altman
37 How Do Genetic Algorithms Work? | Two Minute Papers #32
38 Painting with Fluid Simulations | Two Minute Papers #33
39 Peer Review #1 [Audio only] | Two Minute Papers
40 Neural Programmer-Interpreters Learn To Write Programs | Two Minute Papers #34
41 9 Cool Deep Learning Applications | Two Minute Papers #35
42 Designing Cities and Furnitures With Machine Learning | Two Minute Papers #36
43 Designing 3D Printable Robotic Creatures | Two Minute Papers #37
44 3D Printing Objects With Caustics | Two Minute Papers #38
45 Interactive Editing of Subsurface Scattering | Two Minute Papers #39
46 Simulating Viscosity and Melting Fluids | Two Minute Papers #40
47 What Do Virtual Objects Sound Like? | Two Minute Papers #41
48 How DeepMind Conquered Go With Deep Learning (AlphaGo) | Two Minute Papers #42
49 Breaking Deep Learning Systems With Adversarial Examples | Two Minute Papers #43
50 Extrapolations and Crowdfunded Research (Experiment) | Two Minute Papers #44
51 Biophysical Skin Aging Simulations | Two Minute Papers #45
52 What is Impostor Syndrome? | Two Minute Papers #46
53 Should You Take the Stairs at Work? (For Weight Loss) | Two Minute Papers #47
54 Artistic Manipulation of Caustics | Two Minute Papers #48
55 Deep Learning Program Learns to Paint | Two Minute Papers #49
56 Interactive Photo Recoloring | Two Minute Papers #50
57 How To Get Started With Machine Learning? | Two Minute Papers #51
58 Awesome Research For Everyone! - Two Minute Papers Channel Trailer
59 10 More Cool Deep Learning Applications | Two Minute Papers #52
60 How DeepMind's AlphaGo Defeated Lee Sedol | Two Minute Papers #53

The video explains how AlphaGo used deep learning to master the game of Go, and how this achievement demonstrates the power of artificial intelligence. Viewers can learn about the basics of machine learning and deep learning, and how these concepts can be applied to complex problems. The video also highlights the importance of self-play and reinforcement learning in improving the skill level of AI algorithms.

Key Takeaways
  1. Understand the rules of the game of Go
  2. Learn about the basics of deep learning and neural networks
  3. Design a neural network architecture for playing Go
  4. Train the neural network using self-play and reinforcement learning
  5. Evaluate the performance of the neural network
  6. Apply the concepts learned to other complex problems
💡 The video highlights the importance of self-play and reinforcement learning in improving the skill level of AI algorithms, and demonstrates how these concepts can be applied to complex problems like playing Go.
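
To give a feel for what training through self-play means, here is a minimal sketch on a toy game. It is an illustration under loud assumptions and is not AlphaGo's method: the game is a tiny version of Nim (players alternately take 1 to 3 stones, and whoever takes the last stone wins) instead of Go, and a lookup table with a simple Q-learning style update stands in for deep neural networks and tree search. All names and parameter values here (`train_by_self_play`, `alpha`, `epsilon`, and so on) are hypothetical.

```python
import random

MAX_TAKE = 3  # toy rule: a player removes 1, 2, or 3 stones per turn

def legal_moves(stones):
    return range(1, min(MAX_TAKE, stones) + 1)

def best_value(q, stones):
    # Value of a position for the player about to move.
    return max(q[(stones, move)] for move in legal_moves(stones))

def greedy_move(q, stones):
    return max(legal_moves(stones), key=lambda move: q[(stones, move)])

def train_by_self_play(max_stones=10, episodes=5000, alpha=0.5,
                       epsilon=0.3, seed=0):
    rng = random.Random(seed)
    q = {(s, m): 0.0
         for s in range(1, max_stones + 1) for m in legal_moves(s)}
    for _ in range(episodes):
        stones = rng.randint(1, max_stones)
        # The same table chooses moves for both sides: that is the self-play.
        while stones > 0:
            if rng.random() < epsilon:
                move = rng.choice(list(legal_moves(stones)))  # explore
            else:
                move = greedy_move(q, stones)                 # exploit
            remaining = stones - move
            # Taking the last stone wins (+1). Otherwise the opponent moves
            # next, so our value is the negative of their best value.
            target = 1.0 if remaining == 0 else -best_value(q, remaining)
            q[(stones, move)] += alpha * (target - q[(stones, move)])
            stones = remaining
    return q

q_table = train_by_self_play()
for stones in range(1, 11):
    print(f"{stones} stones left -> take {greedy_move(q_table, stones)}")
```

In this toy game the known winning strategy is to leave the opponent a multiple of four stones, and the table tends to rediscover that rule purely from games against itself. Go is far too large for a table like this, which is where the learned networks described in the paper come in.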
