Reinforcement Learning: AlphaGo
How AlphaGo works, based on Reinforcement Learning.
Part 2 of RL from scratch series.
https://youtu.be/vXtfdGphr3c
0:00 - intro
0:06 - how to play Go
0:21 - introducing alphaGo
0:46 - analyzing expert games
2:17 - training an expert policy
2:47 - value functions
4:05 - search trees
5:42 - reinforcement learning
6:17 - alphaGo's value function
7:47 - alphaZero
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: RL Foundations
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
Meta Is in Crisis, Google Search’s Makeover, and AI Gets Booed by Graduates
Wired AI
What I Do Between Biotech Jobs, Part 1: The 20-Line Script That Outsmarted an AI
Medium · AI
Spotify and Universal Music strike deal allowing fan-made AI covers and remixes
TechCrunch AI
Google Just Turned Search Into Something It Has Never Been Before
Medium · AI
Chapters (10)
intro
0:06
how to play Go
0:21
introducing alphaGo
0:46
analyzing expert games
2:17
training an expert policy
2:47
value functions
4:05
search trees
5:42
reinforcement learning
6:17
alphaGo's value function
7:47
alphaZero
🎓
Tutor Explanation
DeepCamp AI