Reinforcement Learning from scratch
How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.
0:00 - intro
0:13 - pong
0:28 - the policy
0:51 - policy as neural network
1:32 - supervised learning
2:51 - reinforcement learning using policy gradient
4:24 - minimizing error using gradient descent
4:45 - probabilistic policy
5:01 - pong from pixels
6:58 - visualizing learned weights
8:18 - pointer to Karpathy "pong from pixels" blogpost
Watch on YouTube ↗
(saves to browser)
Chapters (11)
intro
0:13
pong
0:28
the policy
0:51
policy as neural network
1:32
supervised learning
2:51
reinforcement learning using policy gradient
4:24
minimizing error using gradient descent
4:45
probabilistic policy
5:01
pong from pixels
6:58
visualizing learned weights
8:18
pointer to Karpathy "pong from pixels" blogpost
DeepCamp AI