Reinforcement Learning from scratch

Graphics in 5 Minutes · Beginner ·📐 ML Fundamentals ·2y ago
How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT. Part 1 of 3. 0:00 - intro 0:13 - pong 0:28 - the policy 0:51 - policy as neural network 1:32 - supervised learning 2:51 - reinforcement learning using policy gradient 4:24 - minimizing error using gradient descent 4:45 - probabilistic policy 5:01 - pong from pixels 6:58 - visualizing learned weights 8:18 - pointer to Karpathy "pong from pixels" blogpost
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Six Choices Every AI Engineer Has to Make (and Nobody Teaches)
AI engineers face crucial production trade-offs when deploying models, and understanding these choices is vital for success
Towards Data Science
Predicting Satellite Collisions: How Machine Learning is Saving Earth’s Orbit
Machine learning helps predict satellite collisions in Earth's orbit, preventing catastrophic consequences
Medium · Machine Learning
Predicting Satellite Collisions: How Machine Learning is Saving Earth’s Orbit
Learn how machine learning is used to predict satellite collisions and save Earth's orbit from catastrophic consequences
Medium · Data Science
Swiggy Improves Search Autocomplete Using Real Time Machine Learning Ranking
Learn how Swiggy improved search autocomplete using real-time machine learning ranking, enabling continuous model updates and strict latency constraints
InfoQ AI/ML

Chapters (11)

intro
0:13 pong
0:28 the policy
0:51 policy as neural network
1:32 supervised learning
2:51 reinforcement learning using policy gradient
4:24 minimizing error using gradient descent
4:45 probabilistic policy
5:01 pong from pixels
6:58 visualizing learned weights
8:18 pointer to Karpathy "pong from pixels" blogpost
Up next
Top 5 Machine Learning Courses In 2016 | Best Machine Learning Courses Online | #Shorts #Simplilearn
Simplilearn
Watch →