Reinforcement Learning from scratch

Graphics in 5 Minutes · Beginner ·📐 ML Fundamentals ·2y ago

Skills: RL Foundations90%

How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT. Part 1 of 3. 0:00 - intro 0:13 - pong 0:28 - the policy 0:51 - policy as neural network 1:32 - supervised learning 2:51 - reinforcement learning using policy gradient 4:24 - minimizing error using gradient descent 4:45 - probabilistic policy 5:01 - pong from pixels 6:58 - visualizing learned weights 8:18 - pointer to Karpathy "pong from pixels" blogpost

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: RL Foundations

View skill →

Build a Doom AI Model with Python | Gaming Reinforcement Learning Full Course

Build a Doom AI Model with Python | Gaming Reinforcement Learning Full Course

Nicholas Renotte

Deep Reinforcement Learning for Atari Games Python Tutorial | AI Plays Space Invaders

Deep Reinforcement Learning for Atari Games Python Tutorial | AI Plays Space Invaders

Nicholas Renotte

Training & Testing Deep reinforcement learning (DQN) Agent - Reinforcement Learning p.6

Training & Testing Deep reinforcement learning (DQN) Agent - Reinforcement Learning p.6

Build a Game Bot (LIVE)

Build a Game Bot (LIVE)

How to Win Slot Machines - Intro to Deep Learning #13

How to Win Slot Machines - Intro to Deep Learning #13

Build an Mario AI Model with Python | Gaming Reinforcement Learning

Build an Mario AI Model with Python | Gaming Reinforcement Learning

Nicholas Renotte

Related AI Lessons

Six Choices Every AI Engineer Has to Make (and Nobody Teaches)

AI engineers face crucial production trade-offs when deploying models, and understanding these choices is vital for success

Towards Data Science

Predicting Satellite Collisions: How Machine Learning is Saving Earth’s Orbit

Machine learning helps predict satellite collisions in Earth's orbit, preventing catastrophic consequences

Medium · Machine Learning

Predicting Satellite Collisions: How Machine Learning is Saving Earth’s Orbit

Learn how machine learning is used to predict satellite collisions and save Earth's orbit from catastrophic consequences

Medium · Data Science

Swiggy Improves Search Autocomplete Using Real Time Machine Learning Ranking

Learn how Swiggy improved search autocomplete using real-time machine learning ranking, enabling continuous model updates and strict latency constraints

Chapters (11)

intro

0:13 pong

0:28 the policy

0:51 policy as neural network

1:32 supervised learning

2:51 reinforcement learning using policy gradient

4:24 minimizing error using gradient descent

4:45 probabilistic policy

5:01 pong from pixels

6:58 visualizing learned weights

8:18 pointer to Karpathy "pong from pixels" blogpost

Top 5 Machine Learning Courses In 2016 | Best Machine Learning Courses Online | #Shorts #Simplilearn