Foundations

Reinforcement Learning

RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL

36 lessons

Skills in this topic

3 skills

Formalise a problem as an MDP

Policy Gradient Methods

Implement REINFORCE from scratch

RLHF & Alignment

Describe the RLHF pipeline end-to-end

Videos 19 Reads 17

Understanding Reinforcement Learning with Prime Intellect and Unsloth | Nemotron Labs

NVIDIA Developer Advanced 1mo ago

Huggingface TRL vs Unsloth RL: Reinforcement Learning Frameworks. How to fine tuning LLMs - Gemma 4

Byte Goose AI. Advanced 2mo ago

Supervised vs Unsupervised vs Reinforcement Learning

Analytics Vidhya Beginner 2mo ago

Tour C El Nido: A Guide to Secret Beach, Talisay Beach & Snorkeling Adventures | Philippine

ConnollyCove Beginner 1y ago

Unlocking Innovation with Vultr: Navigating the Vultr Hackathon

GeeksforGeeks Beginner 1y ago

How to participate and win any hackathon | A complete guide

GeeksforGeeks Beginner 1y ago

Reinforcement Learning Course - Full Machine Learning Tutorial

freeCodeCamp.org Beginner 7y ago

Habitual negative thoughts

Fun Fun Function Beginner 8y ago

Geek-O-Lympics 2023 | 1st - 31st July | GeeksforGeeks

GeeksforGeeks Beginner 2y ago

Special Rewards In #100 (Link in Comments)

GeeksforGeeks Intermediate 3y ago

Start your writing Journey | Geek Author Badges | GeeksforGeeks

GeeksforGeeks Beginner 3y ago

Correct Your Mistakes in 1 minute | Data Science | GeeksforGeeks

GeeksforGeeks Beginner 3y ago

Write an ARTICLE and win assured REWARDS | Technical Scripter Event 2022

GeeksforGeeks Intermediate 3y ago

Geek-O-Lympics 2022 LIVE Now | GeeksforGeeks

GeeksforGeeks Intermediate 3y ago

Dispelling Myths and Pre conceptions of Programming Languages

GeeksforGeeks Beginner 4y ago

Web Scraping in Action | Geeks Summer Carnival 2022 | GeeksforGeeks

GeeksforGeeks Beginner 4y ago

Introduction to Open Source and Roadmap to GSOC 2022 | Geeks Summer Carnival 2022 | GeeksforGeeks

GeeksforGeeks Beginner 4y ago

Geeks Summer Carnival 2022 | 5th April- 11th April | GeeksforGeeks

GeeksforGeeks Intermediate 4y ago

Lets Prepare for GATE'23 the Right Way | Sakshi Singhal | GeekSummerCarnival

GeeksforGeeks Beginner 4y ago

📚 Continue on Coursera External links · Free to audit

Total Rewards and Employee Development
