Foundations

Reinforcement Learning

RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL

36
lessons
Skills in this topic
View full skill map →
RL Foundations
beginner
Formalise a problem as an MDP
Policy Gradient Methods
intermediate
Implement REINFORCE from scratch
RLHF & Alignment
advanced
Describe the RLHF pipeline end-to-end
Understanding Reinforcement Learning with Prime Intellect and Unsloth | Nemotron Labs
Reinforcement Learning
Understanding Reinforcement Learning with Prime Intellect and Unsloth | Nemotron Labs
NVIDIA Developer Advanced 1mo ago
Huggingface TRL vs Unsloth RL: Reinforcement Learning Frameworks. How to fine tuning LLMs - Gemma 4
Reinforcement Learning
Huggingface TRL vs Unsloth RL: Reinforcement Learning Frameworks. How to fine tuning LLMs - Gemma 4
Byte Goose AI. Advanced 2mo ago
Supervised vs Unsupervised vs Reinforcement Learning
Reinforcement Learning
Supervised vs Unsupervised vs Reinforcement Learning
Analytics Vidhya Beginner 2mo ago
Tour C El Nido: A Guide to Secret Beach, Talisay Beach & Snorkeling Adventures | Philippine
Reinforcement Learning
Tour C El Nido: A Guide to Secret Beach, Talisay Beach & Snorkeling Adventures | Philippine
ConnollyCove Beginner 1y ago
Unlocking Innovation with Vultr: Navigating the Vultr Hackathon
Reinforcement Learning
Unlocking Innovation with Vultr: Navigating the Vultr Hackathon
GeeksforGeeks Beginner 1y ago
How to participate and win any hackathon | A complete guide
Reinforcement Learning
How to participate and win any hackathon | A complete guide
GeeksforGeeks Beginner 1y ago
Reinforcement Learning Course - Full Machine Learning Tutorial
Reinforcement Learning
Reinforcement Learning Course - Full Machine Learning Tutorial
freeCodeCamp.org Beginner 7y ago
Habitual negative thoughts
Reinforcement Learning
Habitual negative thoughts
Fun Fun Function Beginner 8y ago
Geek-O-Lympics 2023 | 1st - 31st July | GeeksforGeeks
Reinforcement Learning
Geek-O-Lympics 2023 | 1st - 31st July | GeeksforGeeks
GeeksforGeeks Beginner 2y ago
Special Rewards In #100 (Link in Comments)
Reinforcement Learning
Special Rewards In #100 (Link in Comments)
GeeksforGeeks Intermediate 3y ago
Start your writing Journey | Geek Author Badges | GeeksforGeeks
Reinforcement Learning
Start your writing Journey | Geek Author Badges | GeeksforGeeks
GeeksforGeeks Beginner 3y ago
Correct Your Mistakes in 1 minute | Data Science | GeeksforGeeks
Reinforcement Learning
Correct Your Mistakes in 1 minute | Data Science | GeeksforGeeks
GeeksforGeeks Beginner 3y ago
Write an ARTICLE and win assured REWARDS | Technical Scripter Event 2022
Reinforcement Learning
Write an ARTICLE and win assured REWARDS | Technical Scripter Event 2022
GeeksforGeeks Intermediate 3y ago
Geek-O-Lympics 2022 LIVE Now | GeeksforGeeks
Reinforcement Learning
Geek-O-Lympics 2022 LIVE Now | GeeksforGeeks
GeeksforGeeks Intermediate 3y ago
Dispelling Myths and Pre conceptions of Programming Languages
Reinforcement Learning
Dispelling Myths and Pre conceptions of Programming Languages
GeeksforGeeks Beginner 4y ago
Web Scraping in Action | Geeks Summer Carnival 2022 | GeeksforGeeks
Reinforcement Learning
Web Scraping in Action | Geeks Summer Carnival 2022 | GeeksforGeeks
GeeksforGeeks Beginner 4y ago
Introduction to Open Source and Roadmap to GSOC 2022 | Geeks Summer Carnival 2022 | GeeksforGeeks
Reinforcement Learning
Introduction to Open Source and Roadmap to GSOC 2022 | Geeks Summer Carnival 2022 | GeeksforGeeks
GeeksforGeeks Beginner 4y ago
Geeks Summer Carnival 2022 | 5th April- 11th April | GeeksforGeeks
Reinforcement Learning
Geeks Summer Carnival 2022 | 5th April- 11th April | GeeksforGeeks
GeeksforGeeks Intermediate 4y ago
Lets Prepare for GATE'23 the Right Way | Sakshi Singhal | GeekSummerCarnival
Reinforcement Learning
Lets Prepare for GATE'23 the Right Way | Sakshi Singhal | GeekSummerCarnival
GeeksforGeeks Beginner 4y ago