An introduction to Reinforcement Learning

arXiv Insights · Beginner ·📄 Research Papers Explained ·8y ago
This episode gives a general introduction into the field of Reinforcement Learning: - High level description of the field - Policy gradients - Biggest challenges (sparse rewards, reward shaping, ...) This video forms the basis for a series on RL where I will dive much deeper into technical details of state-of-the-art methods for RL. Links: - "Pong from Pixels - Karpathy": http://karpathy.github.io/2016/05/31/rl/ - Concept networks for grasp & stack (Paper with heavy reward shaping): https://arxiv.org/abs/1709.06977 If you enjoy my videos, all support is super welcome! https://www.patreon.com/ArxivInsights If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: https://pensight.com/x/xander-steenbrugge ::Chapters:: 00:00 Intro 01:03 So what is Reinforcement Learning? 03:39 Learning without explicit examples 07:25 Main challenges when doing RL 15:04 Are the robots taking over now?
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

The ABCs of reading medical research and review papers these days
Learn to critically evaluate medical research papers by accepting nothing at face value, believing no one blindly, and checking everything
Medium · LLM
#1 DevLog Meta-research: I Got Tired of Tab Chaos While Reading Research Papers.
Learn to manage research paper tabs efficiently and apply meta-research techniques to improve productivity
Dev.to AI
How to Set Up a Karpathy-Style Wiki for Your Research Field
Learn to set up a Karpathy-style wiki for your research field to organize and share knowledge effectively
Medium · AI
The Non-Optimality of Scientific Knowledge: Path Dependence, Lock-In, and The Local Minimum Trap
Scientific knowledge may be stuck in a local minimum, hindering optimal progress, and understanding this concept is crucial for advancing research
ArXiv cs.AI

Chapters (5)

Intro
1:03 So what is Reinforcement Learning?
3:39 Learning without explicit examples
7:25 Main challenges when doing RL
15:04 Are the robots taking over now?
Up next
Stanford MS&E435 | Spring 2026 | Economics of Generative AI
Stanford Online
Watch →