An introduction to Reinforcement Learning

arXiv Insights · Beginner ·📄 Research Papers Explained ·8y ago

Skills: Research Methods90%RL Foundations85%Reading ML Papers80%

This episode gives a general introduction into the field of Reinforcement Learning: - High level description of the field - Policy gradients - Biggest challenges (sparse rewards, reward shaping, ...) This video forms the basis for a series on RL where I will dive much deeper into technical details of state-of-the-art methods for RL. Links: - "Pong from Pixels - Karpathy": http://karpathy.github.io/2016/05/31/rl/ - Concept networks for grasp & stack (Paper with heavy reward shaping): https://arxiv.org/abs/1709.06977 If you enjoy my videos, all support is super welcome! https://www.patreon.com/ArxivInsights If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: https://pensight.com/x/xander-steenbrugge ::Chapters:: 00:00 Intro 01:03 So what is Reinforcement Learning? 03:39 Learning without explicit examples 07:25 Main challenges when doing RL 15:04 Are the robots taking over now?

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: Research Methods

View skill →

Mechanics of Materials III: Beam Bending

Mechanics of Materials III: Beam Bending

Inaugural Lecture: Juliane Reinecke

Inaugural Lecture: Juliane Reinecke

Saïd Business School, University of Oxford

Hands-On Learning: How and Why You Should Build a Home Lab

Hands-On Learning: How and Why You Should Build a Home Lab

SANS Live Online Interactive Remote Lab and Range Demo – SEC599: Defeating Advanced Adversaries

SANS Live Online Interactive Remote Lab and Range Demo – SEC599: Defeating Advanced Adversaries

NVIDIA cuOpt Wins the 2025 COIN-OR Cup

NVIDIA cuOpt Wins the 2025 COIN-OR Cup

NVIDIA Developer

Framework for Data Collection and Analysis

Framework for Data Collection and Analysis

Related AI Lessons

The ABCs of reading medical research and review papers these days

Learn to critically evaluate medical research papers by accepting nothing at face value, believing no one blindly, and checking everything

#1 DevLog Meta-research: I Got Tired of Tab Chaos While Reading Research Papers.

Learn to manage research paper tabs efficiently and apply meta-research techniques to improve productivity

How to Set Up a Karpathy-Style Wiki for Your Research Field

Learn to set up a Karpathy-style wiki for your research field to organize and share knowledge effectively

The Non-Optimality of Scientific Knowledge: Path Dependence, Lock-In, and The Local Minimum Trap

Scientific knowledge may be stuck in a local minimum, hindering optimal progress, and understanding this concept is crucial for advancing research

Chapters (5)

Intro

1:03 So what is Reinforcement Learning?

3:39 Learning without explicit examples

7:25 Main challenges when doing RL

15:04 Are the robots taking over now?

Stanford MS&E435 | Spring 2026 | Economics of Generative AI

Stanford Online