OpenAI's Q*?: Reinforcement Learning, Model-Based vs. Model-Free Methods, and Q-Learning
Skills:
RL Foundations90%
In this Brev.dev Concepts video, Harper Carroll (Head of AI/ML) covers the basics of reinforcement learning, exploration and exploitation, model-based vs. model-free methods, Q-learning, Q*, and temporal difference learning. It is accessible to those of all backgrounds, and includes a little math for those interested.
Find me on 𝕏: https://twitter.com/HarperSCarroll
Join our community on Discord: https://discord.gg/DndwhY6cjf
AI/ML Tutorial Notebooks: https://github.com/brevdev/notebooks
Intro: (0:00)
Reinforcement Learning: (1:10)
Exploration & Exploitation: (2:00)
Model-Based Methods: (3:36)
Model-Free Methods: (4:26)
Temporal Difference Learning (estimating Q): (4:36)
Q-Learning: (6:16)
Q* at OpenAI?: (7:46)
Conclusion: (8:24)
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: RL Foundations
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
What makes an AI image workflow useful for real commercial output?
Dev.to AI
How to Write Better AI Image Prompts for Midjourney (With Examples That Actually Work)
Medium · ChatGPT
Image to Video AI: The Complete Workflow Playbook That Actually Produces Results
Medium · AI
Image Harvest v1.0.2: Internationalization, Free Pro Trial & Quality-of-Life Improvements
Dev.to · kyriewen
🎓
Tutor Explanation
DeepCamp AI