OpenAI's Q*?: Reinforcement Learning, Model-Based vs. Model-Free Methods, and Q-Learning
In this Brev.dev Concepts video, Harper Carroll (Head of AI/ML) covers the basics of reinforcement learning, exploration and exploitation, model-based vs. model-free methods, Q-learning, Q*, and temporal difference learning. It is accessible to those of all backgrounds, and includes a little math for those interested.
Find me on 𝕏: https://twitter.com/HarperSCarroll
Join our community on Discord: https://discord.gg/DndwhY6cjf
AI/ML Tutorial Notebooks: https://github.com/brevdev/notebooks
Intro: (0:00)
Reinforcement Learning: (1:10)
Exploration & Exploitation: (2:00)
Model-Based Methods: (3…
Watch on YouTube ↗
(saves to browser)
DeepCamp AI