OpenAI's Q*?: Reinforcement Learning, Model-Based vs. Model-Free Methods, and Q-Learning

Brev · Beginner ·🎨 Image & Video AI ·2y ago
In this Brev.dev Concepts video, Harper Carroll (Head of AI/ML) covers the basics of reinforcement learning, exploration and exploitation, model-based vs. model-free methods, Q-learning, Q*, and temporal difference learning. It is accessible to those of all backgrounds, and includes a little math for those interested. Find me on 𝕏: https://twitter.com/HarperSCarroll Join our community on Discord: https://discord.gg/DndwhY6cjf AI/ML Tutorial Notebooks: https://github.com/brevdev/notebooks Intro: (0:00) Reinforcement Learning: (1:10) Exploration & Exploitation: (2:00) Model-Based Methods: (3:36) Model-Free Methods: (4:26) Temporal Difference Learning (estimating Q): (4:36) Q-Learning: (6:16) Q* at OpenAI?: (7:46) Conclusion: (8:24)
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

What makes an AI image workflow useful for real commercial output?
Learn how to create a useful AI image workflow for commercial output, focusing on repeatability, versatility, and clarity
Dev.to AI
How to Write Better AI Image Prompts for Midjourney (With Examples That Actually Work)
Learn to write effective AI image prompts for Midjourney with actionable examples and techniques
Medium · ChatGPT
Image to Video AI: The Complete Workflow Playbook That Actually Produces Results
Learn a step-by-step workflow for image-to-video AI that produces results, from preparation to delivery
Medium · AI
Image Harvest v1.0.2: Internationalization, Free Pro Trial & Quality-of-Life Improvements
Learn about Image Harvest v1.0.2, a Chrome extension with internationalization, free pro trial, and quality-of-life improvements, and how to utilize it for privacy-first image extraction
Dev.to · kyriewen
Up next
Krea 2 makes Diffusion FUN Again!
MattVidPro
Watch →