Decision Making and Reinforcement Learning

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Decision Making and Reinforcement Learning

Coursera · Beginner ·📣 Digital Marketing & Growth ·1mo ago
This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Why AI Search (GEO/AEO) Is Eating Traditional SEO — And What Agencies Must Do Now
AI search is replacing traditional SEO, and agencies must adapt to survive, focusing on relevance and user experience over ranking positions
Dev.to AI
Every Telegram conversation becomes a qualified lead. BizNode captures name, email, and business details automatically while...
Automate lead capture from Telegram conversations using BizNode, an AI-powered business operator node
Dev.to AI
My 4 favorite Android Auto settings are seriously useful - but hidden by default
Unlock hidden Android Auto settings to enhance your driving experience
ZDNet
How We Generate 100+ Product Feeds From 300k SKUs Without Hitting the Database
Learn how to generate 100+ product feeds from 300k SKUs without hitting the database, improving scalability and performance
Dev.to · Peter Y
Up next
7 eBook Ideas That Sell Like Crazy Right Now
Sean Dollwet
Watch →