Prediction and Control with Function Approximation

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Prediction and Control with Function Approximation

Coursera · Intermediate ·📐 ML Fundamentals ·1mo ago
In this course, you will learn how to solve problems with large, high-dimensional, and potentially infinite state spaces. You will see that estimating value functions can be cast as a supervised learning problem---function approximation---allowing you to build agents that carefully balance generalization and discrimination in order to maximize reward. We will begin this journey by investigating how our policy evaluation or prediction methods like Monte Carlo and TD can be extended to the function approximation setting. You will learn about feature construction techniques for RL, and representation learning via neural networks and backprop. We conclude this course with a deep-dive into policy gradient methods; a way to learn policies directly without learning a value function. In this course you will solve two continuous-state control tasks and investigate the benefits of policy gradient methods in a continuous-action environment. Prerequisites: This course strongly builds on the fundamentals of Courses 1 and 2, and learners should have completed these before starting this course. Learners should also be comfortable with probabilities & expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), and implementing algorithms from pseudocode. By the end of this course, you will be able to: -Understand how to use supervised learning approaches to approximate value functions -Understand objectives for prediction (value estimation) under function approximation -Implement TD with function approximation (state aggregation), on an environment with an infinite state space (continuous state space) -Understand fixed basis and neural network approaches to feature construction -Implement TD with neural network function approximation in a continuous state environment -Understand new difficulties in exploration when moving to function approximation -Contrast discounted problem formulations for control versus an average reward problem formulation -Implem
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

The Threshold Is a Business Decision, Not a Statistical One
Learn how to build a production-grade fraud detection system and why statistical thresholds are business decisions, not just statistical ones
Medium · Machine Learning
Can Your Stress Level Predict How Much You Sleep?
Explore the relationship between stress levels and sleep patterns using data analysis and machine learning techniques to uncover hidden patterns
Medium · Machine Learning
Role of Model Architecture In Inference — Inference Series
Learn how generative AI architecture impacts inference system design and why it matters for efficient model deployment
Medium · Machine Learning
Role of Model Architecture In Inference — Inference Series
Learn how model architecture impacts inference system design in generative AI
Medium · Deep Learning
Up next
Generative Artificial Intelligence Full Course 2026 | Gen AI Tutorial For Beginners | Simplilearn
Simplilearn
Watch →