RL Foundations
Understand MDPs, the Bellman equation, and basic Q-learning.
0%
Confidence · no data yet
After this skill you can…
- Formalise a problem as an MDP
- Implement tabular Q-learning on CartPole
- Explain exploration vs exploitation tradeoff
DeepCamp AI