Foundations
Reinforcement Learning
RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL
Skills in this topic
3 skills — Sign in to track your progress
📚 Continue on Coursera
External links · Free to audit
DeepCamp AI