🎮 Reinforcement Learning
RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL
📚 Continue on Coursera
External links · Free to audit
RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL