🎮 Reinforcement Learning
RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL