Policy Gradient Methods
Implement policy gradient algorithms — REINFORCE, PPO, and Actor-Critic.
0%
Confidence · no data yet
After this skill you can…
- Implement REINFORCE from scratch
- Train a PPO agent with Stable-Baselines3
- Explain the advantage function in Actor-Critic
Prerequisites
Watch (10 videos)
Proximal Policy Optimization Implementation: 9 Atari-specific Details (2/3)
→ Use policy gradient methods for PPO
Implementing DeepMind's DQN from scratch! | Project Update
→ Develop policy gradient methods→ Improve reinforcement learning models
Reinforcement Learning Course: Intro to Advanced Actor Critic Methods
→ Apply policy gradient methods→ Optimize policies in reinforcement learning
An introduction to Policy Gradient methods - Deep Reinforcement Learning
→ Implement PPO in PyTorch or TensorFlow→ Analyze the trade-offs between sample efficiency and code complexity
Build a board game app with policy gradient (Reinforcement learning with TensorFlow Agents)
→ Implement policy gradient reinforcement learning→ Use TensorFlow Agents for policy-based algorithms
Proximal Policy Optimization | ChatGPT uses this
→ Apply policy gradient methods in a Reinforcement Learning algorithm
Policy Gradient in One Minute
→ Apply Policy Gradient methods to real-world problems→ Analyze GAE and TRPO algorithms
Cutting-Edge Topics in Deep Reinforcement Learning
→ Apply advanced exploration strategies in RL→ Optimize RL policies using black-box optimization
Lightning Talk: TorchRL - RLHF Support - Vincent Moens, Meta
→ Apply policy gradient methods→ Use TorchRL for RL tasks
Research talk: Safe reinforcement learning using advantage-based intervention
→ Develop policy gradient methods→ Optimize policies for safe rl
Read (10 articles)
📄
DeepCamp AI