OpenAI Five

📰 OpenAI News

OpenAI Five, a team of five neural networks, defeats amateur human teams at Dota 2 using self-play and reinforcement learning

advanced Published 25 Jun 2018

Action Steps

Train a neural network using self-play and reinforcement learning
Use a scaled-up version of Proximal Policy Optimization running on multiple GPUs and CPU cores
Implement a separate LSTM for each hero to learn recognizable strategies
Benchmark progress by hosting matches against top players

Who Needs to Know This

AI engineers and researchers benefit from understanding the application of reinforcement learning and self-play in complex games like Dota 2, while product managers and entrepreneurs can learn from the potential of AI in esports

Key Insight

💡 Reinforcement learning can yield long-term planning with large but achievable scale without fundamental advances