By defeating the Dota 2 world champion (Team OG), OpenAI Five demonstrates that self-play reinforcement learning can achieve superhuman performance on a difficult task.
On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems. OpenAI Five leveraged existing reinforcement learning techniques, scaled to learn from batches of approximately 2 million frames every 2 seconds. We developed a distributed training system and tools for continual training which allowed us to train OpenAI Five for 10 months. By defeating the Dota 2 world champion (Team OG), OpenAI Five demonstrates that self-play reinforcement learning can achieve superhuman performance on a difficult task.
Brooke Chan
3 papers
Vicki Cheung
3 papers
Przemyslaw Debiak
1 papers
Christy Dennison
1 papers
David Farhi
2 papers
Quirin Fischer
1 papers
Shariq Hashme
1 papers
Christopher Hesse
5 papers
R. Józefowicz
3 papers
Scott Gray
7 papers
Catherine Olsson
4 papers
J. Pachocki
4 papers
Michael Petrov
3 papers
Henrique Pondé de Oliveira Pinto
2 papers
Jonathan Raiman
3 papers
Tim Salimans
8 papers
Jeremy Schlatter
1 papers
Jonas Schneider
4 papers
Szymon Sidor
3 papers
Jie Tang
8 papers
Filip Wolski
2 papers
Susan Zhang
3 papers