The StarCraft Multi-Agent Challenge

Published in

Adaptive Agents and Multi-Agent Systems(2019)

External Links:

Generate Graph

TL;DR

The StarCraft Multi-Agent Challenge (SMAC), based on the popular real-time strategy game StarCraft II, is proposed as a benchmark problem and an open-source deep multi-agent RL learning framework including state-of-the-art algorithms is opened.

Abstract

In the last few years, deep multi-agent reinforcement learning (RL) has become a highly active area of research. A particularly challenging class of problems in this area is partially observable, cooperative, multi-agent learning, in which teams of agents must learn to coordinate their behaviour while conditioning only on their private observations. This is an attractive research area since such problems are relevant to a large number of real-world systems and are also more amenable to evaluation than general-sum problems. Standardised environments such as the ALE and MuJoCo have allowed single-agent RL to move beyond toy domains, such as grid worlds. However, there is no comparable benchmark for cooperative multi-agent RL. As a result, most papers in this field use one-off toy problems, making it difficult to measure real progress. In this paper, we propose the StarCraft Multi-Agent Challenge (SMAC) as a benchmark problem to fill this gap. SMAC is based on the popular real-time strategy game StarCraft II and focuses on micromanagement challenges where each unit is controlled by an independent agent that must act based on local observations. We offer a diverse set of challenge maps and recommendations for best practices in benchmarking and evaluations. We also open-source a deep multi-agent RL learning framework including state-of-the-art algorithms. We believe that SMAC can provide a standard benchmark environment for years to come. Videos of our best agents for several SMAC scenarios are available at: https://youtu.be/VZ7zmQ_obZ0.

Authors

Tabish Rashid

5 papers

Mikayel Samvelyan

7 papers

C. S. D. Witt

6 papers

References37 items

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning

Multi-Agent Common Knowledge Reinforcement Learning

On Reinforcement Learning for Full-length Game of StarCraft

Pommerman: A Multi-Agent Playground

The StarCraft Multi-Agent Challenge

Published in

Adaptive Agents and Multi-Agent Systems(2019)

External Links:

Generate Graph

TL;DR

Abstract

Authors

Tabish Rashid

5 papers

Mikayel Samvelyan

7 papers

C. S. D. Witt

6 papers

References37 items

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning

Multi-Agent Common Knowledge Reinforcement Learning

On Reinforcement Learning for Full-length Game of StarCraft

Pommerman: A Multi-Agent Playground

Gregory Farquhar

5 papers

Jakob N. Foerster

8 papers

Shimon Whiteson

13 papers

Philip H. S. Torr

52 papers

Nantas Nardelli

9 papers

Tim G. J. Rudner

1 papers

Chia-Man Hung

1 papers

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game

Knowledge-Guided Agent-Tactic-Aware Learning for StarCraft Micromanagement

StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

Value Propagation Networks

Mean Field Multi-Agent Reinforcement Learning

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

StarCraft II: A New Challenge for Reinforcement Learning

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Counterfactual Multi-Agent Policy Gradients

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Multi-agent Reinforcement Learning in Sequential Social Dilemmas

TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games

Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks

A Concise Introduction to Decentralized POMDPs

Multi-agent reinforcement learning as a rehearsal for decentralized planning

Deep Reinforcement Learning from Self-Play in Imperfect-Information Games

Multiagent cooperation and competition with deep reinforcement learning

Deep Recurrent Q-Learning for Partially Observable MDPs

A Survey of Real-Time Strategy Game AI Research and Competition in StarCraft

The Arcade Learning Environment: An Evaluation Platform for General Agents

Optimal and Approximate Q-value Functions for Decentralized POMDPs

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study

RoboCup: The Robot World Cup Initiative

Half Field Offense: An Environment for Multiagent Learning and Ad Hoc Teamwork

New blizzard custom game: Starcraft master

Keepaway Soccer: From Machine Learning Testbed to Benchmark

Multi Agent Reinforcement Learning Independent vs Cooperative Agents

Alphastar : Mastering the real - time strategy game starcraft ii , 2019

Field of Study

Computer ScienceMathematics

Venue Information

Name

Adaptive Agents and Multi-Agent Systems

Type

conference

URL

http://www.ifaamas.org/

Alternate Names

Adapt Agent Multi-agent Syst
International Joint Conference on Autonomous Agents & Multiagent Systems
Adapt Agent Multi-agents Syst
AAMAS
Adaptive Agents and Multi-Agents Systems
Int Jt Conf Auton Agent Multiagent Syst

TL;DR

Abstract

Authors

References37 items

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning

Multi-Agent Common Knowledge Reinforcement Learning

On Reinforcement Learning for Full-length Game of StarCraft

Pommerman: A Multi-Agent Playground

TL;DR

Abstract

Authors

References37 items

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning

Multi-Agent Common Knowledge Reinforcement Learning

On Reinforcement Learning for Full-length Game of StarCraft

Pommerman: A Multi-Agent Playground

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game

Knowledge-Guided Agent-Tactic-Aware Learning for StarCraft Micromanagement

StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

Value Propagation Networks

Mean Field Multi-Agent Reinforcement Learning

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

StarCraft II: A New Challenge for Reinforcement Learning

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Counterfactual Multi-Agent Policy Gradients

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Multi-agent Reinforcement Learning in Sequential Social Dilemmas

TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games

Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks

A Concise Introduction to Decentralized POMDPs

Multi-agent reinforcement learning as a rehearsal for decentralized planning

Deep Reinforcement Learning from Self-Play in Imperfect-Information Games

Multiagent cooperation and competition with deep reinforcement learning

Deep Recurrent Q-Learning for Partially Observable MDPs

A Survey of Real-Time Strategy Game AI Research and Competition in StarCraft

The Arcade Learning Environment: An Evaluation Platform for General Agents

Optimal and Approximate Q-value Functions for Decentralized POMDPs

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study

RoboCup: The Robot World Cup Initiative

DeepMind

Half Field Offense: An Environment for Multiagent Learning and Ad Hoc Teamwork

New blizzard custom game: Starcraft master

Keepaway Soccer: From Machine Learning Testbed to Benchmark

Multi Agent Reinforcement Learning Independent vs Cooperative Agents

Alphastar : Mastering the real - time strategy game starcraft ii , 2019

Field of Study

Venue Information

Name

Type

URL

Alternate Names