3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in game-of-hanabi-7
No benchmarks available.
Use these libraries to find game-of-hanabi-7 models and implementations
No subtasks available.
This paper proposes two different search techniques that can be applied to improve an arbitrary agreed-upon policy in a cooperative partially observable game and proves that these search procedures are theoretically guaranteed to at least maintain the original performance of the agreed-Upon policy (up to a bounded approximation error).
It is argued that Hanabi elevates reasoning about the beliefs and intentions of other agents to the foreground and developing novel techniques for such theory of mind reasoning will not only be crucial for success in Hanabi, but also in broader collaborative efforts, especially those with human partners.
This work formally defined a framework based on a popular cooperative multi-agent game called Hanabi to evaluate the adaptability of MARL methods and defined a new metric called adaptation regret that measures the agent's ability to efficiently adapt and improve its coordination performance when paired with some held-out pool of partners on top of its ZSC performance.
Adding a benchmark result helps the community track progress.