NetHack Score

Mean in-game score over 1000 episodes with random seeds not seen during training. See https://arxiv.org/abs/2006.13760 (Section 2.4 Evaluation Protocol) for details.
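
The protocol above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's evaluation code: `held_out_seeds`, `mean_score`, and `dummy_episode` are hypothetical names, the training seed set is made up, and `dummy_episode` stands in for actually playing one NetHack episode with a trained agent and returning its in-game score.

```python
import random
import statistics
from typing import Callable, Iterable, List, Set


def held_out_seeds(n: int, training_seeds: Set[int], rng: random.Random) -> List[int]:
    """Draw n distinct random seeds that were not used during training."""
    seen = set(training_seeds)
    seeds: List[int] = []
    while len(seeds) < n:
        candidate = rng.randrange(2**31)
        if candidate not in seen:
            seen.add(candidate)  # also prevents duplicates among eval seeds
            seeds.append(candidate)
    return seeds


def mean_score(run_episode: Callable[[int], float], seeds: Iterable[int]) -> float:
    """Average the final in-game score over one episode per seed."""
    return statistics.fmean([run_episode(seed) for seed in seeds])


def dummy_episode(seed: int) -> float:
    """Hypothetical stand-in for one episode; returns a fake score."""
    return float(random.Random(seed).randint(0, 1000))


# Assumption: training used seeds 0..9999; evaluation uses 1000 unseen seeds.
training_seeds = set(range(10_000))
eval_seeds = held_out_seeds(1000, training_seeds, random.Random(0))
score = mean_score(dummy_episode, eval_seeds)
print(f"Mean score over {len(eval_seeds)} held-out episodes: {score:.1f}")
```

In a real evaluation, `dummy_episode` would be replaced by a function that seeds the environment, runs the agent until the episode ends, and returns the final in-game score.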

Benchmarks

These leaderboards are used to track progress in NetHack Score.

Dataset: NetHack Learning Environment

Datasets

NetHack Learning Environment

Most implemented papers

The NetHack Learning Environment

Nantas Nardelli, Edward Grefenstette, Heinrich Küttler, Alexander H. Miller, Roberta Raileanu, Marco Selvatici, Tim Rocktäschel • Sun May 31 2020

The authors argue that NetHack is sufficiently complex to drive long-term research on problems such as exploration, planning, skill acquisition, and language-conditioned RL, while dramatically reducing the computational resources required to gather a large amount of experience.
