An open-source toolkit from OpenAI that implements several reinforcement learning benchmarks, including classic control, Atari, robotics, and MuJoCo tasks. (Description from "Evolutionary learning of interpretable decision trees".)
This paper builds on Double Q-learning by taking the minimum value between a pair of critics to limit overestimation, and draws a connection between target networks and overestimation bias.
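The "minimum between a pair of critics" idea can be sketched as a target computation. This is a minimal illustration under assumed inputs (scalar reward and two next-state critic estimates), not the paper's full algorithm:

```python
import numpy as np

def clipped_double_q_target(reward, q1_next, q2_next, gamma=0.99, done=False):
    """Bellman target using the minimum of two critic estimates.

    Taking the element-wise minimum of the two target critics limits the
    overestimation bias that a single learned critic tends to accumulate.
    """
    q_min = np.minimum(q1_next, q2_next)
    return reward + gamma * (1.0 - float(done)) * q_min
```

Both critics are then regressed toward this shared target, while only one is used for the policy update.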
A suite of challenging continuous control tasks (integrated with OpenAI Gym), based on existing robotics hardware and following a multi-goal reinforcement learning (RL) framework, is introduced.
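In the multi-goal setting, the reward is typically defined by the distance between an achieved goal and a desired goal. A minimal sketch of such a sparse goal-conditioned reward (the threshold value here is illustrative, not taken from the paper):

```python
import numpy as np

def sparse_goal_reward(achieved_goal, desired_goal, threshold=0.05):
    """Sparse multi-goal reward: -1 until the goal is reached.

    The agent receives 0 only when the achieved goal lies within
    `threshold` (Euclidean distance) of the desired goal, and -1 otherwise.
    """
    d = np.linalg.norm(np.asarray(achieved_goal, dtype=float)
                       - np.asarray(desired_goal, dtype=float))
    return 0.0 if d < threshold else -1.0
```

Exposing the reward as a function of (achieved, desired) goal pairs is what enables techniques such as hindsight goal relabeling.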
Despite its simplicity, Decision Transformer matches or exceeds the performance of state-of-the-art model-free offline RL baselines on Atari, OpenAI Gym, and Key-to-Door tasks.
The effects of adding recurrence to a Deep Q-Network are investigated by replacing the first post-convolutional fully-connected layer with a recurrent LSTM, which successfully integrates information through time and replicates DQN's performance on standard Atari games and partially observed equivalents featuring flickering game screens.
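The architectural change described above can be sketched in PyTorch. The convolutional stack and layer sizes here follow the standard DQN convention and are assumptions for illustration; the paper's exact hyperparameters may differ:

```python
import torch
import torch.nn as nn

class DRQN(nn.Module):
    """DQN variant where the first post-conv fully-connected layer
    is replaced by an LSTM that integrates information across time."""

    def __init__(self, n_actions, hidden=256):
        super().__init__()
        # Standard DQN-style convolutional feature extractor (assumed).
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, 8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, 3, stride=1), nn.ReLU(),
        )
        # LSTM in place of the first fully-connected layer.
        self.lstm = nn.LSTM(64 * 7 * 7, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_actions)

    def forward(self, frames, state=None):
        # frames: (batch, time, channels=1, 84, 84)
        b, t = frames.shape[:2]
        feats = self.conv(frames.reshape(b * t, *frames.shape[2:]))
        feats = feats.reshape(b, t, -1)
        out, state = self.lstm(feats, state)
        return self.head(out), state
```

Because the LSTM carries hidden state across frames, the network can recover information that a single (possibly flickering) frame does not contain.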
A simple and scalable reinforcement learning algorithm that uses standard supervised learning methods as subroutines and is able to acquire more effective policies than most off-policy algorithms when learning from purely static datasets with no additional environmental interactions is developed.
An OpenAI-Gym-like gaming environment is created with the game Little Fighter 2 (LF2), and a novel A3C+ network is presented for learning RL agents; it includes a Recurrent Info network that uses game-related info features with recurrent layers to observe combo skills for fighting.
The TorchBeast design principles and implementation are described and it is demonstrated that it performs on-par with IMPALA on Atari.
A novel multi-goal RL objective based on weighted entropy is proposed, which encourages the agent to maximize the expected return, as well as to achieve more diverse goals and a maximum entropy-based prioritization framework is developed to optimize the proposed objective.
An implicit distributional actor critic that consists of a distributional critic, built on two deep generator networks, and a semi-implicit actor (SIA), powered by a flexible policy distribution to improve the sample efficiency of policy-gradient based reinforcement learning algorithms.
COOL-MC is presented, a tool that integrates state-of-the-art reinforcement learning (RL) with model checking to obtain bounds on the performance of so-called permissive policies.