1
Magnetic control of tokamak plasmas through deep reinforcement learning
2
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
3
Continuous Control with Action Quantization from Demonstrations
4
Offline RL Without Off-Policy Evaluation
5
A Minimalist Approach to Offline Reinforcement Learning
6
Launchpad: A Programming Model for Distributed Machine Learning Research
7
What Matters for Adversarial Imitation Learning?
8
Regularized Behavior Value Estimation
9
Reverb: A Framework For Experience Replay
10
Autonomous navigation of stratospheric balloons using reinforcement learning
11
Munchausen Reinforcement Learning
12
Hyperparameter Selection for Offline Reinforcement Learning
13
Critic Regularized Regression
14
Primal Wasserstein Imitation Learning
15
Conservative Q-Learning for Offline Reinforcement Learning
16
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
17
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
18
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)
19
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
20
Never Give Up: Learning Directed Exploration Strategies
21
Dota 2 with Large Scale Deep Reinforcement Learning
22
Grandmaster level in StarCraft II using multi-agent reinforcement learning
23
TorchBeast: A PyTorch Platform for Distributed RL
24
Benchmarking Batch Deep Reinforcement Learning Algorithms
25
Scaling data-driven robotics with reward sketching and batch reinforcement learning
26
Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
27
Behaviour Suite for Reinforcement Learning
28
An Optimistic Perspective on Offline Reinforcement Learning
29
When to use parametric models in reinforcement learning?
30
Imitation Learning as f-Divergence Minimization
31
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards
32
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
33
Soft Actor-Critic Algorithms and Applications
34
Off-Policy Deep Reinforcement Learning without Exploration
35
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
36
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
37
SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark
38
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning
39
Recurrent Experience Replay in Distributed Reinforcement Learning
40
Dopamine: A Research Framework for Deep Reinforcement Learning
41
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
42
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
43
Learning dexterous in-hand manipulation
44
Implicit Quantile Networks for Distributional Reinforcement Learning
45
Randomized Prior Functions for Deep Reinforcement Learning
46
Observe and Look Further: Achieving Consistent Performance on Atari
47
Meta-Gradient Reinforcement Learning
48
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
49
Addressing Function Approximation Error in Actor-Critic Methods
50
Distributed Prioritized Experience Replay
51
Distributed Distributional Deterministic Policy Gradients
52
Maximum a Posteriori Policy Optimisation
53
Spectral Normalization for Generative Adversarial Networks
54
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
55
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
57
Ray: A Distributed Framework for Emerging AI Applications
58
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments
59
Rainbow: Combining Improvements in Deep Reinforcement Learning
60
Overcoming Exploration in Reinforcement Learning with Demonstrations
61
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
62
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
63
Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
64
Proximal Policy Optimization Algorithms
65
A Distributional Perspective on Reinforcement Learning
66
Deep Q-learning From Demonstrations
67
Learning from Demonstrations for Real World Reinforcement Learning
68
Improved Training of Wasserstein GANs
69
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
70
FeUdal Networks for Hierarchical Reinforcement Learning
71
Reinforcement Learning with Unsupervised Auxiliary Tasks
72
Sample Efficient Actor-Critic with Experience Replay
73
Generative Adversarial Imitation Learning
74
Unifying Count-Based Exploration and Intrinsic Motivation
76
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
77
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
78
Asynchronous Methods for Deep Reinforcement Learning
79
Mastering the game of Go with deep neural networks and tree search
80
Dueling Network Architectures for Deep Reinforcement Learning
81
Prioritized Experience Replay
82
Deep Reinforcement Learning with Double Q-Learning
83
Continuous control with deep reinforcement learning
84
Massively Parallel Methods for Deep Reinforcement Learning
85
High-Dimensional Continuous Control Using Generalized Advantage Estimation
86
Human-level control through deep reinforcement learning
87
Trust Region Policy Optimization
88
Boosted Bellman Residual Minimization Handling Expert Demonstrations
89
Deterministic Policy Gradient Algorithms
90
Playing Atari with Deep Reinforcement Learning
91
Learning from Limited Demonstrations
92
MuJoCo: A physics engine for model-based control
93
The Arcade Learning Environment: An Evaluation Platform for General Agents
95
Efficient Reductions for Imitation Learning
96
Maximum Entropy Inverse Reinforcement Learning
97
Universal Intelligence: A Definition of Machine Intelligence
98
Boosting Structured Prediction for Imitation Learning
99
Tree-Based Batch Mode Reinforcement Learning
100
Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method
101
Apprenticeship learning via inverse reinforcement learning
102
Least-Squares Policy Iteration
103
Algorithms for Inverse Reinforcement Learning
104
Policy Gradient Methods for Reinforcement Learning with Function Approximation
105
Planning and Acting in Partially Observable Stochastic Domains
106
Learning from Demonstration
107
Residual Algorithms: Reinforcement Learning with Function Approximation
108
Acting Optimally in Partially Observable Stochastic Domains
109
Markov Decision Processes: Discrete Stochastic Dynamic Programming
110
Self-improving reactive agents based on reinforcement learning, planning and teaching
112
Efficient Training of Artificial Neural Networks for Autonomous Navigation
113
A Markovian Decision Process
114
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning
115
Reverb: An efficient data storage and transport system for ml research
116
Seed rl: Scalable and efficient deep-rl with accelerated central inference
117
dm_env: A python interface for reinforcement learning environments
118
Learning Robust Rewards with Adverserial Inverse Reinforcement Learning
119
JAX: composable transformations of Python+NumPy programs
121
ALVINN, an autonomous land vehicle in a neural network
123
Rationality and Intelligence: A Brief Update
124
Batch Reinforcement Learning
125
Reinforcement Learning: An Introduction
126
Learning agents for uncertain environments
127
Knowledge, Learning and Machine Intelligence