1
Rethink reporting of evaluation results in AI
2
Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning
3
Mapping global dynamics of benchmark creation and saturation in artificial intelligence
4
Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents
5
Ethical and social risks of harm from Language Models
6
Statistical discrimination in learning agents
7
Collaborating with Humans without Human Data
9
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
10
The Option Keyboard: Combining Skills in Reinforcement Learning
11
A learning agent that acquires social norms from public sanctions in decentralized multi-agent settings
12
Podracer architectures for scalable Reinforcement Learning
13
Modelling Cooperation in Network Games with Spatio-Temporal Complexity
14
Open Problems in Cooperative AI
15
Towards Playing Full MOBA Games with Deep Reinforcement Learning
16
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences
17
The Origins and Psychology of Human Cooperation.
18
OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning
19
Too Many Cooks: Coordinating Multi-agent Collaboration Through Inverse Planning
20
Social Diversity and Social Preferences in Mixed-Motive Reinforcement Learning
21
Dota 2 with Large Scale Deep Reinforcement Learning
22
Grandmaster level in StarCraft II using multi-agent reinforcement learning
23
Dissecting racial bias in an algorithm used to manage the health of populations
24
On the Utility of Learning about Humans for Human-AI Coordination
25
Human Compatible: Artificial Intelligence and the Problem of Control
26
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
27
Emergent Tool Use From Multi-Agent Autocurricula
28
A Survey on Bias and Fairness in Machine Learning
29
Item response theory in AI: Analysing machine learning classifiers at the instance level
31
Open-ended Learning in Symmetric Zero-sum Games
32
Malthusian Reinforcement Learning
33
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
34
Quantifying Generalization in Reinforcement Learning
35
Generalization and Regularization in DQN
36
Multi-task Deep Reinforcement Learning with PopArt
37
Representation Learning with Contrastive Predictive Coding
38
Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward
39
Human-level performance in 3D multiplayer games with population-based reinforcement learning
40
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
41
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
42
Inequity aversion improves cooperation in intertemporal social dilemmas
43
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
44
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
45
Mastering the game of Go without human knowledge
46
Evaluation in artificial intelligence: from task-oriented to ability-oriented measurement
47
Learning with Opponent-Learning Awareness
48
Prosocial learning agents solve generalized Stag Hunts better than selfish ones
49
A multi-agent reinforcement learning model of common-pool resource appropriation
50
Maintaining cooperation in complex social dilemmas using deep reinforcement learning
51
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
52
Six Challenges for Neural Machine Translation
53
FeUdal Networks for Hierarchical Reinforcement Learning
54
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
55
Reinforcement Learning with Unsupervised Auxiliary Tasks
56
Concrete Problems in AI Safety
57
Cooperative Inverse Reinforcement Learning
58
Asynchronous Methods for Deep Reinforcement Learning
59
Mastering the game of Go with deep neural networks and tree search
60
Research Priorities for Robust and Beneficial Artificial Intelligence
61
ImageNet Large Scale Visual Recognition Challenge
62
On the difficulty of training recurrent neural networks
63
Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
64
Meta-analysis in medical research.
65
Lab Experiments for the Study of Social-Ecological Systems
66
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
67
Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
68
The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems
69
The evolution of cooperation
70
A Treatise of Human Nature: Being an Attempt to introduce the experimental Method of Reasoning into Moral Subjects
71
Games and Decisions: Introduction and Critical Survey.
72
Review: R. Duncan Luce and Howard Raiffa, Games and decisions: Introduction and critical survey
73
Agent-Based Computational Economics: Overview and Brief History 1
74
Evolutionary Game Theory
75
Tracking the Impact and Evolution of AI: The AIcollaboratory
77
A Meta-Analysis of Overfitting in Machine Learning
78
Aligning Superintelligence with Human Interests: A Technical Research Agenda
79
Understanding Institutional Diversity
80
Reinforcement Learning: An Introduction
81
Territory Inside Out: scenario 1
82
Running with Scissors in the Matrix Repeated: scenarios 0, 1, 2, 3, 4 for ACB exploiter was no better than random. We trained an OPRE exploiter as replacement
83
Collaborative Cooking Asymmetric / Circuit / Cramped / Figure Eight: scenario 2
84
Predator Prey Random Forest: scenario 3