It is demonstrated that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot, made possible by a novel algorithm, which is called automatic domain randomization (ADR), and a robot platform built for machine learning.
We demonstrate that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot. This is made possible by two key components: a novel algorithm, which we call automatic domain randomization (ADR) and a robot platform built for machine learning. ADR automatically generates a distribution over randomized environments of ever-increasing difficulty. Control policies and vision state estimators trained with ADR exhibit vastly improved sim2real transfer. For control policies, memory-augmented models trained on an ADR-generated distribution of environments show clear signs of emergent meta-learning at test time. The combination of ADR with our custom robot platform allows us to solve a Rubik's cube with a humanoid robot hand, which involves both control and state estimation problems. Videos summarizing our results are available: this https URL
Matthias Plappert
6 papers
Jerry Tworek
5 papers
Qiming Yuan
2 papers
Peter Welinder
4 papers
Bob McGrew
4 papers
Ma-teusz Litwin
3 papers
Marcin Andrychowicz
2 papers
Glenn Powell
2 papers
Maciek Chociej
2 papers
Ilge Akkaya
2 papers
N. Tezak
2 papers
Lilian Weng
2 papers
OpenAI
1 papers
Arthur Petron
1 papers
Raphael Ribas
1 papers
Lei M. Zhang
1 papers