Publication | Open Access
Solving Rubik's Cube with a Robot Hand
630
Citations
91
References
2019
Year
Artificial IntelligenceEngineeringMachine LearningDexterous ManipulationField RoboticsIntelligent RoboticsCognitive RoboticsObject ManipulationIntelligent SystemsRobot HandReal RobotIndustrial RoboticsRobot LearningKinematicsControl PoliciesAutomatic Domain RandomizationMechatronicsComputer ScienceWorld ModelAutomationRobotics
The study demonstrates that simulation‑only trained models can solve a highly complex real‑world manipulation task. The authors employ automatic domain randomization (ADR) to generate increasingly difficult randomized environments and use a machine‑learning‑optimized robot platform to train policies. Policies and vision estimators trained with ADR achieve strong sim‑to‑real transfer, exhibit emergent meta‑learning, and enable a humanoid robot hand to solve a Rubik’s Cube. Videos summarizing the results are available at https://openai.com/blog/solving-rubiks-cube/.
We demonstrate that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot. This is made possible by two key components: a novel algorithm, which we call automatic domain randomization (ADR) and a robot platform built for machine learning. ADR automatically generates a distribution over randomized environments of ever-increasing difficulty. Control policies and vision state estimators trained with ADR exhibit vastly improved sim2real transfer. For control policies, memory-augmented models trained on an ADR-generated distribution of environments show clear signs of emergent meta-learning at test time. The combination of ADR with our custom robot platform allows us to solve a Rubik's cube with a humanoid robot hand, which involves both control and state estimation problems. Videos summarizing our results are available: https://openai.com/blog/solving-rubiks-cube/
| Year | Citations | |
|---|---|---|
Page 1
Page 1