Publication | Closed Access
Policy gradient reinforcement learning for fast quadrupedal locomotion
580
Citations
14
References
2004
Year
Unknown Venue
Artificial IntelligenceEngineeringField RoboticsPolicy Gradient ReinforcementMotor ControlIntelligent SystemsLearning ControlKinesiologyLegged RobotKinematicsRobot LearningHumanoid RobotHealth SciencesSpecific RobotMotion SynthesisSony Aibo RobotComputer ScienceQuadrupedal Trot GaitHuman MovementRobotics
This paper presents a machine learning approach to optimizing a quadrupedal trot gait for forward speed. Given a parameterized walk designed for a specific robot, we propose using a form of policy gradient reinforcement learning to automatically search the set of possible parameters with the goal of finding the fastest possible walk. We implement and test our approach on a commercially available quadrupedal robot platform, namely the Sony Aibo robot. After about three hours of learning, all on the physical robots and with no human intervention other than to change the batteries, the robots achieved a gait faster than any previously known gait known for the Aibo, significantly outperforming a variety of existing hand-coded and learned solutions.
| Year | Citations | |
|---|---|---|
1993 | 18K | |
1989 | 5.5K | |
1999 | 5K | |
1988 | 4.8K | |
1989 | 1.3K | |
2001 | 328 | |
2003 | 283 | |
2002 | 277 | |
2004 | 140 | |
2002 | 125 |
Page 1
Page 1