Publication | Closed Access
Policy Transfer using Reward Shaping
51
Citations
22
References
2015
Year
Artificial IntelligenceEngineeringMachine LearningGame TheoryLearning AlgorithmMotor ControlIntelligent SystemsLearning ControlKinesiologyReinforcement Learning (Computer Engineering)Stochastic GameRobot LearningDecision TheoryMechanism DesignPublic PolicyCognitive ScienceAction Model LearningSequential Decision MakingComputer SciencePolicy TransferBusinessTransfer Learning
Transfer learning has proven to be a wildly successful approach for speeding up reinforcement learning. Techniques often use low-level information obtained in the source task to achieve successful transfer in the target task. Yet, a most general transfer approach can only assume access to the output of the learning algorithm in the source task, i.e. the learned policy, enabling transfer irrespective of the learning algorithm used in the source task. We advance the state-of-the-art by using a reward shaping approach to policy transfer. One of the advantages in following such an approach, is that it firmly grounds policy transfer in an actively developing body of theoretical research on reward shaping. Experiments in Mountain Car, Cart Pole and Mario demonstrate the practical usefulness of the approach.
| Year | Citations | |
|---|---|---|
Page 1
Page 1