Publication | Open Access
Sample Efficient Actor-Critic with Experience Replay
221
Citations
20
References
2016
Year
Artificial IntelligenceEngineeringMachine LearningDeep Reinforcement LearningStochastic GameExperience ReplayGame TheoryActor-critic Deep ReinforcementBias CorrectionSequential Decision MakingComputer ScienceMulti-agent LearningRobot LearningLearning ControlExploration V Exploitation
This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several continuous control problems. To achieve this, the paper introduces several innovations, including truncated importance sampling with bias correction, stochastic dueling network architectures, and a new trust region policy optimization method.
| Year | Citations | |
|---|---|---|
Page 1
Page 1