Deep Reinforcement Learning: A Brief Survey

TLDR

Deep reinforcement learning is poised to revolutionise AI, enabling autonomous systems with higher visual understanding, scaling to previously intractable problems such as learning from pixels and controlling robots directly from camera inputs. The survey introduces reinforcement learning, discusses value‑based and policy‑based methods, highlights deep neural networks’ visual‑learning advantages, and outlines current research directions. The survey reviews central deep reinforcement learning algorithms, including deep Q‑networks, trust‑region policy optimisation, and asynchronous advantage actor‑critic.

Abstract

Deep reinforcement learning is poised to revolutionise the field of AI and represents a step towards building autonomous systems with a higher level understanding of the visual world. Currently, deep learning is enabling reinforcement learning to scale to problems that were previously intractable, such as learning to play video games directly from pixels. Deep reinforcement learning algorithms are also applied to robotics, allowing control policies for robots to be learned directly from camera inputs in the real world. In this survey, we begin with an introduction to the general field of reinforcement learning, then progress to the main streams of value-based and policy-based methods. Our survey will cover central algorithms in deep reinforcement learning, including the deep $Q$-network, trust region policy optimisation, and asynchronous advantage actor-critic. In parallel, we highlight the unique advantages of deep neural networks, focusing on visual understanding via reinforcement learning. To conclude, we describe several current areas of research within the field.

References

Page 1

	Year	Citations
Learning representations by back-propagating errors David E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams Nature EngineeringMachine LearningFeature LearningPattern RecognitionKnowledge Discovery	1986	29.7K
Human-level control through deep reinforcement learning Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Nature Artificial IntelligenceEngineeringDeep Reinforcement LearningReinforcement Learning (Educational Psychology)Computer Science	2015	28.8K
Reinforcement Learning: An Introduction Richard S. Sutton, Andy Barto IEEE Transactions on Neural Networks Artificial IntelligenceEngineeringDeep Reinforcement LearningComputer ScienceRobot Learning	1998	26.8K
Reinforcement Learning: An Introduction IEEE Transactions on Neural Networks Artificial IntelligenceEngineeringDeep Reinforcement LearningStochastic GameGame Theory	2005	25.7K
GAN（Generative Adversarial Nets） Journal of Japan Society for Fuzzy Theory and Intelligent Informatics Artificial IntelligenceGenerative Artificial IntelligenceEngineeringMachine LearningData Science	2017	21.7K
Auto-Encoding Variational Bayes Diederik P. Kingma, Max Welling UvA-DARE (University of Amsterdam) Structured PredictionAuto-encoding Variational BayesMachine LearningEngineeringAutoencoders	2013	15.5K
Mastering the game of Go with deep neural networks and tree search David Silver, Aja Huang, Chris J. Maddison, Nature Artificial IntelligenceGame AiDeep Neural NetworksEngineeringMachine Learning	2016	15.5K
Distilling the Knowledge in a Neural Network Geoffrey E. Hinton, Oriol Vinyals arXiv (Cornell University) Artificial IntelligenceEngineeringMachine LearningNeural NetworkAi Foundation	2015	13.9K
Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper) arXiv (Cornell University)	2017	11.2K
Q-learning Christopher J. Watkins, Peter Dayan Machine Learning	1992	8.9K

Page 1