Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach

Abstract

This paper describes the Q-routing algorithm for packet routing, in which a reinforcement learning module is embedded into each node of a switching network. Only local communication is used by each node to keep accurate statistics on which routing decisions lead to minimal delivery times. In simple experiments involving a 36-node, irregularly connected network, Q-routing proves superior to a nonadaptive algorithm based on precomputed shortest paths and is able to route efficiently even when critical aspects of the simulation, such as the network load, are allowed to vary dynamically. The paper concludes with a discussion of the tradeoff between discovering shortcuts and maintaining stable policies. 1 INTRODUCTION The field of reinforcement learning has grown dramatically over the past several years, but with the exception of backgammon [8, 2], has had few successful applications to large-scale, practical tasks. This paper demonstrates that the practical task of routing packets through...

References

Page 1

	Year	Citations
Learning from delayed rewards Chris Watkins OpenGrey (Institut de l'Information Scientifique et Technique) Artificial IntelligenceEngineeringMachine LearningStochastic GameGame Theory	1989	5.5K
On a routing problem Richard Bellman Quarterly of Applied Mathematics Mathematical ProgrammingTransport Network AnalysisEngineeringNetwork RoutingNetwork Analysis	1958	2.7K
Reinforcement learning for robots using neural networks Long-Ji Lin Defense Technical Information Center (DTIC)	1992	882
Practical issues in temporal difference learning Gerald Tesauro Machine Learning Artificial IntelligenceCognitive ScienceEngineeringMachine LearningTemporal Dynamic	1992	795
The role of exploration in learning control Sebastian Thrun Artificial IntelligenceEngineeringAgent Decision-makingEducationCognition	1992	227
On Routing and "Delta Routing": A Taxonomy and Performance Comparison of Techniques for Packet-Switched Networks H. Rudin IRE Transactions on Communications Systems EngineeringNetwork RoutingNetwork AnalysisAdaptive RoutingScalable Routing	1976	73

Page 1