Publication | Closed Access
Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach
696
Citations
6
References
1993
Year
Unknown Venue
This paper describes the Q-routing algorithm for packet routing, in which a reinforcement learning module is embedded into each node of a switching network. Only local communication is used by each node to keep accurate statistics on which routing decisions lead to minimal delivery times. In simple experiments involving a 36-node, irregularly connected network, Q-routing proves superior to a nonadaptive algorithm based on precomputed shortest paths and is able to route efficiently even when critical aspects of the simulation, such as the network load, are allowed to vary dynamically. The paper concludes with a discussion of the tradeoff between discovering shortcuts and maintaining stable policies. 1 INTRODUCTION The field of reinforcement learning has grown dramatically over the past several years, but with the exception of backgammon [8, 2], has had few successful applications to large-scale, practical tasks. This paper demonstrates that the practical task of routing packets through...
| Year | Citations | |
|---|---|---|
1989 | 5.5K | |
1958 | 2.7K | |
1992 | 882 | |
1992 | 795 | |
1992 | 227 | |
1976 | 73 |
Page 1
Page 1