Publication | Closed Access
DRL-OR: Deep Reinforcement Learning-based Online Routing for Multi-type Service Requirements
77
Citations
35
References
2021
Year
Unknown Venue
Artificial IntelligenceEngineeringMachine LearningMulti-type Service RequirementsNetwork RoutingEducationReinforcement Learning (Educational Psychology)Multi-agent LearningReinforcement Learning (Computer Engineering)Implement Drl-orSystems EngineeringAdvanced NetworkingMulti-agent PlanningCritical Qos RequirementsNetwork FlowsSoftware-defined NetworkingComputer EngineeringComputer ScienceDeep LearningMulti-agent Deep ReinforcementNetwork Routing AlgorithmDeep Reinforcement LearningNetwork Traffic Control
Emerging applications raise critical QoS requirements for the Internet. The improvements of flow classification technologies, software defined networks (SDN), and programmable network devices make it possible to fast identify users' requirements and control the routing for fine-grained traffic flows. Meanwhile, the problem of optimizing the forwarding paths for traffic flows with multiple QoS requirements in an online fashion is not addressed sufficiently. To address the problem, we propose DRL-OR, an online routing algorithm using multi-agent deep reinforcement learning. DRL-OR organizes the agents to generate routes in a hop-by-hop manner, which inherently has good scalability. It adopts a comprehensive reward function, an efficient learning algorithm, and a novel deep neural network structure to learn an appropriate routing policy for different types of flow requirements. To guarantee the reliability and accelerate the online learning process, we further introduce safe learning mechanism to DRL-OR. We implement DRL-OR under SDN architecture and conduct Mininet-based experiments by using real network topologies and traffic traces. The results validate that DRL-OR can well satisfy the requirements of latency-sensitive, throughput-sensitive, latency-throughput-sensitive, and latency-loss-sensitive flows at the same time, while exhibiting great adaptiveness and reliability under the scenarios of link failure, traffic change, and partial deployment.
| Year | Citations | |
|---|---|---|
Page 1
Page 1