Publication | Closed Access
Multiagent Meta-Reinforcement Learning for Adaptive Multipath Routing Optimization
Year: 2021
Citations: 73
References: 32
Artificial Intelligence, Multiagent Reinforcement Learning, Engineering, Edge Computing, Network Traffic Control, Route Planning, Multiagent Meta-reinforcement Learning, Network Traffic Demand, Computer Science, Multi-agent Learning, Network Optimization, Routing Problem, Multi-agent Planning, Combinatorial Optimization, Operations Research
In this article, we investigate the routing problem of packet networks through multiagent reinforcement learning (RL), a challenging topic in distributed and autonomous networked systems. Specifically, the routing problem is modeled as a networked multiagent partially observable Markov decision process (MDP). Since the MDP of a network node is affected not only by its neighboring nodes' policies but also by the network traffic demand, it becomes a multitask learning problem. Inspired by the recent success of RL and metalearning, we propose two novel model-free multiagent RL algorithms, named multiagent proximal policy optimization (MAPPO) and multiagent metaproximal policy optimization (meta-MAPPO), to optimize network performance under fixed and time-varying traffic demand, respectively. A practicable distributed implementation framework is designed based on the separability of exploration and exploitation in training MAPPO. Our simulation results demonstrate the excellent performance of the proposed algorithms compared with existing routing optimization policies.