Publication | Closed Access
D<inf>++</inf>: Structural credit assignment in tightly coupled multiagent domains
35
Citations
17
References
2016
Year
Unknown Venue
Artificial IntelligenceMulti-robot TeamEngineeringAutomationDistributed RoboticsMultirobot SystemStructural Credit AssignmentComputer ScienceIntelligent SystemsRobot LearningGlobal RewardMulti-agent LearningRoboticsMulti-agent Mechanism DesignMechanism DesignMulti-agent PlanningAutonomous Multi-robot TeamsDifference Reward
Autonomous multi-robot teams can be used in complex coordinated exploration tasks to improve exploration performance in terms of both speed and effectiveness. However, use of multi-robot systems presents additional challenges. Specifically, in domains where the robots' actions are tightly coupled, coordinating multiple robots to achieve cooperative behavior at the group level is difficult. In this paper, we demonstrate that reward shaping can greatly benefit learning in multi-robot exploration tasks. We propose a novel reward framework based on the idea of counterfactuals to tackle the coordination problem in tightly coupled domains. We show that the proposed algorithm provides superior performance (166% performance improvement and a quadruple convergence speed up) compared to policies learned using either the global reward or the difference reward [1].
| Year | Citations | |
|---|---|---|
Page 1
Page 1