Publication | Closed Access
Value-based observation compression for DEC-POMDPs
Citations: 39
References: 14
Year: 2008
Artificial Intelligence, Mathematical Programming, Engineering, Agent Decision-making, Game Theory, Multi-agent Learning, Data Science, Uncertainty Quantification, Management, Robot Learning, Combinatorial Optimization, Mechanism Design, Multi-agent Planning, Information Theory, Value-based Observation Compression, Compact Policies, Computer Science, Multi-agent Mechanism Design, Multi-agent Planning Algorithms, Data Compression, Signal Processing, Markov Decision Process, Agent Policies, Data Modeling
Representing agent policies compactly is essential for improving the scalability of multi-agent planning algorithms. In this paper, we focus on developing a pruning technique that allows us to merge certain observations within agent policies while minimizing the loss of value. This is particularly important for solving finite-horizon decentralized POMDPs, where agent policies are represented as trees and the size of a policy tree grows exponentially with the number of observations. We introduce a value-based observation compression technique that prunes the least valuable observations while maintaining an error bound on the value lost as a result of pruning. We analyze the characteristics of this pruning strategy and show empirically that it is effective. Using the resulting compact policies, we obtain significantly higher values than the best existing DEC-POMDP algorithm.
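To make the scalability argument concrete, the following minimal sketch counts the nodes in a complete policy tree. It assumes every node branches once per observation and that merging observations simply lowers the effective branching factor. The function name `policy_tree_nodes` and the values 5, 3, and 4 are illustrative assumptions, not taken from the paper, and the sketch does not implement the paper's value-based pruning criterion.

```python
def policy_tree_nodes(num_observations: int, horizon: int) -> int:
    """Count action nodes in a complete policy tree (hypothetical helper).

    Depth d (0-based) holds num_observations ** d nodes, one per
    observation history of length d, for d = 0 .. horizon - 1.
    """
    return sum(num_observations ** depth for depth in range(horizon))


# Illustrative numbers only: 5 raw observations versus 3 after merging.
full = policy_tree_nodes(num_observations=5, horizon=4)    # 1 + 5 + 25 + 125 = 156
merged = policy_tree_nodes(num_observations=3, horizon=4)  # 1 + 3 + 9 + 27 = 40

print(f"full tree: {full} nodes, compressed tree: {merged} nodes")
```

Under these assumptions, treating two of the five observations as equivalent to others shrinks a horizon-4 tree from 156 nodes to 40, which suggests why even modest observation merging can let a planner reach longer horizons within the same memory budget.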