Publication | Closed Access
Monte Carlo tree search in continuous action spaces with execution uncertainty
49
Citations
18
References
2016
Year
Mathematical ProgrammingArtificial IntelligenceEngineeringMachine LearningIntelligent SystemsState Space SearchData ScienceMcts MethodsUncertainty QuantificationStochastic GameRobot LearningCombinatorial OptimizationMonte CarloAction Model LearningSequential Decision MakingProbability TheoryComputer ScienceMonte Carlo SamplingExecution UncertaintyReal World ApplicationsMarkov Decision ProcessExploration V ExploitationContinuous Action SpacesReward Hacking
Real world applications of artificial intelligence often require agents to sequentially choose actions from continuous action spaces with execution uncertainty. When good actions are sparse, domain knowledge is often used to identify a discrete set of promising actions. These actions and their uncertain effects are typically evaluated using a recursive search procedure. The reduction of the problem to a discrete search problem causes severe limitations, notably, not exploiting all of the sampled outcomes when evaluating actions, and not using outcomes to help find new actions outside the original set. We propose a new Monte Carlo tree search (MCTS) algorithm specifically designed for exploiting an execution model in this setting. Using kernel regression, it generalizes the information about action quality between actions and to unexplored parts of the action space. In a high fidelity simulator of the Olympic sport of curling, we show that this approach significantly outperforms existing MCTS methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1