Motion planning under uncertainty using iterative local optimization in belief space

TLDR

The authors propose a motion‑planning method that computes a locally optimal solution to a continuous POMDP under sensing and motion uncertainty. Their approach models beliefs as Gaussians, approximates belief dynamics with an extended Kalman filter, represents the value function quadratically near a nominal trajectory, and applies iterative LQG to converge to a locally optimal linear control policy without assuming maximum‑likelihood observations, fixed estimator or control gains, or discretizing state and action spaces, while handling obstacles and achieving polynomial runtime O(n⁶). Simulations show the method enables holonomic and non‑holonomic robots to navigate obstacle‑laden environments with noisy, partial sensing and nonlinear dynamics and observation models.

Abstract

We present a new approach to motion planning under sensing and motion uncertainty by computing a locally optimal solution to a continuous partially observable Markov decision process (POMDP). Our approach represents beliefs (the distributions of the robot’s state estimate) by Gaussian distributions and is applicable to robot systems with non-linear dynamics and observation models. The method follows the general POMDP solution framework in which we approximate the belief dynamics using an extended Kalman filter and represent the value function by a quadratic function that is valid in the vicinity of a nominal trajectory through belief space. Using a belief space variant of iterative LQG (iLQG), our approach iterates with second-order convergence towards a linear control policy over the belief space that is locally optimal with respect to a user-defined cost function. Unlike previous work, our approach does not assume maximum-likelihood observations, does not assume fixed estimator or control gains, takes into account obstacles in the environment, and does not require discretization of the state and action spaces. The running time of the algorithm is polynomial (O[n 6 ]) in the dimension n of the state space. We demonstrate the potential of our approach in simulation for holonomic and non-holonomic robots maneuvering through environments with obstacles with noisy and partial sensing and with non-linear dynamics and observation models.

References

Page 1

	Year	Citations

Page 1