← Back ICRA 2023

Model Predictive Optimized Path Integral Strategies

Dylan Asmar, Ransalu Senanayake, Shawn Manuel, Mykel Kochenderfer

PDF

Abstract

We generalize the derivation of model predictive path integral control (MPPI) to allow for a single joint distribu- tion across controls in the control sequence. This reformation allows for the implementation of adaptive importance sampling (AIS) algorithms into the original importance sampling step while still maintaining the benefits of MPPI such as working with arbitrary system dynamics and cost functions. The benefit of optimizing the proposal distribution by integrating AIS at each control step is demonstrated in simulated environments including controlling multiple cars around a track. The new algorithm is more sample efficient than MPPI, achieving better performance with fewer samples. This performance disparity grows as the dimension of the action space increases. Results from simulations suggest the new algorithm can be used as an anytime algorithm, increasing the value of control at each iteration versus relying on a large set of samples. Repository— https://github.com/sisl/MPOPIS

Index terms

Optimization and Optimal Control Probability and Statistical Methods Reinforcement Learning