← Back ICRA 2026

Safety Evaluation of Motion Plans Using Trajectory Predictors As Forward Reachable Set Estimators

Kaustav Chakraborty, Zeyuan Feng, Sushant Veer, Apoorva Sharma, Wenhao Ding, Sever Topan, Boris Ivanovic, Marco Pavone, Somil Bansal

PDF

AI summary

Key figure (auto-extracted from paper)

FORCE-OPT leverages conformal prediction and convex optimization on trajectory predictors to generate calibrated, less conservative reachable sets, significantly improving safety monitor soundness without sacrificing completeness.

Autonomous driving Safety monitoring Forward reachable sets Conformal prediction Trajectory prediction Motion planning

Problem

Learning-based autonomy stacks lack interpretable safety guarantees, making it difficult to verify motion plans. Existing reachability monitors struggle to balance soundness (avoiding false alarms) and completeness (catching all unsafe plans).

Approach

The method treats multi-modal trajectory predictors as data-driven forward reachable set estimators, using convex optimization to extract tight sets and conformal prediction to guarantee ground-truth coverage. A Bayesian filter dynamically adjusts uncertainty to handle out-of-distribution scenarios and predictor failures.

Key results

Rigorous probabilistic formulation bridging deterministic and stochastic forward reachable sets
FORCE-OPT algorithm for efficient, calibrated FRS extraction from Gaussian Mixture Models
Bayesian filtering mechanism for dynamic uncertainty adaptation under distribution shift
Empirical validation on nuScenes demonstrating superior false positive/negative rate balance over baselines

Why it matters

Enables reliable, real-time safety verification for end-to-end autonomous driving systems, bridging the gap between data-driven prediction and formal safety guarantees.

Abstract

The advent of end-to-end autonomy stacks—often lacking interpretable intermediate modules—has placed an increased burden on ensuring that the final output, i.e., the motion plan, is safe in order to validate the safety of the entire stack. This requires a safety monitor that is both complete (able to detect all unsafe plans) and sound (does not flag safe plans). In this work, we propose a principled safety monitor that leverages modern multi-modal trajectory predictors to approximate forward reachable sets (FRS) of surrounding agents. By formulating a convex program, we efficiently extract these data-driven FRSs directly from the predicted state distributions, conditioned on scene context such as lane topology and agent history. To ensure completeness, we leverage conformal prediction to calibrate the FRS and guarantee coverage of ground-truth trajectories with high probability. To preserve soundness in out-of-distribution (OOD) scenarios or under predictor failure, we introduce a Bayesian filter that dynamically adjusts the FRS conservativeness based on the predictor’s observed performance. We then assess the safety of the ego vehicle’s motion plan by checking for intersections with these calibrated FRSs, ensuring the plan remains collision-free under plausible future behaviors of others. Extensive experiments on the nuScenes dataset show our approach significantly improves soundness while maintaining completeness, offering a practical and reliable safety monitor for learned autonomy stacks. Project Website: vatsuak.github.io/forceopt/

Index terms

Robot Safety Autonomous Vehicle Navigation Planning under Uncertainty