M3TR: A Generalist Model for Real-World HD Map Completion
Fabian Immel, Richard Schwarzkopf, Frank Bieder, Jan-Hendrik Pauls, Christoph Stiller
AI summary
Problem
Offline HD maps quickly become outdated, and existing online map construction methods either ignore prior information, specialize in only one type of map change, or lack realistic evaluation benchmarks and metrics for partial map completion.
Approach
M3TR introduces a multi-masking map transformer with a novel query design and a generalist training regime that uses map masking as data augmentation, enabling it to handle any combination of outdated map priors and sensor data.
Key results
- First comprehensive HD map completion benchmark with improved ground truth and prior-aware metric
- Novel point query and query set design that fully utilizes prior map information
- Generalist model matches specialized expert models across all prior scenarios without extra memory or compute
- Achieves +1.4 mAP improvement without a prior and +4.3 mAP with priors on Argoverse 2 and nuScenes
Why it matters
Enables real-world deployment of autonomous vehicles by providing a single, robust model that dynamically adapts to partially outdated HD maps using live sensor data.
Abstract
Autonomous vehicles rely on HD maps for their operation, but offline HD maps eventually become outdated. For this reason, online HD map construction methods use live sensor data to infer map information instead. Research on real map changes shows that oftentimes entire parts of an HD map remain unchanged and can be used as a prior. We therefore introduce M3TR (Multi-Masking Map Transformer), a generalist approach for HD map completion both with and without offline HD map priors. As a necessary foundation, we address shortcomings in ground truth labels for Argoverse 2 and nuScenes and propose the first comprehensive benchmark for HD map completion. Unlike existing models that specialize in a single kind of map change, which is unrealistic for deployment, our Generalist model handles all kinds of changes, matching the effectiveness of Expert models. With our map masking as augmentation regime, we can even achieve a +1.4 mAP improvement without a prior. Finally, by fully utilizing prior HD map elements and optimizing query designs, M3TR outperforms existing methods by +4.3 mAP while being the first real-world deployable model for offline HD map priors. https://github.com/immel-f/m3tr