M3TR: A Generalist Model for Real-World HD Map Completion

Immel, Fabian; Fehler, Richard; Bieder, Frank; Pauls, Jan-Hendrik; Stiller, Christoph

arXiv:2411.10316 (cs)

[Submitted on 15 Nov 2024 (v1), last revised 21 May 2025 (this version, v4)]

Abstract: Autonomous vehicles rely on HD maps for their operation, but offline HD maps eventually become outdated. For this reason, online HD map construction methods use live sensor data to infer map information instead. Research on real map changes shows that oftentimes entire parts of an HD map remain unchanged and can be used as a prior. We therefore introduce M3TR (Multi-Masking Map Transformer), a generalist approach for HD map completion both with and without offline HD map priors. As a necessary foundation, we address shortcomings in ground truth labels for Argoverse 2 and nuScenes and propose the first comprehensive benchmark for HD map completion. Unlike existing models that specialize in a single kind of map change, which is unrealistic for deployment, our Generalist model handles all kinds of changes, matching the effectiveness of Expert models. With our map masking as augmentation regime, we can even achieve a +1.4 mAP improvement without a prior. Finally, by fully utilizing prior HD map elements and optimizing query designs, M3TR outperforms existing methods by +4.3 mAP while being the first real-world deployable model for offline HD map priors. Code is available at this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2411.10316 [cs.CV]
	(or arXiv:2411.10316v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.10316

Submission history

From: Fabian Immel
[v1] Fri, 15 Nov 2024 16:14:48 UTC (17,991 KB)
[v2] Tue, 10 Dec 2024 18:41:14 UTC (39,025 KB)
[v3] Mon, 10 Mar 2025 18:24:00 UTC (39,261 KB)
[v4] Wed, 21 May 2025 14:09:13 UTC (39,262 KB)
