Video Killed the HD-Map: Predicting Multi-Agent Behavior Directly From Aerial Images

Liu, Yunpeng; Lioutas, Vasileios; Lavington, Jonathan Wilder; Niedoba, Matthew; Sefas, Justice; Dabiri, Setareh; Green, Dylan; Liang, Xiaoxuan; Zwartsenberg, Berend; Ścibior, Adam; Wood, Frank

arXiv:2305.11856 (cs)

[Submitted on 19 May 2023 (v1), last revised 20 Sep 2023 (this version, v2)]

Abstract: The development of algorithms that learn multi-agent behavioral models using human demonstrations has led to increasingly realistic simulations in the field of autonomous driving. In general, such models learn to jointly predict trajectories for all controlled agents by exploiting road context information, such as drivable lanes, obtained from manually annotated high-definition (HD) maps. Recent studies show that these models can greatly benefit from increasing the amount of human data available for training. However, the manual annotation of HD maps, which is necessary for every new location, creates a bottleneck for efficiently scaling up human traffic datasets. We propose an aerial image-based map (AIM) representation that requires minimal annotation and provides rich road context information for traffic agents such as pedestrians and vehicles. We evaluate multi-agent trajectory prediction using the AIM by incorporating it into a differentiable driving simulator as an image-texture-based differentiable rendering module. Our results demonstrate that models using our AIM representation achieve competitive multi-agent trajectory prediction performance, especially for pedestrians in the scene, compared to models trained with rasterized HD maps.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
ACM classes:	I.2.9; I.4.9
Cite as:	arXiv:2305.11856 [cs.CV]
	(or arXiv:2305.11856v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.11856
Journal reference:	2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)

Submission history

From: Yunpeng Liu
[v1] Fri, 19 May 2023 17:48:01 UTC (7,736 KB)
[v2] Wed, 20 Sep 2023 00:09:13 UTC (11,685 KB)
