Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Huang, Bin; Li, Yangguang; Xie, Enze; Liang, Feng; Wang, Luya; Shen, Mingzhu; Liu, Fenggang; Wang, Tianqi; Luo, Ping; Shao, Jing

Computer Science > Computer Vision and Pattern Recognition

arXiv:2301.07870 (cs)

[Submitted on 19 Jan 2023]

Title:Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Authors:Bin Huang, Yangguang Li, Enze Xie, Feng Liang, Luya Wang, Mingzhu Shen, Fenggang Liu, Tianqi Wang, Ping Luo, Jing Shao

View PDF

Abstract:Recently, the pure camera-based Bird's-Eye-View (BEV) perception removes expensive Lidar sensors, making it a feasible solution for economical autonomous driving. However, most existing BEV solutions either suffer from modest performance or require considerable resources to execute on-vehicle inference. This paper proposes a simple yet effective framework, termed Fast-BEV, which is capable of performing real-time BEV perception on the on-vehicle chips. Towards this goal, we first empirically find that the BEV representation can be sufficiently powerful without expensive view transformation or depth representation. Starting from M2BEV baseline, we further introduce (1) a strong data augmentation strategy for both image and BEV space to avoid over-fitting (2) a multi-frame feature fusion mechanism to leverage the temporal information (3) an optimized deployment-friendly view transformation to speed up the inference. Through experiments, we show Fast-BEV model family achieves considerable accuracy and efficiency on edge. In particular, our M1 model (R18@256x704) can run over 50FPS on the Tesla T4 platform, with 47.0% NDS on the nuScenes validation set. Our largest model (R101@900x1600) establishes a new state-of-the-art 53.5% NDS on the nuScenes validation set. The code is released at: this https URL.

Comments:	Accepted by NeurIPS2022_ML4AD on October 22, 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2301.07870 [cs.CV]
	(or arXiv:2301.07870v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2301.07870
Journal reference:	NeurIPS2022_ML4AD

Submission history

From: Yangguang Li [view email]
[v1] Thu, 19 Jan 2023 03:58:48 UTC (5,181 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators