VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

Chen, Yukang; Liu, Jianhui; Zhang, Xiangyu; Qi, Xiaojuan; Jia, Jiaya

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.11301 (cs)

[Submitted on 20 Mar 2023]

Title:VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

Authors:Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, Jiaya Jia

View PDF

Abstract:3D object detectors usually rely on hand-crafted proxies, e.g., anchors or centers, and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be densified and processed by dense prediction heads, which inevitably costs extra computation. In this paper, we instead propose VoxelNext for fully sparse 3D object detection. Our core insight is to predict objects directly based on sparse voxel features, without relying on hand-crafted proxies. Our strong sparse convolutional network VoxelNeXt detects and tracks 3D objects through voxel features entirely. It is an elegant and efficient framework, with no need for sparse-to-dense conversion or NMS post-processing. Our method achieves a better speed-accuracy trade-off than other mainframe detectors on the nuScenes dataset. For the first time, we show that a fully sparse voxel-based representation works decently for LIDAR 3D object detection and tracking. Extensive experiments on nuScenes, Waymo, and Argoverse2 benchmarks validate the effectiveness of our approach. Without bells and whistles, our model outperforms all existing LIDAR methods on the nuScenes tracking test benchmark.

Comments:	In CVPR 2023, Code and models are available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.11301 [cs.CV]
	(or arXiv:2303.11301v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.11301

Submission history

From: Chen Yukang [view email]
[v1] Mon, 20 Mar 2023 17:40:44 UTC (4,274 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators