SphereVLAD++: Attention-based and Signal-enhanced Viewpoint Invariant Descriptor

Zhao, Shiqi; Yin, Peng; Yi, Ge; Scherer, Sebastian

Computer Science > Computer Vision and Pattern Recognition

arXiv:2207.02958 (cs)

[Submitted on 6 Jul 2022 (v1), last revised 3 Oct 2022 (this version, v2)]

Title:SphereVLAD++: Attention-based and Signal-enhanced Viewpoint Invariant Descriptor

Authors:Shiqi Zhao, Peng Yin, Ge Yi, Sebastian Scherer

View PDF

Abstract:LiDAR-based localization approach is a fundamental module for large-scale navigation tasks, such as last-mile delivery and autonomous driving, and localization robustness highly relies on viewpoints and 3D feature extraction. Our previous work provides a viewpoint-invariant descriptor to deal with viewpoint differences; however, the global descriptor suffers from a low signal-noise ratio in unsupervised clustering, reducing the distinguishable feature extraction ability. We develop SphereVLAD++, an attention-enhanced viewpoint invariant place recognition method in this work. SphereVLAD++ projects the point cloud on the spherical perspective for each unique area and captures the contextual connections between local features and their dependencies with global 3D geometry distribution. In return, clustered elements within the global descriptor are conditioned on local and global geometries and support the original viewpoint-invariant property of SphereVLAD. In the experiments, we evaluated the localization performance of SphereVLAD++ on both public KITTI360 datasets and self-generated datasets from the city of Pittsburgh. The experiment results show that SphereVLAD++ outperforms all relative state-of-the-art 3D place recognition methods under small or even totally reversed viewpoint differences and shows 0.69% and 15.81% successful retrieval rates with better than the second best. Low computation requirements and high time efficiency also help its application for low-cost robots.

Comments:	8 pages, 7 figures, IEEE Robotics and Automation Letters
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2207.02958 [cs.CV]
	(or arXiv:2207.02958v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.02958

Submission history

From: Peng Yin [view email]
[v1] Wed, 6 Jul 2022 20:32:43 UTC (5,667 KB)
[v2] Mon, 3 Oct 2022 07:28:40 UTC (6,037 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SphereVLAD++: Attention-based and Signal-enhanced Viewpoint Invariant Descriptor

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SphereVLAD++: Attention-based and Signal-enhanced Viewpoint Invariant Descriptor

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators