MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model

Zeng, Kang; Shi, Hao; Lin, Jiacheng; Li, Siyu; Cheng, Jintao; Wang, Kaiwei; Li, Zhiyong; Yang, Kailun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.12794 (cs)

[Submitted on 19 Apr 2024 (v1), last revised 6 Aug 2024 (this version, v2)]

Title:MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model

Authors:Kang Zeng, Hao Shi, Jiacheng Lin, Siyu Li, Jintao Cheng, Kaiwei Wang, Zhiyong Li, Kailun Yang

View PDF HTML (experimental)

Abstract:LiDAR-based Moving Object Segmentation (MOS) aims to locate and segment moving objects in point clouds of the current scan using motion information from previous scans. Despite the promising results achieved by previous MOS methods, several key issues, such as the weak coupling of temporal and spatial information, still need further study. In this paper, we propose a novel LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model, termed MambaMOS. Firstly, we develop a novel embedding module, the Time Clue Bootstrapping Embedding (TCBE), to enhance the coupling of temporal and spatial information in point clouds and alleviate the issue of overlooked temporal clues. Secondly, we introduce the Motion-aware State Space Model (MSSM) to endow the model with the capacity to understand the temporal correlations of the same object across different time steps. Specifically, MSSM emphasizes the motion states of the same object at different time steps through two distinct temporal modeling and correlation steps. We utilize an improved state space model to represent these motion differences, significantly modeling the motion states. Finally, extensive experiments on the SemanticKITTI-MOS and KITTI-Road benchmarks demonstrate that the proposed MambaMOS achieves state-of-the-art performance. The source code is publicly available at this https URL.

Comments:	Accepted to ACM MM 2024. The source code is publicly available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Robotics (cs.RO); Image and Video Processing (eess.IV)
Cite as:	arXiv:2404.12794 [cs.CV]
	(or arXiv:2404.12794v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.12794

Submission history

From: Kailun Yang [view email]
[v1] Fri, 19 Apr 2024 11:17:35 UTC (6,542 KB)
[v2] Tue, 6 Aug 2024 03:28:12 UTC (6,640 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators