DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Liu, Peiqi; Guo, Zhanqiu; Warke, Mohit; Chintala, Soumith; Paxton, Chris; Shafiullah, Nur Muhammad Mahi; Pinto, Lerrel

Computer Science > Robotics

arXiv:2411.04999 (cs)

[Submitted on 7 Nov 2024 (v1), last revised 29 May 2025 (this version, v2)]

Title:DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Authors:Peiqi Liu, Zhanqiu Guo, Mohit Warke, Soumith Chintala, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

View PDF HTML (experimental)

Abstract:Significant progress has been made in open-vocabulary mobile manipulation, where the goal is for a robot to perform tasks in any environment given a natural language description. However, most current systems assume a static environment, which limits the system's applicability in real-world scenarios where environments frequently change due to human intervention or the robot's own actions. In this work, we present DynaMem, a new approach to open-world mobile manipulation that uses a dynamic spatio-semantic memory to represent a robot's environment. DynaMem constructs a 3D data structure to maintain a dynamic memory of point clouds, and answers open-vocabulary object localization queries using multimodal LLMs or open-vocabulary features generated by state-of-the-art vision-language models. Powered by DynaMem, our robots can explore novel environments, search for objects not found in memory, and continuously update the memory as objects move, appear, or disappear in the scene. We run extensive experiments on the Stretch SE3 robots in three real and nine offline scenes, and achieve an average pick-and-drop success rate of 70% on non-stationary objects, which is more than a 2x improvement over state-of-the-art static systems. Our code as well as our experiment and deployment videos are open sourced and can be found on our project website: this https URL

Comments:	Website: this https URL
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2411.04999 [cs.RO]
	(or arXiv:2411.04999v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2411.04999

Submission history

From: Peiqi Liu [view email]
[v1] Thu, 7 Nov 2024 18:59:27 UTC (4,506 KB)
[v2] Thu, 29 May 2025 13:57:04 UTC (4,709 KB)

Computer Science > Robotics

Title:DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators