Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship Network

Li, Xinting; Zhang, Shiguang; LU, Yue; Dang, Kerry; Ran, Lingyan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.09883 (cs)

[Submitted on 15 Oct 2023 (v1), last revised 14 Mar 2024 (this version, v2)]

Title:Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship Network

Authors:Xinting Li, Shiguang Zhang, Yue LU, Kerry Dang, Lingyan Ran

View PDF HTML (experimental)

Abstract:This paper investigates the zero-shot object goal visual navigation problem. In the object goal visual navigation task, the agent needs to locate navigation targets from its egocentric visual input. "Zero-shot" means that the target the agent needs to find is not trained during the training phase. To address the issue of coupling navigation ability with target features during training, we propose the Class-Independent Relationship Network (CIRN). This method combines target detection information with the relative semantic similarity between the target and the navigation target, and constructs a brand new state representation based on similarity ranking, this state representation does not include target feature or environment feature, effectively decoupling the agent's navigation ability from target features. And a Graph Convolutional Network (GCN) is employed to learn the relationships between different objects based on their similarities. During testing, our approach demonstrates strong generalization capabilities, including zero-shot navigation tasks with different targets and environments. Through extensive experiments in the AI2-THOR virtual environment, our method outperforms the current state-of-the-art approaches in the zero-shot object goal visual navigation task. Furthermore, we conducted experiments in more challenging cross-target and cross-scene settings, which further validate the robustness and generalization ability of our method. Our code is available at: this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
ACM classes:	I.2.9; I.2.10
Cite as:	arXiv:2310.09883 [cs.CV]
	(or arXiv:2310.09883v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.09883

Submission history

From: Xinting Li [view email]
[v1] Sun, 15 Oct 2023 16:42:14 UTC (1,918 KB)
[v2] Thu, 14 Mar 2024 14:40:15 UTC (1,918 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship Network

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship Network

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators