Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics

Zeng, Kuo-Hao; Weihs, Luca; Mottaghi, Roozbeh; Farhadi, Ali

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.12289 (cs)

[Submitted on 24 Apr 2023]

Title:Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics

Authors:Kuo-Hao Zeng, Luca Weihs, Roozbeh Mottaghi, Ali Farhadi

View PDF

Abstract:A common assumption when training embodied agents is that the impact of taking an action is stable; for instance, executing the "move ahead" action will always move the agent forward by a fixed distance, perhaps with some small amount of actuator-induced noise. This assumption is limiting; an agent may encounter settings that dramatically alter the impact of actions: a move ahead action on a wet floor may send the agent twice as far as it expects and using the same action with a broken wheel might transform the expected translation into a rotation. Instead of relying that the impact of an action stably reflects its pre-defined semantic meaning, we propose to model the impact of actions on-the-fly using latent embeddings. By combining these latent action embeddings with a novel, transformer-based, policy head, we design an Action Adaptive Policy (AAP). We evaluate our AAP on two challenging visual navigation tasks in the AI2-THOR and Habitat environments and show that our AAP is highly performant even when faced, at inference-time with missing actions and, previously unseen, perturbed action space. Moreover, we observe significant improvement in robustness against these actions when evaluating in real-world scenarios.

Comments:	21 pages, 17 figures, ICLR 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2304.12289 [cs.CV]
	(or arXiv:2304.12289v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.12289

Submission history

From: Kuo-Hao Zeng [view email]
[v1] Mon, 24 Apr 2023 17:35:47 UTC (39,656 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators