LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks

Wilcox, Albert; Balakrishna, Ashwin; Thananjeyan, Brijen; Gonzalez, Joseph E.; Goldberg, Ken

Computer Science > Machine Learning

arXiv:2107.04775v1 (cs)

[Submitted on 10 Jul 2021 (this version), latest version 21 Sep 2021 (v2)]

Title:LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks

Authors:Albert Wilcox, Ashwin Balakrishna, Brijen Thananjeyan, Joseph E. Gonzalez, Ken Goldberg

View PDF

Abstract:Reinforcement learning (RL) algorithms have shown impressive success in exploring high-dimensional environments to learn complex, long-horizon tasks, but can often exhibit unsafe behaviors and require extensive environment interaction when exploration is unconstrained. A promising strategy for safe learning in dynamically uncertain environments is requiring that the agent can robustly return to states where task success (and therefore safety) can be guaranteed. While this approach has been successful in low-dimensions, enforcing this constraint in environments with high-dimensional state spaces, such as images, is challenging. We present Latent Space Safe Sets (LS3), which extends this strategy to iterative, long-horizon tasks with image observations by using suboptimal demonstrations and a learned dynamics model to restrict exploration to the neighborhood of a learned Safe Set where task completion is likely. We evaluate LS3 on 4 domains, including a challenging sequential pushing task in simulation and a physical cable routing task. We find that LS3 can use prior task successes to restrict exploration and learn more efficiently than prior algorithms while satisfying constraints. See this https URL for code and supplementary material.

Comments:	Preprint, Under Review. First two authors contributed equally
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2107.04775 [cs.LG]
	(or arXiv:2107.04775v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2107.04775

Submission history

From: Ashwin Balakrishna [view email]
[v1] Sat, 10 Jul 2021 06:46:10 UTC (570 KB)
[v2] Tue, 21 Sep 2021 01:37:24 UTC (1,891 KB)

Computer Science > Machine Learning

Title:LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators