Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning

Wu, Qingyuan; Liu, Jianheng; Hao, Jianye; Wang, Jun; Shao, Kun

arXiv:2502.07949 (cs)

[Submitted on 11 Feb 2025 (v1), last revised 20 May 2025 (this version, v2)]

Abstract: State-of-the-art (SOTA) reinforcement learning (RL) methods have enabled vision-language model (VLM) agents to learn from interaction with online environments without human supervision. However, these methods often learn inefficiently when applied to complex, real-world decision-making tasks with sparse rewards and long-horizon dependencies. We propose a novel framework, Variational Subgoal-Conditioned Reinforcement Learning (VSC-RL), that advances VLM agents in solving challenging decision-making tasks. Fundamentally distinct from existing methods, VSC-RL reformulates the decision-making problem as a variational subgoal-conditioned RL problem with a newly derived optimization objective, the Subgoal Evidence Lower BOund (SGC-ELBO), which comprises two key components: (a) maximizing the subgoal-conditioned return, and (b) minimizing the divergence from a reference goal-conditioned policy. We theoretically and empirically demonstrate that VSC-RL can improve learning efficiency without compromising performance guarantees. Across a diverse set of challenging benchmarks, including mobile device and web control tasks, VSC-RL consistently outperforms existing SOTA methods, achieving superior learning efficiency and performance.
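
To make the two components of the objective concrete, the following is a minimal toy sketch, not the paper's SGC-ELBO. It assumes discrete action distributions, one scalar return per subgoal, a KL divergence as the divergence measure, and a hypothetical weighting coefficient `beta`; none of these details are stated in the abstract, and the function names are illustrative only.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same action set.

    Assumes every q[i] is strictly positive wherever p[i] is positive.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def toy_subgoal_objective(subgoal_returns, subgoal_policies, reference_policies, beta=1.0):
    """Toy objective: reward high subgoal-conditioned return, penalize divergence.

    subgoal_returns    -- one scalar return per subgoal (assumed given)
    subgoal_policies   -- action distribution of the subgoal-conditioned policy, per subgoal
    reference_policies -- action distribution of the reference goal-conditioned policy, per subgoal
    beta               -- hypothetical trade-off weight (an assumption, not from the paper)
    """
    total = 0.0
    for ret, policy, reference in zip(subgoal_returns, subgoal_policies, reference_policies):
        # (a) maximize the subgoal-conditioned return,
        # (b) minimize the divergence from the reference policy.
        total += ret - beta * kl_divergence(policy, reference)
    return total


if __name__ == "__main__":
    returns = [1.0, 2.0]
    policies = [[0.9, 0.1], [0.5, 0.5]]
    references = [[0.5, 0.5], [0.5, 0.5]]
    print(toy_subgoal_objective(returns, policies, references))
```

In this sketch, a subgoal-conditioned policy that matches the reference policy pays no penalty, so the objective reduces to the sum of subgoal returns; any deviation from the reference lowers the value in proportion to `beta`.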

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.07949 [cs.LG]
	(or arXiv:2502.07949v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.07949

Submission history

From: Qingyuan Wu
[v1] Tue, 11 Feb 2025 20:57:46 UTC (17,295 KB)
[v2] Tue, 20 May 2025 19:54:36 UTC (25,299 KB)
