Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction

Kwak, Yunhyeok; Hwang, Inwoo; Kim, Dooyoung; Lee, Sanghack; Zhang, Byoung-Tak

arXiv:2406.00614 (cs)

[Submitted on 2 Jun 2024]

Abstract: Monte Carlo Tree Search (MCTS) has showcased its efficacy across a broad spectrum of decision-making problems. However, its performance often degrades under a vast combinatorial action space, especially where an action is composed of multiple sub-actions. In this work, we propose an action abstraction based on the compositional structure between a state and sub-actions for improving the efficiency of MCTS under a factored action space. Our method learns a latent dynamics model with an auxiliary network that captures the sub-actions relevant to the transition on the current state, which we call state-conditioned action abstraction. Notably, it infers such compositional relationships from high-dimensional observations without a known environment model. During the tree traversal, our method constructs the state-conditioned action abstraction for each node on-the-fly, reducing the search space by discarding the exploration of redundant sub-actions. Experimental results demonstrate the superior sample efficiency of our method compared to vanilla MuZero, which suffers from the expansive action space.
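
To make the size of the reduction concrete, the following is a minimal sketch of how pruning irrelevant sub-actions shrinks the set of actions to expand at a single node. It is an illustration under stated assumptions, not the authors' implementation: the function name `abstract_actions`, the boolean `relevant_mask` input, and the choice of a fixed `default` value for irrelevant sub-actions are all hypothetical. In the paper, the relevance of each sub-action is inferred by a learned auxiliary network; here the mask is simply supplied by hand.

```python
from itertools import product


def abstract_actions(sub_action_sizes, relevant_mask, default=0):
    """Enumerate one representative action per abstract action class.

    sub_action_sizes: number of choices available for each sub-action.
    relevant_mask: relevant_mask[i] is True when sub-action i is judged to
        affect the transition in the current state (hypothetical input; the
        paper infers this with a learned network).
    default: value assigned to every irrelevant sub-action (an assumption
        made for this sketch).
    """
    if len(sub_action_sizes) != len(relevant_mask):
        raise ValueError("sub_action_sizes and relevant_mask must align")

    # Relevant sub-actions keep all their choices; irrelevant ones collapse
    # to a single representative value.
    choices = [
        range(size) if keep else (default,)
        for size, keep in zip(sub_action_sizes, relevant_mask)
    ]
    return list(product(*choices))


# Three sub-actions with three choices each: 27 composite actions in total.
sizes = [3, 3, 3]
full = abstract_actions(sizes, [True, True, True])
pruned = abstract_actions(sizes, [True, False, False])
print(len(full), len(pruned))
```

With only the first sub-action marked relevant, the node would expand 3 representative actions instead of 27. Because the mask depends on the state, a different node in the same tree could keep a different subset of sub-actions.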

Comments:	UAI 2024 (Oral). The first two authors contributed equally
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.00614 [cs.LG]
	(or arXiv:2406.00614v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.00614

Submission history

From: Inwoo Hwang
[v1] Sun, 2 Jun 2024 04:31:30 UTC (2,363 KB)
