Levin Tree Search with Context Models

Orseau, Laurent; Hutter, Marcus; Lelis, Levi H. S.

Computer Science > Machine Learning

arXiv:2305.16945v2 (cs)

[Submitted on 26 May 2023 (v1), revised 27 Jun 2023 (this version, v2), latest version 12 Nov 2024 (v3)]

Title:Levin Tree Search with Context Models

Authors:Laurent Orseau, Marcus Hutter, Levi H.S. Lelis

View PDF

Abstract:Levin Tree Search (LTS) is a search algorithm that makes use of a policy (a probability distribution over actions) and comes with a theoretical guarantee on the number of expansions before reaching a goal node, depending on the quality of the policy. This guarantee can be used as a loss function, which we call the LTS loss, to optimize neural networks representing the policy (LTS+NN). In this work we show that the neural network can be substituted with parameterized context models originating from the online compression literature (LTS+CM). We show that the LTS loss is convex under this new model, which allows for using standard convex optimization tools, and obtain convergence guarantees to the optimal parameters in an online setting for a given set of solution trajectories -- guarantees that cannot be provided for neural networks. The new LTS+CM algorithm compares favorably against LTS+NN on several benchmarks: Sokoban (Boxoban), The Witness, and the 24-Sliding Tile puzzle (STP). The difference is particularly large on STP, where LTS+NN fails to solve most of the test instances while LTS+CM solves each test instance in a fraction of a second. Furthermore, we show that LTS+CM is able to learn a policy that solves the Rubik's cube in only a few hundred expansions, which considerably improves upon previous machine learning techniques.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.16945 [cs.LG]
	(or arXiv:2305.16945v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.16945

Submission history

From: Laurent Orseau [view email]
[v1] Fri, 26 May 2023 14:00:12 UTC (124 KB)
[v2] Tue, 27 Jun 2023 10:21:43 UTC (103 KB)
[v3] Tue, 12 Nov 2024 16:23:50 UTC (103 KB)

Computer Science > Machine Learning

Title:Levin Tree Search with Context Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Levin Tree Search with Context Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators