Learning Logic Specifications for Soft Policy Guidance in POMCP

Mazzi, Giulio; Meli, Daniele; Castellini, Alberto; Farinelli, Alessandro

Computer Science > Artificial Intelligence

arXiv:2303.09172 (cs)

[Submitted on 16 Mar 2023]

Title:Learning Logic Specifications for Soft Policy Guidance in POMCP

Authors:Giulio Mazzi, Daniele Meli, Alberto Castellini, Alessandro Farinelli

View PDF

Abstract:Partially Observable Monte Carlo Planning (POMCP) is an efficient solver for Partially Observable Markov Decision Processes (POMDPs). It allows scaling to large state spaces by computing an approximation of the optimal policy locally and online, using a Monte Carlo Tree Search based strategy. However, POMCP suffers from sparse reward function, namely, rewards achieved only when the final goal is reached, particularly in environments with large state spaces and long horizons. Recently, logic specifications have been integrated into POMCP to guide exploration and to satisfy safety requirements. However, such policy-related rules require manual definition by domain experts, especially in real-world scenarios. In this paper, we use inductive logic programming to learn logic specifications from traces of POMCP executions, i.e., sets of belief-action pairs generated by the planner. Specifically, we learn rules expressed in the paradigm of answer set programming. We then integrate them inside POMCP to provide soft policy bias toward promising actions. In the context of two benchmark scenarios, rocksample and battery, we show that the integration of learned rules from small task instances can improve performance with fewer Monte Carlo simulations and in larger task instances. We make our modified version of POMCP publicly available at this https URL.

Comments:	To appear in the Proceedings of 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2023
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
Cite as:	arXiv:2303.09172 [cs.AI]
	(or arXiv:2303.09172v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2303.09172

Submission history

From: Daniele Meli [view email]
[v1] Thu, 16 Mar 2023 09:37:10 UTC (1,397 KB)

Computer Science > Artificial Intelligence

Title:Learning Logic Specifications for Soft Policy Guidance in POMCP

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Learning Logic Specifications for Soft Policy Guidance in POMCP

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators