Verifiable Planning in Expected Reward Multichain MDPs

Atia, George K.; Beckus, Andre; Alkhouri, Ismail; Velasquez, Alvaro

Computer Science > Artificial Intelligence

arXiv:2012.02178v1 (cs)

[Submitted on 3 Dec 2020 (this version), latest version 23 Oct 2021 (v2)]

Title:Verifiable Planning in Expected Reward Multichain MDPs

Authors:George K. Atia, Andre Beckus, Ismail Alkhouri, Alvaro Velasquez

View PDF

Abstract:The planning domain has experienced increased interest in the formal synthesis of decision-making policies. This formal synthesis typically entails finding a policy which satisfies formal specifications in the form of some well-defined logic, such as Linear Temporal Logic (LTL) or Computation Tree Logic (CTL), among others. While such logics are very powerful and expressive in their capacity to capture desirable agent behavior, their value is limited when deriving decision-making policies which satisfy certain types of asymptotic behavior. In particular, we are interested in specifying constraints on the steady-state behavior of an agent, which captures the proportion of time an agent spends in each state as it interacts for an indefinite period of time with its environment. This is sometimes called the average or expected behavior of the agent. In this paper, we explore the steady-state planning problem of deriving a decision-making policy for an agent such that constraints on its steady-state behavior are satisfied. A linear programming solution for the general case of multichain Markov Decision Processes (MDPs) is proposed and we prove that optimal solutions to the proposed programs yield stationary policies with rigorous guarantees of behavior.

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:2012.02178 [cs.AI]
	(or arXiv:2012.02178v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2012.02178

Submission history

From: George Atia [view email]
[v1] Thu, 3 Dec 2020 18:54:24 UTC (12,133 KB)
[v2] Sat, 23 Oct 2021 19:04:04 UTC (5,371 KB)

Computer Science > Artificial Intelligence

Title:Verifiable Planning in Expected Reward Multichain MDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Verifiable Planning in Expected Reward Multichain MDPs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators