Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning

Emami, Patrick; Zhang, Xiangyu; Biagioni, David; Zamzam, Ahmed S.

arXiv:2307.08794 (cs)

[Submitted on 17 Jul 2023]

Abstract: In multi-timescale multi-agent reinforcement learning (MARL), agents interact across different timescales. In general, policies for time-dependent behaviors, such as those induced by multiple timescales, are non-stationary. Learning non-stationary policies is challenging and typically requires sophisticated or inefficient algorithms. Motivated by the prevalence of this control problem in real-world complex systems, we introduce a simple framework for learning non-stationary policies for multi-timescale MARL. Our approach uses available information about agent timescales to define a periodic time encoding. Specifically, we show theoretically that the effects of non-stationarity introduced by multiple timescales can be learned by a periodic multi-agent policy. To learn such policies, we propose a policy gradient algorithm that parameterizes the actor and critic with phase-functioned neural networks, which provide an inductive bias for periodicity. The framework's ability to learn multi-timescale policies effectively is validated on a gridworld environment and a building energy management environment.
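
As a rough illustration of the two ideas named in the abstract (a periodic time encoding built from known agent timescales, and network weights that vary with phase), the following minimal Python sketch may help. It is not the authors' implementation. It assumes that each agent acts every k_i environment steps, that the joint period is the least common multiple of those intervals, and that the encoding is a sine/cosine pair; the phase-dependent layer uses plain cyclic linear interpolation between weight vectors as a simplified stand-in for a phase-functioned network. All function names are hypothetical.

```python
import math
from functools import reduce


def joint_period(timescales):
    """Least common multiple of the agents' action intervals (assumed integers)."""
    return reduce(lambda a, b: a * b // math.gcd(a, b), timescales)


def phase(t, period):
    """Map an integer time step to a phase in [0, 1)."""
    return (t % period) / period


def time_encoding(t, period):
    """Periodic encoding of time as a point on the unit circle (an assumption)."""
    angle = 2.0 * math.pi * phase(t, period)
    return (math.sin(angle), math.cos(angle))


def blend_weights(control_weights, p):
    """Cyclically interpolate between K weight vectors at phase p in [0, 1).

    Linear interpolation is a simplification chosen for brevity.
    """
    k = len(control_weights)
    pos = p * k
    lo = int(pos) % k          # control point at or before the phase
    hi = (lo + 1) % k          # next control point, wrapping around
    frac = pos - int(pos)
    return [(1.0 - frac) * a + frac * b
            for a, b in zip(control_weights[lo], control_weights[hi])]


def phase_functioned_linear(x, control_weights, p):
    """Linear unit whose weights depend on the phase p."""
    w = blend_weights(control_weights, p)
    return sum(wi * xi for wi, xi in zip(w, x))


# Hypothetical example: three agents acting every 1, 2, and 3 steps.
period = joint_period([1, 2, 3])
p = phase(7, period)
y = phase_functioned_linear([1.0, 2.0], [[0.0, 0.0], [1.0, 1.0]], p)
```

Because the phase wraps around at the joint period, any function of it is periodic in time by construction, which is the kind of inductive bias the abstract attributes to phase-functioned networks.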

Comments:	Accepted at IEEE CDC'23. 7 pages, 6 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
Cite as:	arXiv:2307.08794 [cs.LG]
	(or arXiv:2307.08794v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.08794

Submission history

From: Patrick Emami
[v1] Mon, 17 Jul 2023 19:25:46 UTC (213 KB)
