Optimal Dynamic Regret in LQR Control

Baby, Dheeraj; Wang, Yu-Xiang

Computer Science > Machine Learning

arXiv:2206.09257 (cs)

[Submitted on 18 Jun 2022]

Title:Optimal Dynamic Regret in LQR Control

Authors:Dheeraj Baby, Yu-Xiang Wang

View PDF

Abstract:We consider the problem of nonstochastic control with a sequence of quadratic losses, i.e., LQR control. We provide an efficient online algorithm that achieves an optimal dynamic (policy) regret of $\tilde{O}(\text{max}\{n^{1/3} \mathcal{TV}(M_{1:n})^{2/3}, 1\})$, where $\mathcal{TV}(M_{1:n})$ is the total variation of any oracle sequence of Disturbance Action policies parameterized by $M_1,...,M_n$ -- chosen in hindsight to cater to unknown nonstationarity. The rate improves the best known rate of $\tilde{O}(\sqrt{n (\mathcal{TV}(M_{1:n})+1)} )$ for general convex losses and we prove that it is information-theoretically optimal for LQR. Main technical components include the reduction of LQR to online linear regression with delayed feedback due to Foster and Simchowitz (2020), as well as a new proper learning algorithm with an optimal $\tilde{O}(n^{1/3})$ dynamic regret on a family of ``minibatched'' quadratic losses, which could be of independent interest.

Subjects:	Machine Learning (cs.LG); Dynamical Systems (math.DS); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2206.09257 [cs.LG]
	(or arXiv:2206.09257v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.09257

Submission history

From: Dheeraj Baby [view email]
[v1] Sat, 18 Jun 2022 18:00:21 UTC (88 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2022-06

Change to browse by:

cs
math
math.DS
math.OC
stat
stat.ML

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Optimal Dynamic Regret in LQR Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Optimal Dynamic Regret in LQR Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators