Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

Faradonbeh, Mohamad Kazem Shirani; Tewari, Ambuj; Michailidis, George

Computer Science > Systems and Control

arXiv:1711.07230 (cs)

[Submitted on 20 Nov 2017 (v1), last revised 29 Mar 2019 (this version, v3)]

Title:Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

Authors:Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

View PDF

Abstract:The main challenge for adaptive regulation of linear-quadratic systems is the trade-off between identification and control. An adaptive policy needs to address both the estimation of unknown dynamics parameters (exploration), as well as the regulation of the underlying system (exploitation). To this end, optimism-based methods which bias the identification in favor of optimistic approximations of the true parameter are employed in the literature. A number of asymptotic results have been established, but their finite time counterparts are few, with important restrictions.
This study establishes results for the worst-case regret of optimism-based adaptive policies. The presented high probability upper bounds are optimal up to logarithmic factors. The non-asymptotic analysis of this work requires very mild assumptions; (i) stabilizability of the system's dynamics, and (ii) limiting the degree of heaviness of the noise distribution. To establish such bounds, certain novel techniques are developed to comprehensively address the probabilistic behavior of dependent random matrices with heavy-tailed distributions.

Comments:	28 pages
Subjects:	Systems and Control (eess.SY); Optimization and Control (math.OC); Applications (stat.AP); Machine Learning (stat.ML)
Cite as:	arXiv:1711.07230 [cs.SY]
	(or arXiv:1711.07230v3 [cs.SY] for this version)
	https://doi.org/10.48550/arXiv.1711.07230

Submission history

From: Mohamad Kazem Shirani Faradonbeh [view email]
[v1] Mon, 20 Nov 2017 09:52:25 UTC (37 KB)
[v2] Sun, 29 Jul 2018 15:54:41 UTC (25 KB)
[v3] Fri, 29 Mar 2019 01:56:55 UTC (28 KB)

Computer Science > Systems and Control

Title:Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Systems and Control

Title:Optimism-Based Adaptive Regulation of Linear-Quadratic Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators