Reinforcement Twinning: from digital twins to model-based reinforcement learning

Schena, Lorenzo; Marques, Pedro; Poletti, Romain; Ahizi, Samuel; Berghe, Jan Van den; Mendez, Miguel A.

Electrical Engineering and Systems Science > Systems and Control

arXiv:2311.03628 (eess)

[Submitted on 7 Nov 2023 (v1), last revised 11 Jul 2024 (this version, v4)]

Title:Reinforcement Twinning: from digital twins to model-based reinforcement learning

Authors:Lorenzo Schena, Pedro Marques, Romain Poletti, Samuel Ahizi, Jan Van den Berghe, Miguel A. Mendez

View PDF HTML (experimental)

Abstract:Digital twins promise to revolutionize engineering by offering new avenues for optimization, control, and predictive maintenance. We propose a novel framework for simultaneously training the digital twin of an engineering system and an associated control agent. The twin's training combines adjoint-based data assimilation and system identification methods, while the control agent's training merges model-based optimal control with model-free reinforcement learning. The control agent evolves along two independent paths: one driven by model-based optimal control and the other by reinforcement learning. The digital twin serves as a virtual environment for confrontation and indirect interaction, functioning as an "expert demonstrator." The best policy is selected for real-world interaction and cloned to the other path if training stagnates. We call this framework Reinforcement Twinning (RT). The framework is tested on three diverse engineering systems and control tasks: (1) controlling a wind turbine under varying wind speeds, (2) trajectory control of flapping-wing micro air vehicles (FWMAVs) facing wind gusts, and (3) mitigating thermal loads in managing cryogenic storage tanks. These test cases use simplified models with known ground truth closure laws. Results show that the adjoint-based digital twin training is highly sample-efficient, completing within a few iterations. For the control agent training, both model-based and model-free approaches benefit from their complementary learning experiences. The promising results pave the way for implementing the RT framework on real systems.

Comments:	submitted Journal of Computational Science
Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:2311.03628 [eess.SY]
	(or arXiv:2311.03628v4 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2311.03628

Submission history

From: Miguel Mendez A [view email]
[v1] Tue, 7 Nov 2023 00:24:25 UTC (23,749 KB)
[v2] Wed, 31 Jan 2024 14:49:29 UTC (55,831 KB)
[v3] Sun, 25 Feb 2024 17:18:44 UTC (14,638 KB)
[v4] Thu, 11 Jul 2024 08:28:25 UTC (13,009 KB)

Electrical Engineering and Systems Science > Systems and Control

Title:Reinforcement Twinning: from digital twins to model-based reinforcement learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:Reinforcement Twinning: from digital twins to model-based reinforcement learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators