Parallel Reduced Order Modeling for Digital Twins using High-Performance Computing Workflows
Authors:
S. Ares de Parga,
J. R. Bravo,
N. Sibuet,
J. A. Hernandez,
R. Rossi,
Stefan Boschert,
Enrique S. Quintana-Ortí,
Andrés E. Tomás,
Cristian Cătălin Tatu,
Fernando Vázquez-Novoa,
Jorge Ejarque,
Rosa M. Badia
Abstract:
The integration of reduced-order models (ROMs) with high-performance computing (HPC) is critical for developing digital twins, particularly for real-time monitoring and predictive maintenance of industrial systems. This paper presents a comprehensive, HPC-enabled workflow for developing and deploying projection-based reduced-order models (PROMs) for large-scale mechanical simulations. We use PyCOMPSs' parallel framework to efficiently execute ROM training simulations, employing parallel singular value decomposition (SVD) algorithms such as randomized SVD, Lanczos SVD, and full SVD based on tall-skinny QR (TSQR). Moreover, we introduce a partitioned version of the hyper-reduction scheme known as the Empirical Cubature Method (ECM) to further enhance computational efficiency in PROMs for mechanical systems. Despite the widespread use of HPC for PROMs, there is a significant lack of publications detailing comprehensive workflows for building and deploying end-to-end PROMs in HPC environments. Our workflow is validated through a case study focusing on the thermal dynamics of a motor, a multiphysics problem involving convective heat transfer and mechanical components. The PROM is designed to deliver a real-time prognosis tool that could enable rapid and safe motor restarts post-emergency shutdowns under different operating conditions, demonstrating its potential impact on the practice of simulations in engineering mechanics. To facilitate deployment, we use the Workflow as a Service (WaaS) strategy and Functional Mock-Up Units (FMUs) to ensure compatibility and ease of integration across HPC, edge, and cloud environments. The outcomes illustrate the efficacy of combining PROMs and HPC, establishing a precedent for scalable, real-time digital twin applications in computational mechanics across multiple industries.
Submitted 28 March, 2025; v1 submitted 10 September, 2024;
originally announced September 2024.
Enabling Dynamic and Intelligent Workflows for HPC, Data Analytics, and AI Convergence
Authors:
Jorge Ejarque,
Rosa M. Badia,
Loïc Albertin,
Giovanni Aloisio,
Enrico Baglione,
Yolanda Becerra,
Stefan Boschert,
Julian R. Berlin,
Alessandro D'Anca,
Donatello Elia,
François Exertier,
Sandro Fiore,
José Flich,
Arnau Folch,
Steven J Gibbons,
Nikolay Koldunov,
Francesc Lordan,
Stefano Lorito,
Finn Løvholt,
Jorge Macías,
Fabrizio Marozzo,
Alberto Michelini,
Marisol Monterrubio-Velasco,
Marta Pienkowska,
Josep de la Puente, et al. (12 additional authors not shown)
Abstract:
The evolution of High-Performance Computing (HPC) platforms enables the design and execution of progressively larger and more complex workflow applications in these systems. The complexity comes not only from the number of elements that compose the workflows but also from the type of computations they perform. While traditional HPC workflows target simulations and modelling of physical phenomena, current needs additionally require data analytics (DA) and artificial intelligence (AI) tasks. However, the development of these workflows is hampered by the lack of proper programming models and environments that support the integration of HPC, DA, and AI, as well as the lack of tools to easily deploy and execute the workflows in HPC systems. To progress in this direction, this paper presents use cases where complex workflows are required and investigates the main issues to be addressed for the HPC/DA/AI convergence. Based on this study, the paper identifies the challenges of a new workflow platform to manage complex workflows. Finally, it proposes a development approach for such a workflow platform addressing these challenges in two directions: first, by defining a software stack that provides the functionalities to manage these complex workflows; and second, by proposing the HPC Workflow as a Service (HPCWaaS) paradigm, which leverages the software stack to facilitate the reusability of complex workflows in federated HPC infrastructures. Proposals presented in this work are subject to study and development as part of the EuroHPC eFlows4HPC project.
Submitted 13 May, 2022; v1 submitted 20 April, 2022;
originally announced April 2022.