-
Data-driven Day Ahead Market Prices Forecasting: A Focus on Short Training Set Windows
Authors:
Vasilis Michalakopoulos,
Christoforos Menos-Aikateriniadis,
Elissaios Sarmas,
Antonis Zakynthinos,
Pavlos S. Georgilakis,
Dimitris Askounis
Abstract:
This study investigates the performance of machine learning models in forecasting electricity Day-Ahead Market (DAM) prices using short historical training windows, with a focus on detecting seasonal trends and price spikes. We evaluate four models, namely LSTM with Feed Forward Error Correction (FFEC), XGBoost, LightGBM, and CatBoost, across three European energy markets (Greece, Belgium, Ireland) using feature sets derived from ENTSO-E forecast data. Training window lengths range from 7 to 90 days, allowing assessment of model adaptability under constrained data availability. Results indicate that LightGBM consistently achieves the highest forecasting accuracy and robustness, particularly with 45 and 60 day training windows, which balance temporal relevance and learning depth. Furthermore, LightGBM demonstrates superior detection of seasonal effects and peak price events compared to LSTM and other boosting models. These findings suggest that short-window training approaches, combined with boosting methods, can effectively support DAM forecasting in volatile, data-scarce environments.
Submitted 12 June, 2025;
originally announced June 2025.
-
From Transformers to Large Language Models: A systematic review of AI applications in the energy sector towards Agentic Digital Twins
Authors:
Gabriel Antonesi,
Tudor Cioara,
Ionut Anghel,
Vasilis Michalakopoulos,
Elissaios Sarmas,
Liana Toderean
Abstract:
Artificial intelligence (AI) has long promised to improve energy management in smart grids by enhancing situational awareness and supporting more effective decision-making. While traditional machine learning has demonstrated notable results in forecasting and optimization, it often struggles with generalization, situational awareness, and heterogeneous data integration. Recent advances in foundation models such as the Transformer architecture and Large Language Models (LLMs) have demonstrated improved capabilities in modelling complex temporal and contextual relationships, as well as in multi-modal data fusion, which is essential for most AI applications in the energy sector. In this review, we synthesize the rapidly expanding field of AI applications in the energy domain, focusing on Transformers and LLMs. We examine the architectural foundations, domain-specific adaptations, and practical implementations of Transformer models across various forecasting and grid management tasks. We then explore the emerging role of LLMs in the field: adaptation and fine-tuning for the energy sector, the types of tasks they are suited for, and the new challenges they introduce. Along the way, we highlight practical implementations, innovations, and areas where the research frontier is rapidly expanding. The developments reviewed underscore a broader trend: Generative AI (GenAI) is beginning to augment decision-making not only in high-level planning but also in day-to-day operations, from forecasting and grid balancing to workforce training and asset onboarding. Building on these developments, we introduce the concept of the Agentic Digital Twin, a next-generation model that integrates LLMs to bring autonomy, proactivity, and social interaction into digital twin-based energy management systems.
Submitted 3 June, 2025;
originally announced June 2025.
-
A multi-dimensional unsupervised machine learning framework for clustering residential heat load profiles
Authors:
Vasilis Michalakopoulos,
Elissaios Sarmas,
Viktor Daropoulos,
Giannis Kazdaridis,
Stratos Keranidis,
Vangelis Marinakis,
Dimitris Askounis
Abstract:
Central to achieving the energy transition, heating systems provide essential space heating and hot water in residential and industrial environments. A major challenge lies in effectively profiling large clusters of buildings to improve demand estimation and enable efficient Demand Response (DR) schemes. This paper addresses this challenge by introducing an unsupervised machine learning framework for clustering residential heating load profiles, focusing on natural gas space heating and hot water preparation boilers. The profiles are analyzed across five dimensions: boiler usage, heating demand, weather conditions, building characteristics, and user behavior. We apply three distance metrics: Euclidean Distance (ED), Dynamic Time Warping (DTW), and Derivative Dynamic Time Warping (DDTW), and evaluate their performance using established clustering indices. The proposed method is assessed considering 29 residential buildings in Greece equipped with smart meters throughout a calendar heating season (i.e., 210 days). Results indicate that DTW is the most suitable metric, uncovering strong correlations between boiler usage, heat demand, and temperature, while ED highlights broader interrelations across dimensions and DDTW proves less effective, resulting in weaker clusters. These findings offer key insights into heating load behavior, establishing a solid foundation for developing more targeted and effective DR programs.
Submitted 11 November, 2024;
originally announced November 2024.
-
Integrating Dynamic Correlation Shifts and Weighted Benchmarking in Extreme Value Analysis
Authors:
Dimitrios P. Panagoulias,
Elissaios Sarmas,
Vangelis Marinakis,
Maria Virvou,
George A. Tsihrintzis
Abstract:
This paper presents an innovative approach to Extreme Value Analysis (EVA) by introducing the Extreme Value Dynamic Benchmarking Method (EVDBM). EVDBM integrates extreme value theory to detect extreme events and is coupled with the novel Dynamic Identification of Significant Correlation (DISC)-Thresholding algorithm, which enhances the analysis of key variables under extreme conditions. By integrating return values predicted through EVA into the benchmarking scores, we are able to transform these scores to reflect anticipated conditions more accurately. This provides a more precise picture of how each case is projected to unfold under extreme conditions. As a result, the adjusted scores offer a forward-looking perspective, highlighting potential vulnerabilities and resilience factors for each case in a way that static historical data alone cannot capture. By incorporating both historical and probabilistic elements, the EVDBM algorithm provides a comprehensive benchmarking framework that is adaptable to a range of scenarios and contexts. The methodology is applied to real PV data, revealing critical low-production scenarios and significant correlations between variables, which aid in risk management, infrastructure design, and long-term planning, while also allowing for the comparison of different production plants. The flexibility of EVDBM suggests its potential for broader applications in other sectors where decision-making sensitivity is crucial, offering valuable insights to improve outcomes.
Submitted 25 November, 2024; v1 submitted 19 November, 2024;
originally announced November 2024.
-
Home Energy Management Systems: Challenges, Heterogeneity & Integration Architecture Towards A Smart City Ecosystem
Authors:
Georgios Kormpakis,
Alexios Lekidis,
Elissaios Sarmas,
Giannis Papias,
Filippos Serepas,
George Stravodimos,
Vangelis Marinakis
Abstract:
The contemporary era is marked by rapid urban growth and increasing population. A significant, and constantly growing, portion of the global population now resides in major cities, leading to escalating energy demands in urban centers. As the urban population is expected to keep expanding in the near future, so are the associated energy requirements. This continuously increasing energy demand, along with the emergence of smart grids and the capabilities that are already, or can be, offered by Home Energy Management Systems (HEMS), has created many opportunities for a more sustainable future, with optimized energy consumption and demand response based on the actual needs of consumers, leading to economic and environmental benefits. In this paper, we begin by providing an analytical exploration of the challenges faced at both the development and deployment levels. We proceed with a thorough analysis and comparison of the abundance of devices, smart home technologies, and protocols currently used by various products. Next, aiming to mitigate the existing challenges, we propose a reliable, flexible, and extendable architectural schema. Finally, we analyze a number of potential ways in which the data deriving from such implementations can be analyzed and leveraged to produce services that offer useful insights and smart solutions towards enhanced energy efficiency.
Submitted 7 August, 2024;
originally announced August 2024.
-
A Machine Learning-Based Framework for Clustering Residential Electricity Load Profiles to Enhance Demand Response Programs
Authors:
Vasilis Michalakopoulos,
Elissaios Sarmas,
Ioannis Papias,
Panagiotis Skaloumpakas,
Vangelis Marinakis,
Haris Doukas
Abstract:
Load shapes derived from smart meter data are frequently employed to analyze daily energy consumption patterns, particularly in the context of applications like Demand Response (DR). Nevertheless, one of the most important challenges to this endeavor lies in identifying the most suitable consumer clusters with similar consumption behaviors. In this paper, we present a novel machine learning-based framework in order to achieve optimal load profiling through a real case study, utilizing data from almost 5000 households in London. Four widely used clustering algorithms are applied, specifically K-means, K-medoids, Hierarchical Agglomerative Clustering and Density-based Spatial Clustering. An empirical analysis as well as multiple evaluation metrics are leveraged to assess those algorithms. Following that, we redefine the problem as a probabilistic classification one, with the classifier emulating the behavior of a clustering algorithm, leveraging Explainable AI (xAI) to enhance the interpretability of our solution. According to the clustering algorithm analysis, the optimal number of clusters for this case is seven. Despite that, our methodology shows that two of the clusters, almost 10% of the dataset, exhibit significant internal dissimilarity and thus it splits them even further to create nine clusters in total. The scalability and versatility of our solution make it an ideal choice for power utility companies aiming to segment their users for creating more targeted Demand Response programs.
Submitted 31 October, 2023;
originally announced October 2023.