-
D3A-TS: Denoising-Driven Data Augmentation in Time Series
Authors:
David Solis-Martin,
Juan Galan-Paez,
Joaquin Borrego-Diaz
Abstract:
It has been demonstrated that the amount of data is crucial in data-driven machine learning methods. Data is always valuable, but in some tasks, it is almost like gold. This occurs in engineering areas where data is scarce or very expensive to obtain, such as predictive maintenance, where faults are rare. In this context, a mechanism to generate synthetic data can be very useful. While in fields s…
▽ More
It has been demonstrated that the amount of data is crucial in data-driven machine learning methods. Data is always valuable, but in some tasks, it is almost like gold. This occurs in engineering areas where data is scarce or very expensive to obtain, such as predictive maintenance, where faults are rare. In this context, a mechanism to generate synthetic data can be very useful. While in fields such as Computer Vision or Natural Language Processing synthetic data generation has been extensively explored with promising results, in other domains such as time series it has received less attention. This work specifically focuses on studying and analyzing the use of different techniques for data augmentation in time series for classification and regression problems. The proposed approach involves the use of diffusion probabilistic models, which have recently achieved successful results in the field of Image Processing, for data augmentation in time series. Additionally, the use of meta-attributes to condition the data augmentation process is investigated. The results highlight the high utility of this methodology in creating synthetic data to train classification and regression models. To assess the results, six different datasets from diverse domains were employed, showcasing versatility in terms of input size and output types. Finally, an extensive ablation study is conducted to further support the obtained outcomes.
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
On the Soundness of XAI in Prognostics and Health Management (PHM)
Authors:
David Solís-Martín,
Juan Galán-Páez,
Joaquín Borrego-Díaz
Abstract:
The aim of Predictive Maintenance, within the field of Prognostics and Health Management (PHM), is to identify and anticipate potential issues in the equipment before these become critical. The main challenge to be addressed is to assess the amount of time a piece of equipment will function effectively before it fails, which is known as Remaining Useful Life (RUL). Deep Learning (DL) models, such…
▽ More
The aim of Predictive Maintenance, within the field of Prognostics and Health Management (PHM), is to identify and anticipate potential issues in the equipment before these become critical. The main challenge to be addressed is to assess the amount of time a piece of equipment will function effectively before it fails, which is known as Remaining Useful Life (RUL). Deep Learning (DL) models, such as Deep Convolutional Neural Networks (DCNN) and Long Short-Term Memory (LSTM) networks, have been widely adopted to address the task, with great success. However, it is well known that this kind of black box models are opaque decision systems, and it may be hard to explain its outputs to stakeholders (experts in the industrial equipment). Due to the large number of parameters that determine the behavior of these complex models, understanding the reasoning behind the predictions is challenging. This work presents a critical and comparative revision on a number of XAI methods applied on time series regression model for PM. The aim is to explore XAI methods within time series regression, which have been less studied than those for time series classification. The model used during the experimentation is a DCNN trained to predict the RUL of an aircraft engine. The methods are reviewed and compared using a set of metrics that quantifies a number of desirable properties that any XAI method should fulfill. The results show that GRAD-CAM is the most robust method, and that the best layer is not the bottom one, as is commonly seen within the context of Image Processing.
△ Less
Submitted 9 March, 2023;
originally announced March 2023.
-
A stacked deep convolutional neural network to predict the remaining useful life of a turbofan engine
Authors:
David Solis-Martin,
Juan Galan-Paez,
Joaquin Borrego-Diaz
Abstract:
This paper presents the data-driven techniques and methodologies used to predict the remaining useful life (RUL) of a fleet of aircraft engines that can suffer failures of diverse nature. The solution presented is based on two Deep Convolutional Neural Networks (DCNN) stacked in two levels. The first DCNN is used to extract a low-dimensional feature vector using the normalized raw data as input. T…
▽ More
This paper presents the data-driven techniques and methodologies used to predict the remaining useful life (RUL) of a fleet of aircraft engines that can suffer failures of diverse nature. The solution presented is based on two Deep Convolutional Neural Networks (DCNN) stacked in two levels. The first DCNN is used to extract a low-dimensional feature vector using the normalized raw data as input. The second DCNN ingests a list of vectors taken from the former DCNN and estimates the RUL. Model selection was carried out by means of Bayesian optimization using a repeated random subsampling validation approach. The proposed methodology was ranked in the third place of the 2021 PHM Conference Data Challenge.
△ Less
Submitted 24 November, 2021;
originally announced November 2021.
-
Discovering New Sentiments from the Social Web
Authors:
Juan Galan-Paez,
Joaquín Borrego-Díaz
Abstract:
A persistent challenge in Complex Systems (CS) research is the phenomenological reconstruction of systems from raw data. In order to face the problem, the use of sound features to reason on the system from data processing is a key step. In the specific case of complex societal systems, sentiment analysis allows to mirror (part of) the affective dimension. However it is not reasonable to think that…
▽ More
A persistent challenge in Complex Systems (CS) research is the phenomenological reconstruction of systems from raw data. In order to face the problem, the use of sound features to reason on the system from data processing is a key step. In the specific case of complex societal systems, sentiment analysis allows to mirror (part of) the affective dimension. However it is not reasonable to think that individual sentiment categorization can encompass the new affective phenomena in digital social networks.
The present papers addresses the problem of isolating sentiment concepts which emerge in social networks. In an analogy to Artificial Intelligent Singularity, we propose the study and analysis of these new complex sentiment structures and how they are similar to or diverge from classic conceptual structures associated to sentiment lexicons. The conjecture is that it is highly probable that hypercomplex sentiment structures -not explained with human categorizations- emerge from high dynamic social information networks. Roughly speaking, new sentiment can emerge from the new global nervous systems as it occurs in humans.
△ Less
Submitted 29 June, 2014;
originally announced July 2014.
-
Selecting Attributes for Sport Forecasting using Formal Concept Analysis
Authors:
Gonzalo A. Aranda-Corral,
Joaquín Borrego-Díaz,
Juan Galán-Páez
Abstract:
In order to address complex systems, apply pattern recongnition on their evolution could play an key role to understand their dynamics. Global patterns are required to detect emergent concepts and trends, some of them with qualitative nature. Formal Concept Analysis (FCA) is a theory whose goal is to discover and to extract Knowledge from qualitative data. It provides tools for reasoning with impl…
▽ More
In order to address complex systems, apply pattern recongnition on their evolution could play an key role to understand their dynamics. Global patterns are required to detect emergent concepts and trends, some of them with qualitative nature. Formal Concept Analysis (FCA) is a theory whose goal is to discover and to extract Knowledge from qualitative data. It provides tools for reasoning with implication basis (and association rules). Implications and association rules are usefull to reasoning on previously selected attributes, providing a formal foundation for logical reasoning. In this paper we analyse how to apply FCA reasoning to increase confidence in sports betting, by means of detecting temporal regularities from data. It is applied to build a Knowledge-Based system for confidence reasoning.
△ Less
Submitted 4 August, 2011; v1 submitted 27 July, 2011;
originally announced July 2011.