-
Simultaneous predictive bands for functional time series using minimum entropy sets
Authors:
Nicolás Hernández,
Jairo Cugliari,
Julien Jacques
Abstract:
Functional time series are sequences of dependent random elements taking values in some function space. Most research in this domain focuses on producing a predictor able to forecast the next function after observing part of the sequence. For this, the autoregressive Hilbertian process is a suitable framework. We address here the problem of constructing simultaneous predictive confidence bands for a stationary functional time series. The method is based on an entropy measure for stochastic processes, in particular functional time series. To construct predictive bands we use a functional bootstrap procedure that allows us to estimate the prediction law through the use of pseudo-predictions. Each pseudo-realisation is then projected onto a finite-dimensional space associated with a functional basis. We use Reproducing Kernel Hilbert Spaces (RKHS) to represent the functions, and therefore consider the basis associated with the reproducing kernel. Using a simple decision rule, we classify the points in the projected space into those belonging to the minimum entropy set and those that do not. We map the minimum entropy set back to the functional space and construct a band using the regularity property of the RKHS. The proposed methodology is illustrated on artificial and real-world data sets.
Submitted 4 May, 2023; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Transfer Learning for Linear Regression: a Statistical Test of Gain
Authors:
David Obst,
Badih Ghattas,
Jairo Cugliari,
Georges Oppenheim,
Sandra Claudel,
Yannig Goude
Abstract:
Transfer learning, also referred to as knowledge transfer, aims at reusing knowledge from a source dataset on a similar target one. While many empirical studies illustrate the benefits of transfer learning, few theoretical results have been established, especially for regression problems. In this paper, a theoretical framework for the problem of parameter transfer for the linear model is proposed. It is shown that the quality of transfer for a new input vector $x$ depends on its representation in an eigenbasis involving the parameters of the problem. Furthermore, a statistical test is constructed to predict whether a fine-tuned model has a lower quadratic prediction risk than the base target model for an unobserved sample. The efficiency of the test is illustrated on synthetic data as well as real electricity consumption data.
Submitted 18 February, 2021;
originally announced February 2021.
-
Textual Data for Time Series Forecasting
Authors:
David Obst,
Badih Ghattas,
Sandra Claudel,
Jairo Cugliari,
Yannig Goude,
Georges Oppenheim
Abstract:
While ubiquitous, textual sources of information such as company reports, social media posts, etc. are rarely included in prediction algorithms for time series, despite the relevant information they may contain. In this work, openly accessible daily weather reports from France and the United Kingdom are leveraged to predict time series of national electricity consumption, average temperature and wind speed with a single pipeline. Two methods of numerical representation of text are considered, namely traditional Term Frequency-Inverse Document Frequency (TF-IDF) as well as our own neural word embedding. Using exclusively text, we are able to predict the aforementioned time series with sufficient accuracy to be used to replace missing data. Furthermore, the proposed word embeddings display geometric properties relating to the behavior of the time series and context similarity between words.
Submitted 29 October, 2019; v1 submitted 25 October, 2019;
originally announced October 2019.
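The TF-IDF representation mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the textbook weighting (relative term frequency times the logarithm of the inverse document frequency) and whitespace tokenisation; the paper's actual preprocessing and weighting variant are not stated here, and the `tfidf` helper and the sample reports are hypothetical.

```python
import math
from collections import Counter


def tfidf(documents):
    """Return one {term: weight} dict per document, with weight = tf * idf."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical weather-report snippets, not taken from the paper's corpus.
reports = [
    "cold wind and rain",
    "cold and sunny",
    "strong wind and heavy rain",
]
vectors = tfidf(reports)
```

A term that occurs in every report (here "and") receives weight zero, while a term specific to one report (here "sunny") receives the largest weight in that report.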
-
How to detect novelty in textual data streams? A comparative study of existing methods
Authors:
Clément Christophe,
Julien Velcin,
Jairo Cugliari,
Philippe Suignard,
Manel Boumghar
Abstract:
Since datasets with annotation for novelty at the document and/or word level are not easily available, we present a simulation framework that allows us to create different textual datasets in which we control the way novelty occurs. We also present a benchmark of existing methods for novelty detection in textual data streams. We define a few tasks to solve and compare several state-of-the-art methods. The simulation framework allows us to evaluate their performance according to a set of limited scenarios and test their sensitivity to some parameters. Finally, we experiment with the same methods on different kinds of novelty in the New York Times Annotated Dataset.
Submitted 11 September, 2019;
originally announced September 2019.
-
Bagging of Density Estimators
Authors:
Mathias Bourel,
Jairo Cugliari
Abstract:
In this work we propose new density estimators obtained by averaging classical density estimators, such as the histogram, the frequency polygon and the kernel density estimator, computed over different bootstrap samples of the original data. We prove the $L^2$-consistency of these new estimators and compare them to several similar approaches through extensive simulations. Based on them, we also give a way to construct nonparametric pointwise confidence intervals for the target density.
Submitted 23 August, 2018; v1 submitted 10 August, 2018;
originally announced August 2018.
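The averaging idea in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the base estimator is a Gaussian kernel density estimator with a fixed, hand-picked bandwidth, and the function names, the number of bootstrap samples and the synthetic data are all hypothetical rather than the authors' choices.

```python
import math
import random


def gaussian_kde(sample, grid, bandwidth):
    """Gaussian kernel density estimate of `sample`, evaluated at each grid point."""
    norm = len(sample) * bandwidth * math.sqrt(2.0 * math.pi)
    return [
        sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in sample) / norm
        for x in grid
    ]


def bagged_kde(data, grid, bandwidth, n_bootstrap=30, seed=0):
    """Average the KDEs fitted on `n_bootstrap` resamples drawn with replacement."""
    rng = random.Random(seed)
    total = [0.0] * len(grid)
    for _ in range(n_bootstrap):
        resample = rng.choices(data, k=len(data))
        estimate = gaussian_kde(resample, grid, bandwidth)
        total = [t + e for t, e in zip(total, estimate)]
    return [t / n_bootstrap for t in total]


# Synthetic standard-normal sample, used only for illustration.
rng = random.Random(1)
data = [rng.gauss(0.0, 1.0) for _ in range(100)]
grid = [-6.0 + 0.05 * i for i in range(241)]
density = bagged_kde(data, grid, bandwidth=0.4)
```

Because each bootstrap estimate is itself a density, their average is non-negative and integrates to one. The histogram or the frequency polygon could be swapped in as the base estimator by replacing `gaussian_kde`.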
-
A prediction interval for a function-valued forecast model
Authors:
Anestis Antoniadis,
Xavier Brossat,
Jairo Cugliari,
Jean-Michel Poggi
Abstract:
Starting from the information contained in the shape of the load curves, we have proposed a flexible nonparametric function-valued forecast model called KWF (Kernel+Wavelet+Functional), well suited to handle nonstationary series. The predictor can be seen as a weighted average of futures of past situations, where the weights increase with the similarity between the past situations and the current one. In addition, this strategy provides a simultaneous multiple-horizon prediction. These weights induce a probability distribution that can be used to produce bootstrap pseudo predictions. Prediction intervals are constructed after obtaining the corresponding bootstrap pseudo prediction residuals. We develop two proposals that follow directly from the KWF strategy and compare them to two alternatives proposed by econometricians. The latter construct simultaneous prediction intervals using multiple comparison corrections through the control of the family-wise error (FWE) or the false discovery rate. Alternatively, such prediction intervals can be constructed by bootstrapping joint probability regions. In this work we propose to obtain prediction intervals for the KWF model that are simultaneously valid for the H prediction horizons of the corresponding path forecast, making a connection between functional time series and the econometricians' framework.
Submitted 13 December, 2014;
originally announced December 2014.
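The "weighted average of futures of past situations" described in the abstract above can be sketched in a few lines. This is a simplified illustration, not the KWF model itself: it assumes a Gaussian kernel applied to plain Euclidean distances between curves (KWF works with a wavelet-based representation), and the function name, the bandwidth and the toy curves are hypothetical.

```python
import math


def similarity_forecast(curves, bandwidth):
    """Forecast the curve that follows curves[-1] from equal-length past curves.

    Each curve curves[i] with i < n - 1 is a "past situation" whose "future"
    is curves[i + 1]. Weights decrease with the distance to the current curve.
    """
    current = curves[-1]
    past, futures = curves[:-1], curves[1:]
    # Squared Euclidean distance between each past curve and the current one.
    sq_dists = [sum((p - c) ** 2 for p, c in zip(curve, current)) for curve in past]
    # Gaussian kernel: the closer the past situation, the larger the weight.
    raw = [math.exp(-d / (2.0 * bandwidth ** 2)) for d in sq_dists]
    norm = sum(raw)
    weights = [r / norm for r in raw]
    # One forecast value per point of the curve, i.e. all horizons at once.
    forecast = [
        sum(w * future[h] for w, future in zip(weights, futures))
        for h in range(len(current))
    ]
    return forecast, weights


# Toy series alternating between a low and a high curve.
curves = [
    [0.0, 0.0, 0.0],
    [1.0, 1.0, 1.0],
    [0.0, 0.0, 0.0],
    [1.0, 1.0, 1.0],
    [0.1, 0.1, 0.1],
]
forecast, weights = similarity_forecast(curves, bandwidth=0.5)
```

The current curve resembles the low curves, whose futures were high, so the forecast is close to the high curve. Since the weights are non-negative and sum to one, they can also be read as a probability distribution over the observed futures, which is the property the abstract relies on to generate bootstrap pseudo predictions.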
-
Conditional Autoregressive Hilbertian processes
Authors:
Jairo Cugliari
Abstract:
When considering the problem of forecasting a continuous-time stochastic process over an entire time interval in terms of its recent past, the notion of Autoregressive Hilbert space processes (ARH) arises. This model can be seen as a generalization of the classical autoregressive processes to Hilbert-space-valued random variables. Its estimation presents several challenges that have been addressed by many authors in recent years. In this paper, we propose an extension of this model by introducing a conditioning process on the ARH. In this way, we pursue a double objective. First, the intrinsic linearity of the ARH is overcome. Second, we allow the introduction of exogenous covariates in this function-valued time series model. We begin by defining a new kind of process that we call Conditional ARH. We then propose estimators for the infinite-dimensional parameters associated with such processes. We extend two classes of predictors defined within the ARH framework to our case. Consistency results are provided, as well as a real data application related to electricity load forecasting.
Submitted 14 February, 2013;
originally announced February 2013.
-
Clustering functional data using wavelets
Authors:
Anestis Antoniadis,
Xavier Brossat,
Jairo Cugliari,
Jean-Michel Poggi
Abstract:
We present two methods for detecting patterns and clusters in high-dimensional time-dependent functional data. Our methods are based on wavelet-based similarity measures, since wavelets are well suited for identifying highly discriminant local time and scale features. The multiresolution aspect of the wavelet transform provides a time-scale decomposition of the signals, making it possible to visualize and cluster the functional data into homogeneous groups. For each input function, through its empirical orthogonal wavelet transform, the first method uses the distribution of energy across scales to generate a handful of features that can be sufficient to keep the signals well distinguishable. Our new similarity measure, combined with an efficient feature selection technique in the wavelet domain, is then used within more or less classical clustering algorithms to effectively differentiate among high-dimensional populations. The second method uses dissimilarity measures between the whole time-scale representations and is based on wavelet-coherence tools. The clustering is then performed using a k-centroid algorithm starting from these dissimilarities. The practical performance of these methods, which jointly design both the feature selection in the wavelet domain and the classification distance, is demonstrated through simulations as well as daily profiles of the French electricity power demand.
Submitted 31 May, 2011; v1 submitted 25 January, 2011;
originally announced January 2011.
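The first method in the abstract above summarises each curve by how its energy is distributed across wavelet scales. A minimal sketch of that kind of feature follows, assuming the orthonormal Haar wavelet and a signal length that is a power of two. The choice of wavelet, the function names and the test signal are illustrative assumptions, and the paper's feature selection step is not reproduced.

```python
import math


def haar_energy_by_scale(signal):
    """Return (energy per detail scale, energy of the final approximation).

    `signal` must have a length that is a power of two. Scale 0 holds the
    finest details; each further scale halves the resolution.
    """
    n = len(signal)
    if n == 0 or n & (n - 1):
        raise ValueError("signal length must be a power of two")
    approx = list(signal)
    energies = []
    while len(approx) > 1:
        pairs = list(zip(approx[0::2], approx[1::2]))
        # Orthonormal Haar step: scaled pairwise differences and sums.
        detail = [(a - b) / math.sqrt(2.0) for a, b in pairs]
        approx = [(a + b) / math.sqrt(2.0) for a, b in pairs]
        energies.append(sum(d * d for d in detail))
    return energies, approx[0] ** 2


def relative_energy(signal):
    """Share of the detail energy carried by each scale (non-constant signals only)."""
    energies, _ = haar_energy_by_scale(signal)
    total = sum(energies)
    return [e / total for e in energies]


# Slow sine wave plus a fast alternating component, 16 samples.
signal = [math.sin(2.0 * math.pi * i / 16) + (0.5 if i % 2 else -0.5) for i in range(16)]
energies, approx_energy = haar_energy_by_scale(signal)
features = relative_energy(signal)
```

Because the transform is orthonormal, the detail energies plus the approximation energy add up to the energy of the original signal, so a curve of 16 samples is reduced to 4 scale-wise features without losing track of its total energy.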