-
Functional independent component analysis by choice of norm: a framework for near-perfect classification
Authors:
Marc Vidal,
Marc Leman,
Ana M. Aguilera
Abstract:
We develop a theory for functional independent component analysis in an infinite-dimensional framework using Sobolev spaces that accommodate smoother functions. The notion of penalized kurtosis is introduced motivated by Silverman's method for smoothing principal components. This approach allows for a classical definition of independent components obtained via projection onto the eigenfunctions of…
▽ More
We develop a theory for functional independent component analysis in an infinite-dimensional framework using Sobolev spaces that accommodate smoother functions. The notion of penalized kurtosis is introduced motivated by Silverman's method for smoothing principal components. This approach allows for a classical definition of independent components obtained via projection onto the eigenfunctions of a smoothed kurtosis operator mapping a whitened functional random variable. We discuss the theoretical properties of this operator in relation to a generalized Fisher discriminant function and the relationship it entails with the Feldman-Hájek dichotomy for Gaussian measures, both of which are critical to the principles of functional classification. The proposed estimators are a particularly competitive alternative in binary classification of functional data and can eventually achieve the so-called near-perfect classification, which is a genuine phenomenon of high-dimensional data. Our methods are illustrated through simulations, various real datasets, and used to model electroencephalographic biomarkers for the diagnosis of depressive disorder.
△ Less
Submitted 23 December, 2024;
originally announced December 2024.
-
Different PCA approaches for vector functional time series with applications to resistive switching processes
Authors:
C. Acal,
A. M. Aguilera,
F. J. Alonso,
J. E. Ruiz-Castro,
J. B. Roldán
Abstract:
This paper is motivated by modeling the cycle-to-cycle variability associated with the resistive switching operation behind memristors. As the data are by nature curves, functional principal component analysis is a suitable candidate to explain the main modes of variability. Taking into account this data-driven motivation, in this paper we propose two new forecasting approaches based on studying t…
▽ More
This paper is motivated by modeling the cycle-to-cycle variability associated with the resistive switching operation behind memristors. As the data are by nature curves, functional principal component analysis is a suitable candidate to explain the main modes of variability. Taking into account this data-driven motivation, in this paper we propose two new forecasting approaches based on studying the sequential cross-dependence between and within a multivariate functional time series in terms of vector autoregressive modeling of the most explicative functional principal component scores. The main difference between the two methods lies in whether a univariate or multivariate PCA is performed so that we have a different set of principal component scores for each functional time series or the same one for all of them. Finally, the sample performance of the proposed methodologies is illustrated by an application on a bivariate functional time series of reset-set curves.
△ Less
Submitted 19 November, 2024;
originally announced November 2024.
-
Functional ANOVA approaches for detecting changes in air pollution during the COVID-19 pandemic
Authors:
Christian Acal,
Ana M. Aguilera,
Annalina Sarra,
Adelia Evangelista,
Tonio Di Battista,
Sergio Palermi
Abstract:
Faced with novel coronavirus outbreak, the most hard-hit countries adopted a lockdown strategy to contrast the spread of virus. Many studies have already documented that the COVID-19 control actions have resulted in improved air quality locally and around the world. Following these lines of research, we focus on air quality changes in the urban territory of Chieti-Pescara (Central Italy), identifi…
▽ More
Faced with novel coronavirus outbreak, the most hard-hit countries adopted a lockdown strategy to contrast the spread of virus. Many studies have already documented that the COVID-19 control actions have resulted in improved air quality locally and around the world. Following these lines of research, we focus on air quality changes in the urban territory of Chieti-Pescara (Central Italy), identified as an area of criticality in terms of air pollution. Concentrations of NO2, PM10, PM2.5 and benzene are used to evaluate air pollution changes in this Region. Data were measured by several monitoring stations over two specific periods: from 1st February to 10 th March 2020 (before lockdown period) and from 11st March 2020 to 18 th April 2020 (during lockdown period). The impact of lockdown on air quality is assessed through functional data analysis. Our work makes an important contribution to the analysis of variance for functional data (FANOVA). Specifically, a novel approach based on multivariate functional principal component analysis is introduced to tackle the multivariate FANOVA problem for independent measures, which is reduced to test multivariate homogeneity on the vectors of the most explicative principal components scores. Results of the present study suggest that the level of each pollutant changed during the confinement. Additionally, the differences in the mean functions of all pollutants according to the location and type of monitoring stations (background vs traffic), are ascribable to the PM10 and benzene concentrations for pre-lockdown and during-lockdown tenure, respectively. FANOVA has proven to be beneficial to monitoring the evolution of air quality in both periods of time. This can help environmental protection agencies in drawing a more holistic picture of air quality status in the area of interest.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Stochastic modeling of Random Access Memories reset transitions
Authors:
M Carmen Aguilera-Morillo,
Ana M Aguilera,
Francisco Jiménez-Molinos,
Juan B Roldán
Abstract:
Resistive Random Access Memories (RRAMs) are being studied by the industry and academia because it is widely accepted that they are promising candidates for the next generation of high density nonvolatile memories. Taking into account the stochastic nature of mechanisms behind resistive switching, a new technique based on the use of functional data analysis has been developed to accurately model r…
▽ More
Resistive Random Access Memories (RRAMs) are being studied by the industry and academia because it is widely accepted that they are promising candidates for the next generation of high density nonvolatile memories. Taking into account the stochastic nature of mechanisms behind resistive switching, a new technique based on the use of functional data analysis has been developed to accurately model resistive memory device characteristics. Functional principal component analysis (FPCA) based on Karhunen-Loeve expansion is applied to obtain an orthogonal decomposition of the reset process in terms of uncorrelated scalar random variables. Then, the device current has been accurately described making use of just one variable presenting a modeling approach that can be very attractive from the circuit simulation viewpoint. The new method allows a comprehensive description of the stochastic variability of these devices by introducing a probability distribution that allows the simulation of the main parameter that is employed for the model implementation. A rigorous description of the mathematical theory behind the technique is given and its application for a broad set of experimental measurements is explained.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Multi-class classification of biomechanical data: A functional LDA approach based on multi-class penalized functional PLS
Authors:
M Carmen Aguilera-Morillo,
Ana M Aguilera
Abstract:
A functional linear discriminant analysis approach to classify a set of kinematic data (human movement curves of individuals performing different physical activities) is performed. Kinematic data, usually collected in linear acceleration or angular rotation format, can be identified with functions in a continuous domain (time, percentage of gait cycle, etc.). Since kinematic curves are measured in…
▽ More
A functional linear discriminant analysis approach to classify a set of kinematic data (human movement curves of individuals performing different physical activities) is performed. Kinematic data, usually collected in linear acceleration or angular rotation format, can be identified with functions in a continuous domain (time, percentage of gait cycle, etc.). Since kinematic curves are measured in the same sample of individuals performing different activities, they are a clear example of functional data with repeated measures. On the other hand, the sample curves are observed with noise. Then, a roughness penalty might be necessary in order to provide a smooth estimation of the discriminant functions, which would make them more interpretable. Moreover, because of the infinite dimension of functional data, a reduction dimension technique should be considered. To solve these problems, we propose a multi-class approach for penalized functional partial least squares (FPLS) regression. Then linear discriminant analysis (LDA) will be performed on the estimated FPLS components. This methodology is motivated by two case studies. The first study considers the linear acceleration recorded every two seconds in 30 subjects, related to three different activities (walking, climbing stairs and down stairs). The second study works with the triaxial angular rotation, for each joint, in 51 children when they completed a cycle walking under three conditions (walking, carrying a backpack and pulling a trolley). A simulation study is also developed for comparing the performance of the proposed functional LDA with respect to the corresponding multivariate and non-penalized approaches.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
logitFD: an R package for functional principal component logit regression
Authors:
Manuel Escabias,
Ana M. Aguilera,
Christian Acal
Abstract:
The functional logit regression model was proposed by Escabias et al. (2004) with the objective of modeling a scalar binary response variable from a functional predictor. The model estimation proposed in that case was performed in a subspace of L2(T) of squared integrable functions of finite dimension, generated by a finite set of basis functions. For that estimation it was assumed that the curves…
▽ More
The functional logit regression model was proposed by Escabias et al. (2004) with the objective of modeling a scalar binary response variable from a functional predictor. The model estimation proposed in that case was performed in a subspace of L2(T) of squared integrable functions of finite dimension, generated by a finite set of basis functions. For that estimation it was assumed that the curves of the functional predictor and the functional parameter of the model belong to the same finite subspace. The estimation so obtained was affected by high multicollinearity problems and the solution given to these problems was based on different functional principal component analysis. The logitFD package introduced here provides a toolbox for the fit of these models by implementing the different proposed solutions and by generalizing the model proposed in 2004 to the case of several functional and non-functional predictors. The performance of the functions is illustrated by using data sets of functional data included in the fda.usc package from R-CRAN.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Basis expansion approaches for functional analysis of variance with repeated measures
Authors:
Christian Acal,
Ana M. Aguilera
Abstract:
The methodological contribution in this paper is motivated by biomechanical studies where data characterizing human movement are waveform curves representing joint measures such as flexion angles, velocity, acceleration, and so on. In many cases the aim consists of detecting differences in gait patterns when several independent samples of subjects walk or run under different conditions (repeated m…
▽ More
The methodological contribution in this paper is motivated by biomechanical studies where data characterizing human movement are waveform curves representing joint measures such as flexion angles, velocity, acceleration, and so on. In many cases the aim consists of detecting differences in gait patterns when several independent samples of subjects walk or run under different conditions (repeated measures). Classic kinematic studies often analyse discrete summaries of the sample curves discarding important information and providing biased results. As the sample data are obviously curves, a Functional Data Analysis approach is proposed to solve the problem of testing the equality of the mean curves of a functional variable observed on several independent groups under different treatments or time periods. A novel approach for Functional Analysis of Variance (FANOVA) for repeated measures that takes into account the complete curves is introduced. By assuming a basis expansion for each sample curve, two-way FANOVA problem is reduced to Multivariate ANOVA for the multivariate response of basis coefficients. Then, two different approaches for MANOVA with repeated measures are considered. Besides, an extensive simulation study is developed to check their performance. Finally, two applications with gait data are developed.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Linear-Phase-Type probability modelling of functional PCA with applications to resistive memories
Authors:
Juan E. Ruiz-Castro,
Christian Acal,
Ana M. Aguilera,
M. Carmen Aguilera-Morillo,
Juan B. Roldán
Abstract:
Functional principal component analysis based on Karhunen Loeve expansion allows to describe the stochastic evolution of the main characteristics associated to multiple systems and devices. Identifying the probability distribution of the principal component scores is fundamental to characterize the whole process. The aim of this work is to consider a family of statistical distributions that could…
▽ More
Functional principal component analysis based on Karhunen Loeve expansion allows to describe the stochastic evolution of the main characteristics associated to multiple systems and devices. Identifying the probability distribution of the principal component scores is fundamental to characterize the whole process. The aim of this work is to consider a family of statistical distributions that could be accurately adjusted to a previous transformation. Then, a new class of distributions, the linear-phase-type, is introduced to model the principal components. This class is studied in detail in order to prove, through the KL expansion, that certain linear transformations of the process at each time point are phase-type distributed. This way, the one-dimensional distributions of the process are in the same linear-phase-type class. Finally, an application to model the reset process associated with resistive memories is developed and explained.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Memristor variability and stochastic physical properties modeling from a multivariate time series approach
Authors:
Francisco J. Alonso,
David Maldonado,
Ana M. Aguilera,
Juan B. Roldán
Abstract:
A powerful time series analysis modeling technique is presented to describe cycle-to-cycle variability in memristors. These devices show variability linked to the inherent stochasticity of device operation and it needs to be accurately modeled to build compact models for circuit simulation and design purposes. A new multivariate approach is proposed for the reset and set voltages that accurately d…
▽ More
A powerful time series analysis modeling technique is presented to describe cycle-to-cycle variability in memristors. These devices show variability linked to the inherent stochasticity of device operation and it needs to be accurately modeled to build compact models for circuit simulation and design purposes. A new multivariate approach is proposed for the reset and set voltages that accurately describes the statistical data structure of a resistive switching series. Experimental data were measured from advanced hafnium oxide based devices. The models reproduce the experiments correctly and a comparison of the multivariate and univariate approaches is shown for comparison.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Phase-type distributions for studying variability in resistive memories
Authors:
Christian Acal,
Juan E. Ruiz-Castro,
Ana M. Aguilera,
Francisco Jiménez-Molinos,
Juan B. Roldán
Abstract:
A new statistical approach has been developed to analyze Resistive Random Access Memory (RRAM) variability. The stochastic nature of the physical processes behind the operation of resistive memories makes variability one of the key issues to solve from the industrial viewpoint of these new devices. The statistical features of variability have been usually studied making use of Weibull distribution…
▽ More
A new statistical approach has been developed to analyze Resistive Random Access Memory (RRAM) variability. The stochastic nature of the physical processes behind the operation of resistive memories makes variability one of the key issues to solve from the industrial viewpoint of these new devices. The statistical features of variability have been usually studied making use of Weibull distribution. However, this probability distribution does not work correctly for some resistive memories, in particular for those based on the Ni/HfO2/Si structure that has been employed in this work. A completely new approach based on phase-type modeling is proposed in this paper to characterize the randomness of resistive memories operation. An in-depth comparison with experimental results shows that the fitted phase-type distribution works better than the Weibull distribution and also helps to understand the physics of the resistive memories.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Homogeneity problem for basis expansion of functional data with applications to resistive memories
Authors:
Ana M Aguilera,
Christian Acal,
M Carmen Aguilera-Morillo,
Francisco Jiménez-Molinos,
Juan B. Roldán
Abstract:
The homogeneity problem for testing if more than two different samples come from the same population is considered for the case of functional data. The methodological results are motivated by the study of homogeneity of electronic devices fabricated by different materials and active layer thicknesses. In the case of normality distribution of the stochastic processes associated with each sample, th…
▽ More
The homogeneity problem for testing if more than two different samples come from the same population is considered for the case of functional data. The methodological results are motivated by the study of homogeneity of electronic devices fabricated by different materials and active layer thicknesses. In the case of normality distribution of the stochastic processes associated with each sample, this problem is known as Functional ANOVA problem and is reduced to test the equality of the mean group functions (FANOVA). The problem is that the current/voltage curves associated with Resistive Random Access Memories (RRAM) are not generated by a Gaussian process so that a different approach is necessary for testing homogeneity. To solve this problem two different parametric and nonparametric approaches based on basis expansion of the sample curves are proposed. The first consists of testing multivariate homogeneity tests on a vector of basis coefficients of the sample curves. The second is based on dimension reduction by using functional principal component analysis of the sample curves (FPCA) and testing multivariate homogeneity on a vector of principal components scores. Different approximation numerical techniques are employed to adapt the experimental data for the statistical study. An extensive simulation study is developed for analyzing the performance of both approaches in the parametric and non-parametric cases. Finally, the proposed methodologies are applied on three samples of experimental reset curves measured in three different RRAM technologies.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Bi-Smoothed Functional Independent Component Analysis for EEG Artifact Removal
Authors:
Marc Vidal,
Mattia Rosso,
Ana M. Aguilera
Abstract:
Motivated by mapping adverse artifactual events caused by body movements in electroencephalographic (EEG) signals, we present a functional independent component analysis based on the spectral decomposition of the kurtosis operator of a smoothed principal component expansion. A discrete roughness penalty is introduced in the orthonormality constraint of the covariance eigenfunctions in order to obt…
▽ More
Motivated by mapping adverse artifactual events caused by body movements in electroencephalographic (EEG) signals, we present a functional independent component analysis based on the spectral decomposition of the kurtosis operator of a smoothed principal component expansion. A discrete roughness penalty is introduced in the orthonormality constraint of the covariance eigenfunctions in order to obtain the smoothed basis for the proposed independent component model. To select the tuning parameters, a cross-validation method that incorporates shrinkage is used to enhance the performance on functional representations with large basis dimension. This method provides an estimation strategy to determine the penalty parameter and the optimal number of components. Our independent component approach is applied to real EEG data to estimate genuine brain potentials from a contaminated signal. As a result, it is possible to control high-frequency remnants of neural origin overlapping artifactual sources to optimize their removal from the signal. An R package implementing our methods is available at CRAN.
△ Less
Submitted 27 May, 2021; v1 submitted 14 January, 2021;
originally announced January 2021.
-
The use of the Biorthogonal Decomposition for the identification of zonal flows at TJ-II
Authors:
B. Ph. van Milligen,
E. Sánchez,
A. Alonso,
M. A. Pedrosa,
C. Hidalgo,
A. Martín de Aguilera,
A. López Fraguas
Abstract:
This work addresses the identification of zonal flows in fusion plasmas. Zonal flows are large scale phenomena, hence multipoint measurements taken at remote locations are required for their identification. Given such data, the Biorthogonal Decomposition (or Singular Value Decomposition) is capable of extracting the globally correlated component of the multipoint fluctuations. By using a novel qua…
▽ More
This work addresses the identification of zonal flows in fusion plasmas. Zonal flows are large scale phenomena, hence multipoint measurements taken at remote locations are required for their identification. Given such data, the Biorthogonal Decomposition (or Singular Value Decomposition) is capable of extracting the globally correlated component of the multipoint fluctuations. By using a novel quadrature technique based on the Hilbert transform, propagating global modes (such as MHD modes) can be distinguished from the non-propagating, synchronous (zonal flow-like) global component. The combination of these techniques with further information such as the spectrogram and the spatial structure then allows an unambiguous identification of the zonal flow component of the fluctuations. The technique is tested using gyro-kinetic simulations. The first unambiguous identification of a zonal flow at the TJ-II stellarator is presented, based on multipoint Langmuir probe measurements.
△ Less
Submitted 22 October, 2014; v1 submitted 8 August, 2014;
originally announced August 2014.
-
Parallel and perpendicular turbulence correlation length in the TJ-II Stellarator
Authors:
B. Ph. van Milligen,
A. Lopez Fraguas,
M. A. Pedrosa,
C. Hidalgo,
A. Martín de Aguilera,
E. Ascasíbar
Abstract:
Long range correlations were measured using two remote reciprocating Langmuir probe systems at TJ-II. The influence of the rotational transform on the correlation was studied by scanning the magnetic configuration. A simple drift wave correlation model, assuming an exponential decay of the correlation with different correlation lengths in the directions parallel and perpendicular to the field line…
▽ More
Long range correlations were measured using two remote reciprocating Langmuir probe systems at TJ-II. The influence of the rotational transform on the correlation was studied by scanning the magnetic configuration. A simple drift wave correlation model, assuming an exponential decay of the correlation with different correlation lengths in the directions parallel and perpendicular to the field lines, was found to describe the observations well at low densities. The experiment was repeated at gradually higher densities, and an additional correlation was detected at a critical value of the density. In accordance with previous work, this additional correlation was ascribed to zonal flows associated with a confinement transition. Thus, the total long range correlation is found to be a sum of the drift wave and zonal flow contributions.
△ Less
Submitted 6 June, 2013;
originally announced June 2013.