Skip to main content

Showing 1–16 of 16 results for author: García, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.06171  [pdf, ps, other

    stat.ME

    Generalized Ridge Regression: Applications to Nonorthogonal Linear Regression Models

    Authors: Román Salmerón Gómez, Catalina García García, Guillermo Hortal Reina

    Abstract: This paper analyzes the possibilities of using the generalized ridge regression to mitigate multicollinearity in a multiple linear regression model. For this purpose, we obtain the expressions for the estimated variance, the coefficient of variation, the coefficient of correlation, the variance inflation factor and the condition number. The results obtained are illustrated with two numerical examp… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 25 pages, 7 figures, 12 tables, working paper

    MSC Class: 62J05

  2. arXiv:2503.04330  [pdf, other

    stat.ME

    Stepwise regression revisited

    Authors: Román Salmerón Gómez, Catalina García García

    Abstract: This paper shows that the degree of approximate multicollinearity in a linear regression model increases simply by including independent variables, even if these are not highly linearly related. In the current situation where it is relatively easy to find linear models with a large number of independent variables, it is shown that this issue can lead to the erroneous conclusion that there is a wor… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 22 pages, 2 figures, 7 tables, working paper

  3. arXiv:2410.17680  [pdf, ps, other

    stat.ME

    Unraveling Residualization: enhancing its application and exposing its relationship with the FWL theorem

    Authors: Catalina García García, Román Salmerón Gómez, Claudia García García

    Abstract: The residualization procedure has been applied in many different fields to estimate models with multicollinearity. However, there exists a lack of understanding of this methodology and some authors discourage its use. This paper aims to contribute to a better understanding of the residualization procedure to promote an adequate application and interpretation of it among statistics and data science… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: 10 pages, 2 tables, working paper

    MSC Class: 62J05

  4. arXiv:2407.02583  [pdf, ps, other

    stat.ME

    Generalized Ridge Regression: Biased Estimation for Multiple Linear Regression Models

    Authors: Román Salmerón Gómez, Catalina García García, Guillermo Hortal Reina

    Abstract: When the regressors of a econometric linear model are nonorthogonal, it is well known that their estimation by ordinary least squares can present various problems that discourage the use of this model. The ridge regression is the most commonly used alternative; however, its generalized version has hardly been analyzed. The present work addresses the estimation of this generalized version, as well… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 23 pages, 5 tables, 7 figures, working paper

    MSC Class: 62J05

  5. arXiv:2407.01172  [pdf, ps, other

    stat.AP

    Enlarging of the sample to address multicollinearity

    Authors: Román Salmerón Gómez, Catalina García García, Ainara Rodríguez Sánchez

    Abstract: The paper analyzes how the enlarging of the sample affects to the mitigation of collinearity concluding that it may mitigate the consequences of collinearity related to statistical analysis but not necessarily the numerical instability. The problem that is addressed is of importance in the teaching of social sciences since it discusses one of the solutions proposed almost unanimously to solve the… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages, 2 tables, working paper

    MSC Class: 62J05

  6. arXiv:2208.00087  [pdf, ps, other

    cs.LG cs.CC cs.CV eess.SP stat.ME

    Low-complexity Approximate Convolutional Neural Networks

    Authors: R. J. Cintra, S. Duffner, C. Garcia, A. Leite

    Abstract: In this paper, we present an approach for minimizing the computational complexity of trained Convolutional Neural Networks (ConvNet). The idea is to approximate all elements of a given ConvNet and replace the original convolutional filters and parameters (pooling and bias coefficients; and activation function) with efficient approximations capable of extreme reductions in computational complexity.… ▽ More

    Submitted 29 July, 2022; originally announced August 2022.

    Comments: 13 pages, 4 figures, 8 tables

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, v. 29, n. 12, Dec. 2018

  7. arXiv:2107.03077  [pdf, ps, other

    stat.CO

    MultiColl package and other packages to detect multicollinearity in R

    Authors: R. Salmerón, C. B. García, J. García

    Abstract: This work presents a guide for the use of some of the functions of the multiColl package in R for the detection of near-multicollinearity. The main contribution, in comparison to other existing packages in R or other econometric software, is the treatment of qualitative independent variables and the intercept in the simple/multiple linear regression model. The main goal of this paper is to show th… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 10 pages, 1 table, working paper

    MSC Class: 62J05

  8. arXiv:2104.14423  [pdf, other

    stat.ME

    The Raise Regression: Justification, properties and application

    Authors: Román Salmerón Gómez, Catalina García García, José García Pérez

    Abstract: Multicollinearity produces an inflation in the variance of the Ordinary Least Squares estimators due to the correlation between two or more independent variables (including the constant term). A widely applied solution is to estimate with penalized estimators (such as the ridge estimator, the Liu estimator, etc.) which exchange the mean square error by the bias. Although the variance diminishes wi… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: 25 pages, 2 figures, 9 tables; working paper

    MSC Class: 62J05

  9. arXiv:2005.02245  [pdf, ps, other

    stat.ME

    Overcoming the inconsistences of the variance inflation factor: a redefined VIF and a test to detect statistical troubling multicollinearity

    Authors: Román Salmerón, Catalina García, José García

    Abstract: Multicollinearity is relevant to many different fields where linear regression models are applied, and its existence may affect the analysis of ordinary least squares (OLS) estimators from both the numerical and statistical points of views. Thus, multicollinearity can lead to incoherence in the statistical significance of the independent variables and the global significance of the model. The vari… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 23 pages, working paper

    MSC Class: 62J05

  10. arXiv:1910.14590  [pdf, ps, other

    stat.CO stat.ME

    "multiColl": An R package to detect multicollinearity

    Authors: Román Salmerón, Catalina García, José García

    Abstract: This work presents a guide for the use of some of the functions of the R package "multiColl" for the detection of near multicollinearity. The main contribution, in comparison to other existing packages in R or other econometric software, is the treatment of qualitative independent variables and the intercept in the simple/multiple linear regression model.

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 15 pages

  11. arXiv:1907.04666  [pdf, other

    cs.LG cs.AI stat.ML

    Routine Modeling with Time Series Metric Learning

    Authors: Paul Compagnon, Grégoire Lefebvre, Stefan Duffner, Christophe Garcia

    Abstract: Traditionally, the automatic recognition of human activities is performed with supervised learning algorithms on limited sets of specific activities. This work proposes to recognize recurrent activity patterns, called routines, instead of precisely defined activities. The modeling of routines is defined as a metric learning problem, and an architecture, called SS2S, based on sequence-to-sequence m… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Journal ref: 28th International Conference on Artificial Neural Networks, Sep 2019, Munich, Germany

  12. arXiv:1901.01583  [pdf, other

    stat.ME

    Sparse estimation for case-control studies with multiple subtypes of cases

    Authors: Nadim Ballout, Cedric Garcia, Vivian Viallon

    Abstract: The analysis of case-control studies with several subtypes of cases is increasingly common, e.g. in cancer epidemiology. For matched designs, we show that a natural strategy is based on a stratified conditional logistic regression model. Then, to account for the potential homogeneity among the subtypes of cases, we adapt the ideas of data shared lasso, which has been recently proposed for the esti… ▽ More

    Submitted 18 January, 2019; v1 submitted 6 January, 2019; originally announced January 2019.

    Comments: 26 pages, 4 figures

  13. arXiv:1811.12081  [pdf, other

    eess.SP cs.LG stat.ML

    Deep Haar Scattering Networks in Pattern Recognition: A promising approach

    Authors: Fernando Fernandes Neto, Alemayehu Admasu Solomon, Rodrigo de Losso, Claudio Garcia, Pedro Delano Cavalcanti

    Abstract: The aim of this paper is to discuss the use of Haar scattering networks, which is a very simple architecture that naturally supports a large number of stacked layers, yet with very few parameters, in a relatively broad set of pattern recognition problems, including regression and classification tasks. This architecture, basically, consists of stacking convolutional filters, that can be thought as… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

  14. arXiv:1804.00760  [pdf

    stat.AP

    Process Control with Highly Left Censored Data

    Authors: Javier Neira Rueda, Andres Carrion Garcia

    Abstract: The need to monitor industrial processes, detecting changes in process parameters in order to promptly correct problems that may arise, generates a particular area of interest. This is particularly critical and complex when the measured value falls below the sensitivity limits of the measuring system or below detection limits, causing much of their observations are incomplete. Such observations to… ▽ More

    Submitted 5 May, 2019; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: 14 pages

  15. arXiv:1704.04375  [pdf, other

    stat.ML physics.data-an

    Non-parametric Estimation of Stochastic Differential Equations with Sparse Gaussian Processes

    Authors: Constantino A. García, Abraham Otero, Paulo Félix, Jesús Presedo, David G. Márquez

    Abstract: The application of Stochastic Differential Equations (SDEs) to the analysis of temporal data has attracted increasing attention, due to their ability to describe complex dynamics with physically interpretable equations. In this paper, we introduce a non-parametric method for estimating the drift and diffusion terms of SDEs from a densely observed discrete time series. The use of Gaussian processes… ▽ More

    Submitted 10 July, 2017; v1 submitted 14 April, 2017; originally announced April 2017.

    Journal ref: Phys. Rev. E 96, 022104 (2017)

  16. arXiv:1411.5179  [pdf, ps, other

    q-bio.QM physics.med-ph stat.AP

    A new algorithm for wavelet-based heart rate variability analysis

    Authors: Constantino A. García, Abraham Otero, Xosé Vila, David G. Márquez

    Abstract: One of the most promising non-invasive markers of the activity of the autonomic nervous system is Heart Rate Variability (HRV). HRV analysis toolkits often provide spectral analysis techniques using the Fourier transform, which assumes that the heart rate series is stationary. To overcome this issue, the Short Time Fourier Transform is often used (STFT). However, the wavelet transform is thought t… ▽ More

    Submitted 19 November, 2014; originally announced November 2014.

    Journal ref: Biomedical Signal Processing and Control, Volume 8, Issue 6, November 2013, Pages 542-550