Skip to main content

Showing 1–15 of 15 results for author: Yohai, V J

Searching in archive math. Search in all archives.
.
  1. arXiv:2102.06851  [pdf, other

    stat.ME math.ST stat.CO

    Robust Model-Based Clustering

    Authors: Juan D. Gonzalez, Ricardo Maronna, Victor J. Yohai, Ruben H. Zamar

    Abstract: We propose a new class of robust and Fisher-consistent estimators for mixture models. These estimators can be used to construct robust model-based clustering procedures. We study in detail the case of multivariate normal mixtures and propose a procedure that uses S estimators of multivariate location and scatter. We develop an algorithm to compute the estimators and to build the clusters which is… ▽ More

    Submitted 8 June, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  2. arXiv:1911.03982  [pdf, ps, other

    math.ST

    Optimal robust estimators for families of distributions on the integers

    Authors: Ricardo A. Maronna, Victor J. Yohai

    Abstract: Let F_{θ} be a family of distributions with support on the set of nonnegative integers Z_0. In this paper we derive the M-estimators with smallest gross error sensitivity (GES). We start by defining the uniform median of a distribution F with support on Z_0 (umed(F)) as the median of x+u, where x and u are independent variables with distributions F and uniform in [-0.5,0.5] respectively. Under som… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: 13 pages

    MSC Class: Primary: 62G35; secondary: 62G30

  3. arXiv:1609.00402  [pdf, other

    math.ST

    Multivariate Location and Scatter Matrix Estimation Under Cellwise and Casewise Contamination

    Authors: Andy Leung, Victor J. Yohai, Ruben H. Zamar

    Abstract: We consider the problem of multivariate location and scatter matrix estimation when the data contain cellwise and casewise outliers. Agostinelli et al. (2015) propose a two-step approach to deal with this problem: first, apply a univariate filter to remove cellwise outliers and second, apply a generalized S-estimator to downweight casewise outliers. We improve this proposal in three main direction… ▽ More

    Submitted 25 December, 2016; v1 submitted 1 September, 2016; originally announced September 2016.

    MSC Class: 62G35; 62G05; 62G20

  4. arXiv:1508.01967  [pdf, ps, other

    math.ST

    Robust and sparse estimators for linear regression models

    Authors: Ezequiel Smucler, Víctor J. Yohai

    Abstract: Penalized regression estimators are a popular tool for the analysis of sparse and high-dimensional data sets. However, penalized regression estimators defined using an unbounded loss function can be very sensitive to the presence of outlying observations, especially high leverage outliers. Moreover, it can be particularly challenging to detect outliers in high-dimensional data sets. Thus, robust e… ▽ More

    Submitted 16 October, 2015; v1 submitted 8 August, 2015; originally announced August 2015.

    MSC Class: 62F35; 62J05; 62J07

  5. arXiv:1504.03389  [pdf, ps, other

    math.ST

    Robust and efficient estimation of high dimensional scatter and location

    Authors: Ricardo A. Maronna, Victor J. Yohai

    Abstract: We deal with the equivariant estimation of scatter and location for p-dimensional data, giving emphasis to scatter. It it important that the estimators possess both a high efficiency for normal data and a high resistance to outliers, that is, a low bias under contamination. The most frequently employed estimators are not quite satisfactory in this respect. The Minimum Volume Ellipsoid (MVE) and Mi… ▽ More

    Submitted 13 August, 2015; v1 submitted 13 April, 2015; originally announced April 2015.

    Comments: 24 pages, 4 figures, 15 tables

    MSC Class: 62G35; 62H12

  6. arXiv:1407.2176  [pdf, other

    math.ST

    Composite Robust Estimators for Linear Mixed Models

    Authors: Claudio Agostinelli, Victor J. Yohai

    Abstract: The Classical Tukey-Huber Contamination Model (CCM) is a usual framework to describe the mechanism of outliers generation in robust statistics. In a data set with $n$ observations and $p$ variables, under the CCM, an outlier is a unit, even if only one or few values are corrupted. Classical robust procedures were designed to cope with this setting and the impact of observations were limited whenev… ▽ More

    Submitted 14 July, 2014; v1 submitted 8 July, 2014; originally announced July 2014.

    MSC Class: 62G35; 62G05

  7. arXiv:1406.6031  [pdf, other

    math.ST

    Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination

    Authors: Claudio Agostinelli, Andy Leung, Victor J. Yohai, Ruben H. Zamar

    Abstract: Multivariate location and scatter matrix estimation is a cornerstone in multivariate data analysis. We consider this problem when the data may contain independent cellwise and casewise outliers. Flat data sets with a large number of variables and a relatively small number of cases are common place in modern statistical applications. In these cases global down-weighting of an entire case, as perfor… ▽ More

    Submitted 23 June, 2014; originally announced June 2014.

    MSC Class: 62G35 (Primary); 62G05 (Secondary)

  8. arXiv:1406.4543  [pdf, ps, other

    math.ST

    Dynamic Principal Components in the Time Domain

    Authors: Daniel Peña, Víctor J. Yohai

    Abstract: We propose a time domain approach to define dynamic principal components (DPC) using a reconstruction of the original series criterion. This approach to define DPC was introduced by Brillinger, who gave a very elegant theoretical solution in the stationary case using the cross spectrum. Our procedure can be applied under more general conditions including the case ofnon stationary series and relati… ▽ More

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: 35 pages,6 figures, 5 tables

    MSC Class: 62M10; 62H25

  9. arXiv:1105.5065  [pdf, ps, other

    stat.ME math.ST

    M-estimators for Isotonic Regression

    Authors: Enrique E. Álvarez, Víctor J. Yohai

    Abstract: In this paper we propose a family of robust estimates for isotonic regression: isotonic M-estimators. We show that their asymptotic distribution is, up to an scalar factor, the same as that of Brunk's classical isotonic estimator. We also derive the influence function and the breakdown point of these estimates. Finally we perform a Monte Carlo study that shows that the proposed family includes est… ▽ More

    Submitted 25 May, 2011; originally announced May 2011.

    MSC Class: 62G35

  10. arXiv:1004.5418  [pdf, ps, other

    math.ST

    Robust location estimation with missing data

    Authors: Mariela Sued, Victor J. Yohai

    Abstract: In a missing-data setting, we have a sample in which a vector of explanatory variables x_i is observed for every subject i, while scalar outcomes y_i are missing by happenstance on some individuals. In this work we propose robust estimates of the distribution of the responses assuming missing at random (MAR) data, under a semiparametric regression model. Our approach allows the consistent estimati… ▽ More

    Submitted 17 September, 2010; v1 submitted 29 April, 2010; originally announced April 2010.

  11. Continuity and differentiability of regression M functionals

    Authors: María V. Fasano, Ricardo A. Maronna, Mariela Sued, Víctor J. Yohai

    Abstract: This paper deals with the Fisher-consistency, weak continuity and differentiability of estimating functionals corresponding to a class of both linear and nonlinear regression high breakdown M estimates, which includes S and MM estimates. A restricted type of differentiability, called weak differentiability, is defined, which suffices to prove the asymptotic normality of estimates based on the func… ▽ More

    Submitted 23 November, 2012; v1 submitted 24 April, 2010; originally announced April 2010.

    Comments: Published in at http://dx.doi.org/10.3150/11-BEJ368 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ368

    Journal ref: Bernoulli 2012, Vol. 18, No. 4, 1284-1309

  12. Robust estimation for ARMA models

    Authors: Nora Muler, Daniel Peña, Víctor J. Yohai

    Abstract: This paper introduces a new class of robust estimates for ARMA models. They are M-estimates, but the residuals are computed so the effect of one outlier is limited to the period where it occurs. These estimates are closely related to those based on a robust filter, but they have two important advantages: they are consistent and the asymptotic theory is tractable. We perform a Monte Carlo where w… ▽ More

    Submitted 1 April, 2009; originally announced April 2009.

    Comments: Published in at http://dx.doi.org/10.1214/07-AOS570 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS570 MSC Class: 62F35; 62M10 (Primary)

    Journal ref: Annals of Statistics 2009, Vol. 37, No. 2, 816-840

  13. Propagation of outliers in multivariate data

    Authors: Fatemah Alqallaf, Stefan Van Aelst, Victor J. Yohai, Ruben H. Zamar

    Abstract: We investigate the performance of robust estimates of multivariate location under nonstandard data contamination models such as componentwise outliers (i.e., contamination in each variable is independent from the other variables). This model brings up a possible new source of statistical error that we call "propagation of outliers." This source of error is unusual in the sense that it is generat… ▽ More

    Submitted 3 March, 2009; originally announced March 2009.

    Comments: Published in at http://dx.doi.org/10.1214/07-AOS588 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS588 MSC Class: 62F35 (Primary) 62H12 (Secondary)

    Journal ref: Annals of Statistics 2009, Vol. 37, No. 1, 311-331

  14. High breakdown point robust regression with censored data

    Authors: Matías Salibian-Barrera, Víctor J. Yohai

    Abstract: In this paper, we propose a class of high breakdown point estimators for the linear regression model when the response variable contains censored observations. These estimators are robust against high-leverage outliers and they generalize the LMS (least median of squares), S, MM and $τ$-estimators for linear regression. An important contribution of this paper is that we can define consistent est… ▽ More

    Submitted 12 March, 2008; originally announced March 2008.

    Comments: Published in at http://dx.doi.org/10.1214/009053607000000794 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0314 MSC Class: 62F35; 62J05 (Primary)

    Journal ref: Annals of Statistics 2008, Vol. 36, No. 1, 118-146

  15. Robust nonparametric inference for the median

    Authors: Victor J. Yohai, Ruben H. Zamar

    Abstract: We consider the problem of constructing robust nonparametric confidence intervals and tests of hypothesis for the median when the data distribution is unknown and the data may contain a small fraction of contamination. We propose a modification of the sign test (and its associated confidence interval) which attains the nominal significance level (probability coverage) for any distribution in the… ▽ More

    Submitted 29 March, 2005; originally announced March 2005.

    Comments: Published at http://dx.doi.org/10.1214/009053604000000634 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS283 MSC Class: 62F35 (Primary) 62G35 (Secondary)

    Journal ref: Annals of Statistics 2004, Vol. 32, No. 5, 1841-1857