Showing 1–8 of 8 results for author: Minsker, S

Searching in archive stat.
  1. arXiv:2305.18681  [pdf, ps, other]

    math.ST stat.ML

    Efficient median of means estimator

    Authors: Stanislav Minsker

    Abstract: The goal of this note is to present a modification of the popular median of means estimator that achieves sub-Gaussian deviation bounds with nearly optimal constants under minimal assumptions on the underlying distribution. We build on recent work on the topic by the author, and prove that the desired guarantees can be attained under weaker requirements.

    Submitted 29 May, 2023; originally announced May 2023.

    MSC Class: 62G35

  2. arXiv:2111.07041  [pdf, other]

    math.ST stat.ML

    Minimax Supervised Clustering in the Anisotropic Gaussian Mixture Model: A new take on Robust Interpolation

    Authors: Stanislav Minsker, Mohamed Ndaoud, Yiqiu Shen

    Abstract: We study the supervised clustering problem under the two-component anisotropic Gaussian mixture model in high dimensions and in the non-asymptotic setting. We first derive a lower and a matching upper bound for the minimax risk of clustering in this framework. We also show that in the high-dimensional regime, the linear discriminant analysis (LDA) classifier turns out to be sub-optimal in the mini…

    Submitted 13 November, 2021; originally announced November 2021.

  3. arXiv:2004.02328  [pdf, other]

    math.ST stat.ML

    Asymptotic normality of robust risk minimizers

    Authors: Stanislav Minsker

    Abstract: This paper investigates asymptotic properties of algorithms that can be viewed as robust analogues of the classical empirical risk minimization. These strategies are based on replacing the usual empirical average by a robust proxy of the mean, such as (a version of) the median of means estimator. It is well known by now that the excess risk of resulting estimators often converges to zero at opti…

    Submitted 30 May, 2023; v1 submitted 5 April, 2020; originally announced April 2020.

    MSC Class: 62F35

  4. arXiv:1910.07485  [pdf, other]

    stat.ML cs.LG

    Excess risk bounds in robust empirical risk minimization

    Authors: Stanislav Minsker, Timothée Mathieu

    Abstract: This paper investigates robust versions of the general empirical risk minimization algorithm, one of the core techniques underlying modern statistical methods. Success of the empirical risk minimization is based on the fact that for a "well-behaved" stochastic process $\left\{ f(X), \ f\in \mathcal F\right\}$ indexed by a class of functions $f\in \mathcal F$, averages…

    Submitted 16 October, 2019; originally announced October 2019.

    MSC Class: 62G35

  5. arXiv:1811.01520  [pdf, other]

    stat.ME math.ST

    User-Friendly Covariance Estimation for Heavy-Tailed Distributions

    Authors: Yuan Ke, Stanislav Minsker, Zhao Ren, Qiang Sun, Wen-Xin Zhou

    Abstract: We offer a survey of recent results on covariance estimation for heavy-tailed distributions. By unifying ideas scattered in the literature, we propose user-friendly methods that facilitate practical implementation. Specifically, we introduce element-wise and spectrum-wise truncation operators, as well as their $M$-estimator counterparts, to robustify the sample covariance matrix. Different from th…

    Submitted 11 March, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: 56 pages, 2 figures

  6. arXiv:1704.02658  [pdf, other]

    math.ST cs.DC stat.ML

    Distributed Statistical Estimation and Rates of Convergence in Normal Approximation

    Authors: Stanislav Minsker, Nate Strawn

    Abstract: This paper presents a class of new algorithms for distributed statistical estimation that exploit the divide-and-conquer approach. We show that one of the key benefits of the divide-and-conquer strategy is robustness, an important characteristic for large distributed systems. We establish connections between the performance of these distributed algorithms and the rates of convergence in normal approximati…

    Submitted 27 August, 2018; v1 submitted 9 April, 2017; originally announced April 2017.

    MSC Class: 68W15; 62G35

  7. arXiv:1605.07129  [pdf, other]

    math.ST stat.ML

    Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries

    Authors: Stanislav Minsker

    Abstract: Estimation of the covariance matrix has attracted a lot of attention from the statistical research community over the years, partially due to important applications such as Principal Component Analysis. However, the frequently used empirical covariance estimator (and its modifications) is very sensitive to outliers in the data. As P. J. Huber wrote in 1964, "...This raises a question which could have be…

    Submitted 17 June, 2018; v1 submitted 23 May, 2016; originally announced May 2016.

    MSC Class: 60B20; 62G35 (Primary) 62H12 (Secondary)

  8. arXiv:1404.2971  [pdf, other]

    stat.ME

    Active Clinical Trials for Personalized Medicine

    Authors: Stanislav Minsker, Ying-Qi Zhao, Guang Cheng

    Abstract: Individualized treatment rules (ITRs) tailor treatments according to individual patient characteristics. They can significantly improve patient care and are thus becoming increasingly popular. The data collected during randomized clinical trials are often used to estimate the optimal ITRs. However, these trials are generally expensive to run, and, moreover, they are not designed to efficiently est…

    Submitted 28 June, 2015; v1 submitted 10 April, 2014; originally announced April 2014.

    Comments: 48 pages, 9 figures. To appear in JASA--T&M