Skip to main content

Showing 1–10 of 10 results for author: Lou, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.09346  [pdf, other

    stat.ML cs.LG

    High Confidence Level Inference is Almost Free using Parallel Stochastic Optimization

    Authors: Wanrong Zhu, Zhipeng Lou, Ziyang Wei, Wei Biao Wu

    Abstract: Uncertainty quantification for estimation through stochastic optimization solutions in an online setting has gained popularity recently. This paper introduces a novel inference method focused on constructing confidence intervals with efficient computation and fast convergence to the nominal level. Specifically, we propose to use a small number of independent multi-runs to acquire distribution info… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  2. arXiv:2308.02918  [pdf, other

    stat.ME cs.IT cs.LG math.ST stat.ML

    Spectral Ranking Inferences based on General Multiway Comparisons

    Authors: Jianqing Fan, Zhipeng Lou, Weichen Wang, Mengxin Yu

    Abstract: This paper studies the performance of the spectral method in the estimation and uncertainty quantification of the unobserved preference scores of compared entities in a general and more realistic setup. Specifically, the comparison graph consists of hyper-edges of possible heterogeneous sizes, and the number of comparisons can be as low as one for a given hyper-edge. Such a setting is pervasive in… ▽ More

    Submitted 1 March, 2024; v1 submitted 5 August, 2023; originally announced August 2023.

    Comments: 62 pages, 4 figures

  3. arXiv:2302.12111  [pdf, other

    stat.ME math.ST stat.AP stat.ML

    Communication-Efficient Distributed Estimation and Inference for Cox's Model

    Authors: Pierre Bayle, Jianqing Fan, Zhipeng Lou

    Abstract: Motivated by multi-center biomedical studies that cannot share individual data due to privacy and ownership concerns, we develop communication-efficient iterative distributed algorithms for estimation and inference in the high-dimensional sparse Cox proportional hazards model. We demonstrate that our estimator, even with a relatively small number of iterations, achieves the same convergence rate a… ▽ More

    Submitted 23 June, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

  4. arXiv:2301.04209  [pdf, other

    stat.ME

    High Dimensional Analysis of Variance in Multivariate Linear Regression

    Authors: Zhipeng Lou, Xianyang Zhang, Wei Biao Wu

    Abstract: In this paper, we develop a systematic theory for high dimensional analysis of variance in multivariate linear regression, where the dimension and the number of coefficients can both grow with the sample size. We propose a new \emph{U}~type test statistic to test linear hypotheses and establish a high dimensional Gaussian approximation result under fairly mild moment assumptions. Our general frame… ▽ More

    Submitted 10 January, 2023; originally announced January 2023.

  5. arXiv:2211.11959  [pdf, ps, other

    math.ST cs.LG stat.ME stat.ML

    Robust High-dimensional Tuning Free Multiple Testing

    Authors: Jianqing Fan, Zhipeng Lou, Mengxin Yu

    Abstract: A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-… ▽ More

    Submitted 23 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: In this paper, we develop tuning-free and moment-free high dimensional inference procedures;

  6. arXiv:2211.11957  [pdf, other

    stat.ME cs.IT math.ST stat.ML

    Ranking Inferences Based on the Top Choice of Multiway Comparisons

    Authors: Jianqing Fan, Zhipeng Lou, Weichen Wang, Mengxin Yu

    Abstract: This paper considers ranking inference of $n$ items based on the observed data on the top choice among $M$ randomly selected items at each trial. This is a useful modification of the Plackett-Luce model for $M$-way ranking with only the top choice observed and is an extension of the celebrated Bradley-Terry-Luce model that corresponds to $M=2$. Under a uniform sampling scheme in which any $M$ dist… ▽ More

    Submitted 5 January, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: In this paper, we build simultaneous confidence intervals for ranks through multiway comparisons

  7. arXiv:2203.01219  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Are Latent Factor Regression and Sparse Regression Adequate?

    Authors: Jianqing Fan, Zhipeng Lou, Mengxin Yu

    Abstract: We propose the Factor Augmented sparse linear Regression Model (FARM) that not only encompasses both the latent factor regression and sparse linear regression as special cases but also bridges dimension reduction and sparse regression together. We provide theoretical guarantees for the estimation of our model under the existence of sub-Gaussian and heavy-tailed noises (with bounded (1+x)-th moment… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

  8. arXiv:2201.04982  [pdf

    stat.OT

    An empirical exploration of the diversified R ecosystem

    Authors: Tian-Yuan Huang, Zhilan Lou

    Abstract: Born in the late 20s, R is one of the most popular software for statistical computing and graphics. With the development of information technology and the advent of the big data era, great changes have taken place in the R ecosystem. Based on the meta information of the Comprehensive R Archive Network (CRAN) and the bibliometric data of literature citing R, we discovered that while R is initiated… ▽ More

    Submitted 6 December, 2023; v1 submitted 13 January, 2022; originally announced January 2022.

  9. arXiv:2007.03092  [pdf, other

    cs.LG stat.ML

    Neural Subgraph Matching

    Authors: Rex, Ying, Zhaoyu Lou, Jiaxuan You, Chengtao Wen, Arquimedes Canedo, Jure Leskovec

    Abstract: Subgraph matching is the problem of determining the presence and location(s) of a given query graph in a large target graph. Despite being an NP-complete problem, the subgraph matching problem is crucial in domains ranging from network science and database systems to biochemistry and cognitive science. However, existing techniques based on combinatorial matching and integer programming cannot hand… ▽ More

    Submitted 27 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

  10. arXiv:1906.00216  [pdf, other

    cs.LG cs.CV stat.ML

    Robust Learning Under Label Noise With Iterative Noise-Filtering

    Authors: Duc Tam Nguyen, Thi-Phuong-Nhung Ngo, Zhongyu Lou, Michael Klar, Laura Beggel, Thomas Brox

    Abstract: We consider the problem of training a model under the presence of label noise. Current approaches identify samples with potentially incorrect labels and reduce their influence on the learning process by either assigning lower weights to them or completely removing them from the training set. In the first case the model however still learns from noisy labels; in the latter approach, good training d… ▽ More

    Submitted 1 June, 2019; originally announced June 2019.