Skip to main content

Showing 1–6 of 6 results for author: Niu, Y S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.12626  [pdf, ps, other

    stat.AP

    Kernel Density Balancing

    Authors: John Park, Ning Hao, Yue Selena Niu, Ming Hu

    Abstract: High-throughput chromatin conformation capture (Hi-C) data provide insights into the 3D structure of chromosomes, with normalization being a crucial pre-processing step. A common technique for normalization is matrix balancing, which rescales rows and columns of a Hi-C matrix to equalize their sums. Despite its popularity and convenience, matrix balancing lacks statistical justification. In this p… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  2. arXiv:2108.09431  [pdf, other

    stat.ME math.ST

    Equivariant Variance Estimation for Multiple Change-point Model

    Authors: Ning Hao, Yue Selena Niu, Han Xiao

    Abstract: The variance of noise plays an important role in many change-point detection procedures and the associated inferences. Most commonly used variance estimators require strong assumptions on the true mean structure or normality of the error distribution, which may not hold in applications. More importantly, the qualities of these estimators have not been discussed systematically in the literature. In… ▽ More

    Submitted 15 November, 2023; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: 44 pages

  3. arXiv:2003.12540  [pdf, other

    stat.ME stat.AP stat.CO

    A super scalable algorithm for short segment detection

    Authors: Ning Hao, Yue Selena Niu, Feifei Xiao, Heping Zhang

    Abstract: In many applications such as copy number variant (CNV) detection, the goal is to identify short segments on which the observations have different means or medians from the background. Those segments are usually short and hidden in a long sequence, and hence are very challenging to find. We study a super scalable short segment (4S) detection algorithm in this paper. This nonparametric method cluste… ▽ More

    Submitted 27 March, 2020; originally announced March 2020.

    Comments: To be published in Statistics in Biosciences

  4. arXiv:1512.04093  [pdf, other

    stat.ME math.ST

    Multiple Change-point Detection: a Selective Overview

    Authors: Yue S. Niu, Ning Hao, Heping Zhang

    Abstract: Very long and noisy sequence data arise from biological sciences to social science including high throughput data in genomics and stock prices in econometrics. Often such data are collected in order to identify and understand shifts in trend, e.g., from a bull market to a bear market in finance or from a normal number of chromosome copies to an excessive number of chromosome copies in genetics. Th… ▽ More

    Submitted 14 July, 2016; v1 submitted 13 December, 2015; originally announced December 2015.

    Comments: 26 pages, 2 figures

  5. arXiv:1511.00282  [pdf, ps, other

    stat.ME

    A New Reduced-Rank Linear Discriminant Analysis Method and Its Applications

    Authors: Yue Selena Niu, Ning Hao, Bin Dong

    Abstract: We consider multi-class classification problems for high dimensional data. Following the idea of reduced-rank linear discriminant analysis (LDA), we introduce a new dimension reduction tool with a flavor of supervised principal component analysis (PCA). The proposed method is computationally efficient and can incorporate the correlation structure among the features. Besides the theoretical insight… ▽ More

    Submitted 25 March, 2017; v1 submitted 1 November, 2015; originally announced November 2015.

    Comments: This is the accepted version which may be slightly different from the published version

  6. The screening and ranking algorithm to detect DNA copy number variations

    Authors: Yue S. Niu, Heping Zhang

    Abstract: DNA Copy number variation (CNV) has recently gained considerable interest as a source of genetic variation that likely influences phenotypic differences. Many statistical and computational methods have been proposed and applied to detect CNVs based on data that generated by genome analysis platforms. However, most algorithms are computationally intensive with complexity at least $O(n^2)$, where… ▽ More

    Submitted 1 October, 2012; originally announced October 2012.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOAS539 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS539

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 3, 1306-1326