Skip to main content

Showing 1–5 of 5 results for author: Storey, J D

Searching in archive stat. Search in all archives.
.
  1. arXiv:1510.03497  [pdf, other

    stat.ML

    Consistent Estimation of Low-Dimensional Latent Structure in High-Dimensional Data

    Authors: Xiongzhi Chen, John D. Storey

    Abstract: We consider the problem of extracting a low-dimensional, linear latent variable structure from high-dimensional random variables. Specifically, we show that under mild conditions and when this structure manifests itself as a linear space that spans the conditional means, it is possible to consistently recover the structure using only information up to the second moments of these random variables.… ▽ More

    Submitted 12 October, 2015; originally announced October 2015.

  2. arXiv:1312.2041  [pdf, other

    q-bio.PE q-bio.GN q-bio.QM stat.AP stat.ME

    Probabilistic models of genetic variation in structured populations applied to global human studies

    Authors: Wei Hao, Minsun Song, John D. Storey

    Abstract: Modern population genetics studies typically involve genome-wide genotyping of individuals from a diverse network of ancestries. An important, unsolved problem is how to formulate and estimate probabilistic models of observed genotypes that allow for complex population structure. We formulate two general probabilistic models, and we propose computationally efficient algorithms to estimate them. Fi… ▽ More

    Submitted 3 March, 2015; v1 submitted 6 December, 2013; originally announced December 2013.

    Comments: Wei Hao and Minsun Song contributed equally to this work

  3. arXiv:1308.6013  [pdf, other

    stat.ME q-bio.QM stat.AP

    Statistical significance of variables driving systematic variation

    Authors: Neo Christopher Chung, John D. Storey

    Abstract: There are a number of well-established methods such as principal components analysis (PCA) for automatically capturing systematic variation due to latent variables in large-scale genomic data. PCA and related methods may directly provide a quantitative characterization of a complex biological variable that is otherwise difficult to precisely define or model. An unsolved problem in this context is… ▽ More

    Submitted 27 August, 2013; originally announced August 2013.

    Comments: 35 pages, 1 table, 6 main figures, 7 supplementary figures

    Journal ref: Bioinformatics (2015) 31 (4): 545-554

  4. arXiv:1301.3933  [pdf, other

    stat.ME q-bio.GN q-bio.QM

    Gene set bagging for estimating replicability of gene set analyses

    Authors: Andrew E. Jaffe, John D. Storey, Hongkai Ji, Jeffrey T. Leek

    Abstract: Background: Significance analysis plays a major role in identifying and ranking genes, transcription factor binding sites, DNA methylation regions, and other high-throughput features for association with disease. We propose a new approach, called gene set bagging, for measuring the stability of ranking procedures using predefined gene sets. Gene set bagging involves resampling the original high-th… ▽ More

    Submitted 17 January, 2013; v1 submitted 16 January, 2013; originally announced January 2013.

    Comments: 3 Figures

  5. arXiv:1210.3313  [pdf, other

    q-bio.QM q-bio.GN stat.AP stat.ME

    Identifying and Mapping Cell-type Specific Chromatin Programming of Gene Expression

    Authors: Troels T. Marstrand, John D. Storey

    Abstract: A problem of substantial interest is to systematically map variation in chromatin structure to gene expression regulation across conditions, environments, or differentiated cell types. We developed and applied a quantitative framework for determining the existence, strength, and type of relationship between high-resolution chromatin structure in terms of DNaseI hypersensitivity (DHS) and genome-wi… ▽ More

    Submitted 11 October, 2012; originally announced October 2012.

    Comments: First version completed December 2010. Last modified August 2011. We remain in the submission and publication process, so the content of this manuscript may change in the future. With the recent publication of 30 ENCODE papers, we would like to share our related work with the research community. The Supplementary Information may be found among the source files, specifically in arxiv_SI.pdf

    Journal ref: Proceedings of the National Academy of Sciences (2014), 111(6), E645-E654