Skip to main content

Showing 1–7 of 7 results for author: Greenwood, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.05990  [pdf, ps, other

    stat.ME

    Multivariate regression with missing response data for modelling regional DNA methylation QTLs

    Authors: Shomoita Alam, Yixiao Zeng, Sasha Bernatsky, Marie Hudson, Inés Colmegna, David A. Stephens, Celia M. T. Greenwood, Archer Y. Yang

    Abstract: Identifying genetic regulators of DNA methylation (mQTLs) with multivariate models enhances statistical power, but is challenged by missing data from bisulfite sequencing. Standard imputation-based methods can introduce bias, limiting reliable inference. We propose \texttt{missoNet}, a novel convex estimation framework that jointly estimates regression coefficients and the precision matrix from da… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  2. arXiv:2410.10082  [pdf, other

    stat.ML cs.LG stat.AP stat.CO

    fastHDMI: Fast Mutual Information Estimation for High-Dimensional Data

    Authors: Kai Yang, Masoud Asgharian, Nikhil Bhagwat, Jean-Baptiste Poline, Celia M. T. Greenwood

    Abstract: In this paper, we introduce fastHDMI, a Python package designed for efficient variable screening in high-dimensional datasets, particularly neuroimaging data. This work pioneers the application of three mutual information estimation methods for neuroimaging variable selection, a novel approach implemented via fastHDMI. These advancements enhance our ability to analyze the complex structures of neu… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 31 pages, 5 figures

  3. arXiv:2105.12286  [pdf, other

    stat.ME stat.AP stat.CO stat.ML

    An algorithm-based multiple detection influence measure for high dimensional regression using expectile

    Authors: Amadou Barry, Nikhil Bhagwat, Bratislav Misic, Jean-Baptiste Poline, Celia M. T. Greenwood

    Abstract: The identification of influential observations is an important part of data analysis that can prevent erroneous conclusions drawn from biased estimators. However, in high dimensional data, this identification is challenging. Classical and recently-developed methods often perform poorly when there are multiple influential observations in the same dataset. In particular, current methods can fail whe… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: 38 pages, 11 figures

  4. arXiv:2101.07374  [pdf, other

    stat.ME stat.AP

    Detecting differentially methylated regions in bisulfite sequencing data using quasi-binomial mixed models with smooth covariate effect estimates

    Authors: Kaiqiong Zhao, Karim Oualkacha, Lajmi Lakhal-Chaieb, Aurélie Labbe, Kathleen Klein, Sasha Bernatsky, Marie Hudson, Inés Colmegna, Celia M. T. Greenwood

    Abstract: Identifying disease-associated changes in DNA methylation can help to gain a better understanding of disease etiology. Bisulfite sequencing technology allows the generation of methylation profiles at single base of DNA. We previously developed a method for estimating smooth covariate effects and identifying differentially methylated regions (DMRs) from bisulfite sequencing data, which copes with e… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  5. arXiv:1811.07356  [pdf, other

    stat.ME

    A Tracy-Widom Empirical Estimator For Valid P-values With High-Dimensional Datasets

    Authors: Maxime Turgeon, Celia MT Greenwood, Aurelie Labbe

    Abstract: Recent technological advances in many domains including both genomics and brain imaging have led to an abundance of high-dimensional and correlated data being routinely collected. Classical multivariate approaches like Multivariate Analysis of Variance (MANOVA) and Canonical Correlation Analysis (CCA) can be used to study relationships between such multivariate datasets. Yet, special care is requi… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.

  6. arXiv:1712.04058  [pdf

    stat.AP

    Distinguishing differential susceptibility, diathesis-stress and vantage sensitivity: beyond the single gene and environment model

    Authors: Alexia Jolicoeur-Martineau, Jay Belsky, Eszter Szekely, Keith F. Widaman, Michael Pluess, Celia Greenwood, Ashley Wazana

    Abstract: Currently, two main approaches exist to distinguish differential susceptibility from diathesis-stress and vantage sensitivity in genotype x environment interaction (GxE) research: Regions of significance (RoS) and competitive-confirmatory approaches. Each is limited by their single-gene/single-environment foci given that most phenotypes are the product of multiple interacting genetic and environme… ▽ More

    Submitted 21 August, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

  7. arXiv:1703.08111  [pdf

    stat.AP

    Alternating optimization for GxE modelling with weighted genetic and environmental scores: examples from the MAVAN study

    Authors: Alexia Jolicoeur-Martineau, Ashley Wazana, Eszter Szekely, Meir Steiner, Alison S. Fleming, James L. Kennedy, Michael J. Meaney, Celia M. T. Greenwood

    Abstract: Motivated by the goal of expanding currently existing genotype x environment interaction (GxE) models to simultaneously include multiple genetic variants and environmental exposures in a parsimonious way, we developed a novel method to estimate the parameters in a GxE model, where G is a weighted sum of genetic variants (genetic score) and E is a weighted sum of environments (environmental score).… ▽ More

    Submitted 31 August, 2017; v1 submitted 23 March, 2017; originally announced March 2017.