Skip to main content

Showing 1–5 of 5 results for author: Greenwood, C M T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2507.05990  [pdf, ps, other

    stat.ME

    Multivariate regression with missing response data for modelling regional DNA methylation QTLs

    Authors: Shomoita Alam, Yixiao Zeng, Sasha Bernatsky, Marie Hudson, Inés Colmegna, David A. Stephens, Celia M. T. Greenwood, Archer Y. Yang

    Abstract: Identifying genetic regulators of DNA methylation (mQTLs) with multivariate models enhances statistical power, but is challenged by missing data from bisulfite sequencing. Standard imputation-based methods can introduce bias, limiting reliable inference. We propose \texttt{missoNet}, a novel convex estimation framework that jointly estimates regression coefficients and the precision matrix from da… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  2. arXiv:2410.10082  [pdf, other

    stat.ML cs.LG stat.AP stat.CO

    fastHDMI: Fast Mutual Information Estimation for High-Dimensional Data

    Authors: Kai Yang, Masoud Asgharian, Nikhil Bhagwat, Jean-Baptiste Poline, Celia M. T. Greenwood

    Abstract: In this paper, we introduce fastHDMI, a Python package designed for efficient variable screening in high-dimensional datasets, particularly neuroimaging data. This work pioneers the application of three mutual information estimation methods for neuroimaging variable selection, a novel approach implemented via fastHDMI. These advancements enhance our ability to analyze the complex structures of neu… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: 31 pages, 5 figures

  3. arXiv:2105.12286  [pdf, other

    stat.ME stat.AP stat.CO stat.ML

    An algorithm-based multiple detection influence measure for high dimensional regression using expectile

    Authors: Amadou Barry, Nikhil Bhagwat, Bratislav Misic, Jean-Baptiste Poline, Celia M. T. Greenwood

    Abstract: The identification of influential observations is an important part of data analysis that can prevent erroneous conclusions drawn from biased estimators. However, in high dimensional data, this identification is challenging. Classical and recently-developed methods often perform poorly when there are multiple influential observations in the same dataset. In particular, current methods can fail whe… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: 38 pages, 11 figures

  4. arXiv:2101.07374  [pdf, other

    stat.ME stat.AP

    Detecting differentially methylated regions in bisulfite sequencing data using quasi-binomial mixed models with smooth covariate effect estimates

    Authors: Kaiqiong Zhao, Karim Oualkacha, Lajmi Lakhal-Chaieb, Aurélie Labbe, Kathleen Klein, Sasha Bernatsky, Marie Hudson, Inés Colmegna, Celia M. T. Greenwood

    Abstract: Identifying disease-associated changes in DNA methylation can help to gain a better understanding of disease etiology. Bisulfite sequencing technology allows the generation of methylation profiles at single base of DNA. We previously developed a method for estimating smooth covariate effects and identifying differentially methylated regions (DMRs) from bisulfite sequencing data, which copes with e… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  5. arXiv:1703.08111  [pdf

    stat.AP

    Alternating optimization for GxE modelling with weighted genetic and environmental scores: examples from the MAVAN study

    Authors: Alexia Jolicoeur-Martineau, Ashley Wazana, Eszter Szekely, Meir Steiner, Alison S. Fleming, James L. Kennedy, Michael J. Meaney, Celia M. T. Greenwood

    Abstract: Motivated by the goal of expanding currently existing genotype x environment interaction (GxE) models to simultaneously include multiple genetic variants and environmental exposures in a parsimonious way, we developed a novel method to estimate the parameters in a GxE model, where G is a weighted sum of genetic variants (genetic score) and E is a weighted sum of environments (environmental score).… ▽ More

    Submitted 31 August, 2017; v1 submitted 23 March, 2017; originally announced March 2017.