Skip to main content

Showing 1–4 of 4 results for author: Szafranski, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.04682  [pdf, other

    cs.LG math.ST stat.ML

    Mixture of multilayer stochastic block models for multiview clustering

    Authors: Kylliann De Santiago, Marie Szafranski, Christophe Ambroise

    Abstract: In this work, we propose an original method for aggregating multiple clustering coming from different sources of information. Each partition is encoded by a co-membership matrix between observations. Our approach uses a mixture of multilayer Stochastic Block Models (SBM) to group co-membership matrices with similar information into components and to partition observations into different clusters,… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  2. arXiv:1810.12169  [pdf, other

    stat.AP cs.LG math.ST stat.ME

    Fast Computation of Genome-Metagenome Interaction Effects

    Authors: Florent Guinot, Marie Szafranski, Julien Chiquet, Anouk Zancarini, Christine Le Signor, Christophe Mougel, Christophe Ambroise

    Abstract: Motivation. Association studies have been widely used to search for associations between common genetic variants observations and a given phenotype. However, it is now generally accepted that genes and environment must be examined jointly when estimating phenotypic variance. In this work we consider two types of biological markers: genotypic markers, which characterize an observation in terms of i… ▽ More

    Submitted 18 June, 2020; v1 submitted 29 October, 2018; originally announced October 2018.

  3. arXiv:1710.01085  [pdf, other

    stat.ME

    Learning the optimal scale for GWAS through hierarchical SNP aggregation

    Authors: Florent Guinot, Marie Szafranski, Christophe Ambroise, Franck Samson

    Abstract: Motivation: Genome-Wide Association Studies (GWAS) seek to identify causal genomic variants associated with rare human diseases. The classical statistical approach for detecting these variants is based on univariate hypothesis testing, with healthy individuals being tested against affected individuals at each locus. Given that an individual's genotype is characterized by up to one million SNPs, th… ▽ More

    Submitted 19 October, 2018; v1 submitted 3 October, 2017; originally announced October 2017.

  4. arXiv:0909.1933  [pdf, ps, other

    cs.LG math.ST stat.ML

    Chromatic PAC-Bayes Bounds for Non-IID Data: Applications to Ranking and Stationary $β$-Mixing Processes

    Authors: Liva Ralaivola, Marie Szafranski, Guillaume Stempfel

    Abstract: Pac-Bayes bounds are among the most accurate generalization bounds for classifiers learned from independently and identically distributed (IID) data, and it is particularly so for margin classifiers: there have been recent contributions showing how practical these bounds can be either to perform model selection (Ambroladze et al., 2007) or even to directly guide the learning of linear classifiers… ▽ More

    Submitted 4 June, 2010; v1 submitted 10 September, 2009; originally announced September 2009.

    Comments: Long version of the AISTATS 09 paper: http://jmlr.csail.mit.edu/proceedings/papers/v5/ralaivola09a/ralaivola09a.pdf