Showing 1–11 of 11 results for author: Chehab, O

Searching in archive cs.
  1. arXiv:2502.20115  [pdf, ps, other]

    cs.LG stat.ML

    Multi-View Causal Discovery without Non-Gaussianity: Identifiability and Algorithms

    Authors: Ambroise Heurtebise, Omar Chehab, Pierre Ablin, Alexandre Gramfort, Aapo Hyvärinen

    Abstract: Causal discovery is a difficult problem that typically relies on strong assumptions on the data-generating model, such as non-Gaussianity. In practice, many modern applications provide multiple related views of the same system, which has rarely been considered for causal discovery. Here, we leverage this multi-view structure to achieve causal discovery with weak assumptions. We propose a multi-vie…

    Submitted 26 September, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

    MSC Class: 62R07 (Primary) 68T05; 05C82 (Secondary)

    ACM Class: I.2.6; I.5.1

  2. arXiv:2502.02300  [pdf, ps, other]

    cs.LG

    Density Ratio Estimation with Conditional Probability Paths

    Authors: Hanlin Yu, Arto Klami, Aapo Hyvärinen, Anna Korba, Omar Chehab

    Abstract: Density ratio estimation in high dimensions can be reframed as integrating a certain quantity, the time score, over probability paths which interpolate between the two densities. In practice, the time score has to be estimated based on samples from the two densities. However, existing methods for this problem remain computationally expensive and can yield inaccurate estimates. Inspired by recent a…

    Submitted 12 June, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: To appear in ICML 2025

  3. arXiv:2501.07426  [pdf, ps, other]

    cs.LG

    MVICAD2: Multi-View Independent Component Analysis with Delays and Dilations

    Authors: Ambroise Heurtebise, Omar Chehab, Pierre Ablin, Alexandre Gramfort

    Abstract: Machine learning techniques in multi-view settings face significant challenges, particularly when integrating heterogeneous data, aligning feature spaces, and managing view-specific biases. These issues are prominent in neuroscience, where data from multiple subjects exposed to the same stimuli are analyzed to uncover brain activity dynamics. In magnetoencephalography (MEG), where signals are capt…

    Submitted 13 August, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: 23 pages, 10 figures

  4. arXiv:2501.00565  [pdf, other]

    stat.CO cs.LG math.ST

    Polynomial time sampling from log-smooth distributions in fixed dimension under semi-log-concavity of the forward diffusion with application to strongly dissipative distributions

    Authors: Adrien Vacher, Omar Chehab, Anna Korba

    Abstract: In this article, we provide a stochastic sampling algorithm with polynomial complexity in fixed dimension that leverages the recent advances on diffusion models where it is shown that under mild conditions, sampling can be achieved via an accurate estimation of intermediate scores across the marginals $(p_t)_{t\ge 0}$ of the standard Ornstein-Uhlenbeck process started at $μ$, the density we wish t…

    Submitted 27 January, 2025; v1 submitted 31 December, 2024; originally announced January 2025.

  5. arXiv:2410.09697  [pdf, other]

    stat.ML cs.LG stat.CO

    Provable Convergence and Limitations of Geometric Tempering for Langevin Dynamics

    Authors: Omar Chehab, Anna Korba, Austin Stromme, Adrien Vacher

    Abstract: Geometric tempering is a popular approach to sampling from challenging multi-modal probability distributions by instead sampling from a sequence of distributions which interpolate, using the geometric mean, between an easier proposal distribution and the target distribution. In this paper, we theoretically investigate the soundness of this approach when the sampling algorithm is Langevin dynamics,…

    Submitted 7 April, 2025; v1 submitted 12 October, 2024; originally announced October 2024.

  6. arXiv:2406.14040  [pdf, other]

    stat.ML cs.LG

    A Practical Diffusion Path for Sampling

    Authors: Omar Chehab, Anna Korba

    Abstract: Diffusion models are state-of-the-art methods in generative modeling when samples from a target probability distribution are available, and can be efficiently sampled, using score matching to estimate score vectors guiding a Langevin process. However, in the setting where samples from the target are not available, e.g. when this target's density is known up to a normalization constant, the score e…

    Submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2310.03902  [pdf, other]

    stat.ML cs.LG

    Provable benefits of annealing for estimating normalizing constants: Importance Sampling, Noise-Contrastive Estimation, and beyond

    Authors: Omar Chehab, Aapo Hyvärinen, Andrej Risteski

    Abstract: Recent research has developed several Monte Carlo methods for estimating the normalization constant (partition function) based on the idea of annealing. This means sampling successively from a path of distributions that interpolate between a tractable "proposal" distribution and the unnormalized "target" distribution. Prominent estimators in this family include annealed importance sampling and ann…

    Submitted 9 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

  8. arXiv:2301.09696  [pdf, other]

    stat.ML cs.LG

    Optimizing the Noise in Self-Supervised Learning: from Importance Sampling to Noise-Contrastive Estimation

    Authors: Omar Chehab, Alexandre Gramfort, Aapo Hyvärinen

    Abstract: Self-supervised learning is an increasingly popular approach to unsupervised learning, achieving state-of-the-art results. A prevalent approach consists in contrasting data points and noise points within a classification task: this requires a good noise distribution which is notoriously hard to specify. While a comprehensive theory is missing, it is widely assumed that the optimal noise distributi…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: text overlap with arXiv:2203.01110

  9. arXiv:2203.01110  [pdf, other]

    stat.ML cs.LG

    The Optimal Noise in Noise-Contrastive Learning Is Not What You Think

    Authors: Omar Chehab, Alexandre Gramfort, Aapo Hyvärinen

    Abstract: Learning a parametric model of a data distribution is a well-known statistical problem that has seen renewed interest as it is brought to scale in deep learning. Framing the problem as a self-supervised task, where data samples are discriminated from noise samples, is at the core of state-of-the-art methods, beginning with Noise-Contrastive Estimation (NCE). Yet, such contrastive learning requires…

    Submitted 26 July, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  10. arXiv:2103.02339  [pdf, other]

    q-bio.NC cs.LG cs.NE

    Deep Recurrent Encoder: A scalable end-to-end network to model brain signals

    Authors: Omar Chehab, Alexandre Defossez, Jean-Christophe Loiseau, Alexandre Gramfort, Jean-Remi King

    Abstract: Understanding how the brain responds to sensory inputs is challenging: brain recordings are partial, noisy, and high dimensional; they vary across sessions and subjects and they capture highly nonlinear dynamics. These challenges have led the community to develop a variety of preprocessing and analytical (almost exclusively linear) methods, each designed to tackle one of these issues. Instead, we…

    Submitted 30 September, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

  11. arXiv:2007.16104  [pdf, other]

    stat.ML cs.LG eess.SP q-bio.NC q-bio.QM

    Uncovering the structure of clinical EEG signals with self-supervised learning

    Authors: Hubert Banville, Omar Chehab, Aapo Hyvärinen, Denis-Alexander Engemann, Alexandre Gramfort

    Abstract: Objective. Supervised learning paradigms are often limited by the amount of labeled data that is available. This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG), where labeling can be costly in terms of specialized expertise and human processing time. Consequently, deep learning architectures designed to learn on EEG data have yielded relati…

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 32 pages, 9 figures