Skip to main content

Showing 1–31 of 31 results for author: Besserve, M

.
  1. arXiv:2408.05647  [pdf, other

    cs.LG stat.ME

    Controlling for discrete unmeasured confounding in nonlinear causal models

    Authors: Patrick Burauel, Frederick Eberhardt, Michel Besserve

    Abstract: Unmeasured confounding is a major challenge for identifying causal relationships from non-experimental data. Here, we propose a method that can accommodate unmeasured discrete confounding. Extending recent identifiability results in deep latent variable models, we show theoretically that confounding can be detected and corrected under the assumption that the observed data is a piecewise affine tra… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

  2. arXiv:2312.13438  [pdf, ps, other

    stat.ML cs.LG

    Independent Mechanism Analysis and the Manifold Hypothesis

    Authors: Shubhangi Ghosh, Luigi Gresele, Julius von Kügelgen, Michel Besserve, Bernhard Schölkopf

    Abstract: Independent Mechanism Analysis (IMA) seeks to address non-identifiability in nonlinear Independent Component Analysis (ICA) by assuming that the Jacobian of the mixing function has orthogonal columns. As typical in ICA, previous work focused on the case with an equal number of latent components and observed mixtures. Here, we extend IMA to settings with a larger number of mixtures that reside on a… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 6 pages, Accepted at Neurips Causal Representation Learning 2023

  3. arXiv:2311.18639  [pdf, other

    stat.ML cs.LG

    Targeted Reduction of Causal Models

    Authors: Armin Kekić, Bernhard Schölkopf, Michel Besserve

    Abstract: Why does a phenomenon occur? Addressing this question is central to most scientific inquiries and often relies on simulations of scientific models. As models become more intricate, deciphering the causes behind phenomena in high-dimensional spaces of interconnected variables becomes increasingly challenging. Causal Representation Learning (CRL) offers a promising avenue to uncover interpretable ca… ▽ More

    Submitted 3 June, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  4. arXiv:2306.00907  [pdf, other

    physics.chem-ph

    A site-site interaction two-dimensional model with water like structural properties

    Authors: Tangi Baré, Maxime Besserve, Tomaz Urbic, Aurélien Perera

    Abstract: A site-site interaction model is proposed for water in two-dimension, as an alternative to the traditional Mercedes-Benz model. In MB model, water molecules are modeled as 2-dimensional Lennard-Jones disks with three hydrogen bonding arms arranged symmetrically, resembling the Mercedes-Benz logo. The MB model qualitatively predicts both the anomalous properties of pure water and the anomalous solv… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 24 pages, 17 figures

  5. arXiv:2306.00542  [pdf, other

    stat.ML cs.AI cs.LG

    Nonparametric Identifiability of Causal Representations from Unknown Interventions

    Authors: Julius von Kügelgen, Michel Besserve, Liang Wendong, Luigi Gresele, Armin Kekić, Elias Bareinboim, David M. Blei, Bernhard Schölkopf

    Abstract: We study causal representation learning, the task of inferring latent causal variables and their causal relations from high-dimensional mixtures of the variables. Prior work relies on weak supervision, in the form of counterfactual pre- and post-intervention views or temporal structure; places restrictive assumptions, such as linearity, on the mixing function or latent causal model; or requires pa… ▽ More

    Submitted 28 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready version; 36 pages, 4 figures

    MSC Class: 68T05 ACM Class: I.2.6

  6. arXiv:2305.17225  [pdf, other

    stat.ML cs.AI cs.LG

    Causal Component Analysis

    Authors: Liang Wendong, Armin Kekić, Julius von Kügelgen, Simon Buchholz, Michel Besserve, Luigi Gresele, Bernhard Schölkopf

    Abstract: Independent Component Analysis (ICA) aims to recover independent latent variables from observed mixtures thereof. Causal Representation Learning (CRL) aims instead to infer causally related (thus often statistically dependent) latent variables, together with the unknown graph encoding their causal relationships. We introduce an intermediate problem termed Causal Component Analysis (CauCA). CauCA c… ▽ More

    Submitted 17 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 final camera-ready version

  7. arXiv:2209.07508  [pdf, other

    q-bio.NC stat.ME stat.ML

    Information Theoretic Measures of Causal Influences during Transient Neural Events

    Authors: Kaidi Shao, Nikos K. Logothetis, Michel Besserve

    Abstract: Transient phenomena play a key role in coordinating brain activity at multiple scales, however,their underlying mechanisms remain largely unknown. A key challenge for neural data science is thus to characterize the network interactions at play during these events. Using the formalism of Structural Causal Models and their graphical representation, we investigate the theoretical and empirical proper… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  8. arXiv:2208.06406  [pdf, other

    stat.ML cs.LG

    Function Classes for Identifiable Nonlinear Independent Component Analysis

    Authors: Simon Buchholz, Michel Besserve, Bernhard Schölkopf

    Abstract: Unsupervised learning of latent variable models (LVMs) is widely used to represent data in machine learning. When such models reflect the ground truth factors and the mechanisms mapping them to observations, there is reason to expect that they allow generalization in downstream tasks. It is however well known that such identifiability guaranties are typically not achievable without putting constra… ▽ More

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: 43 pages

    Journal ref: NeurIPS 2022

  9. arXiv:2207.12067  [pdf, other

    cs.LG math.GR stat.ML

    Homomorphism Autoencoder -- Learning Group Structured Representations from Observed Transitions

    Authors: Hamza Keurti, Hsiao-Ru Pan, Michel Besserve, Benjamin F. Grewe, Bernhard Schölkopf

    Abstract: How can agents learn internal models that veridically represent interactions with the real world is a largely open question. As machine learning is moving towards representations containing not just observational but also interventional knowledge, we study this problem using tools from representation learning and group theory. We propose methods enabling an agent acting upon the world to learn int… ▽ More

    Submitted 2 July, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted at ICML2023, Presented at the Symmetry and Geometry in Neural Representations Workshop (NeurReps) @ NeurIPS2022, 26 pages, 17 figures

  10. arXiv:2206.02416  [pdf, other

    stat.ML cs.AI cs.LG

    Embrace the Gap: VAEs Perform Independent Mechanism Analysis

    Authors: Patrik Reizinger, Luigi Gresele, Jack Brady, Julius von Kügelgen, Dominik Zietlow, Bernhard Schölkopf, Georg Martius, Wieland Brendel, Michel Besserve

    Abstract: Variational autoencoders (VAEs) are a popular framework for modeling complex data distributions; they can be efficiently trained via variational inference by maximizing the evidence lower bound (ELBO), at the expense of a gap to the exact (log-)marginal likelihood. While VAEs are commonly used for representation learning, it is unclear why ELBO maximization would yield useful representations, sinc… ▽ More

    Submitted 27 January, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: NeurIPS2022 final version

  11. arXiv:2204.14096  [pdf, other

    stat.ML cs.LG q-bio.QM stat.AP

    Bayesian Information Criterion for Event-based Multi-trial Ensemble data

    Authors: Kaidi Shao, Nikos K. Logothetis, Michel Besserve

    Abstract: Transient recurring phenomena are ubiquitous in many scientific fields like neuroscience and meteorology. Time inhomogenous Vector Autoregressive Models (VAR) may be used to characterize peri-event system dynamics associated with such phenomena, and can be learned by exploiting multi-dimensional data gathering samples of the evolution of the system in multiple time windows comprising, each associa… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 12 pages, 4 figures

  12. arXiv:2202.06844  [pdf, other

    stat.ML cs.AI cs.LG

    On Pitfalls of Identifiability in Unsupervised Learning. A Note on: "Desiderata for Representation Learning: A Causal Perspective"

    Authors: Shubhangi Ghosh, Luigi Gresele, Julius von Kügelgen, Michel Besserve, Bernhard Schölkopf

    Abstract: Model identifiability is a desirable property in the context of unsupervised representation learning. In absence thereof, different models may be observationally indistinguishable while yielding representations that are nontrivially related to one another, thus making the recovery of a ground truth generative model fundamentally impossible, as often shown through suitably constructed counterexampl… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 5 pages, 1 figure

  13. arXiv:2112.05729  [pdf, other

    cs.LG

    Learning soft interventions in complex equilibrium systems

    Authors: Michel Besserve, Bernhard Schölkopf

    Abstract: Complex systems often contain feedback loops that can be described as cyclic causal models. Intervening in such systems may lead to counterintuitive effects, which cannot be inferred directly from the graph structure. After establishing a framework for differentiable soft interventions based on Lie groups, we take advantage of modern automatic differentiation techniques and their application to im… ▽ More

    Submitted 14 December, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

  14. arXiv:2110.15595  [pdf, other

    stat.ME stat.ML

    Cause-effect inference through spectral independence in linear dynamical systems: theoretical foundations

    Authors: Michel Besserve, Naji Shajarisales, Dominik Janzing, Bernhard Schölkopf

    Abstract: Distinguishing between cause and effect using time series observational data is a major challenge in many scientific fields. A new perspective has been provided based on the principle of Independence of Causal Mechanisms (ICM), leading to the Spectral Independence Criterion (SIC), postulating that the power spectral density (PSD) of the cause time series is uncorrelated with the squared modulus of… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

  15. arXiv:2106.16091  [pdf, other

    cs.LG cs.CV

    Exploring the Latent Space of Autoencoders with Interventional Assays

    Authors: Felix Leeb, Stefan Bauer, Michel Besserve, Bernhard Schölkopf

    Abstract: Autoencoders exhibit impressive abilities to embed the data manifold into a low-dimensional latent space, making them a staple of representation learning methods. However, without explicit supervision, which is often unavailable, the representation is usually uninterpretable, making analysis and principled progress challenging. We propose a framework, called latent responses, which exploits the lo… ▽ More

    Submitted 11 January, 2023; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: Published in NeurIPS 2022 Conference Proceedings

  16. arXiv:2106.05200  [pdf, other

    stat.ML cs.AI cs.LG

    Independent mechanism analysis, a new concept?

    Authors: Luigi Gresele, Julius von Kügelgen, Vincent Stimper, Bernhard Schölkopf, Michel Besserve

    Abstract: Independent component analysis provides a principled framework for unsupervised representation learning, with solid theory on the identifiability of the latent code that generated the data, given only observations of mixtures thereof. Unfortunately, when the mixing is nonlinear, the model is provably nonidentifiable, since statistical independence alone does not sufficiently constrain the problem.… ▽ More

    Submitted 9 February, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 final camera-ready version

  17. arXiv:2106.04619  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

    Authors: Julius von Kügelgen, Yash Sharma, Luigi Gresele, Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

    Abstract: Self-supervised representation learning has shown remarkable success in a number of domains. A common practice is to perform data augmentation via hand-crafted transformations intended to leave the semantics of the data invariant. We seek to understand the empirical success of this approach from a theoretical perspective. We formulate the augmentation process as a latent variable model by postulat… ▽ More

    Submitted 14 January, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 final camera-ready revision (with minor corrections)

  18. arXiv:2012.01912  [pdf, other

    stat.AP q-bio.PE

    Assaying Large-scale Testing Models to Interpret COVID-19 Case Numbers

    Authors: Michel Besserve, Simon Buchholz, Bernhard Schölkopf

    Abstract: Large-scale testing is considered key to assess the state of the current COVID-19 pandemic. Yet, the link between the reported case numbers and the true state of the pandemic remains elusive. We develop mathematical models based on competing hypotheses regarding this link, thereby providing different prevalence estimates based on case numbers, and validate them by predicting SARS-CoV-2-attributed… ▽ More

    Submitted 3 February, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: 41 pages, 7 figures

  19. arXiv:2010.05375  [pdf, other

    stat.ML cs.LG

    Causal learning with sufficient statistics: an information bottleneck approach

    Authors: Daniel Chicharro, Michel Besserve, Stefano Panzeri

    Abstract: The inference of causal relationships using observational data from partially observed multivariate systems with hidden variables is a fundamental question in many scientific domains. Methods extracting causal information from conditional independencies between variables of a system are common tools for this purpose, but are limited in the lack of independencies. To surmount this limitation, we ca… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

    MSC Class: 94A16; 62D20; 62H22; 62Bxx

  20. arXiv:2007.02938  [pdf, other

    stat.ML cs.LG math.ST

    Causal Feature Selection via Orthogonal Search

    Authors: Ashkan Soleymani, Anant Raj, Stefan Bauer, Bernhard Schölkopf, Michel Besserve

    Abstract: The problem of inferring the direct causal parents of a response variable among a large set of explanatory variables is of high practical importance in many disciplines. However, established approaches often scale at least exponentially with the number of explanatory variables, are difficult to extend to nonlinear relationships, and are difficult to extend to cyclic data. Inspired by {\em Debiased… ▽ More

    Submitted 16 September, 2022; v1 submitted 6 July, 2020; originally announced July 2020.

  21. arXiv:2006.07796  [pdf, other

    cs.LG cs.CV stat.ML

    Structure by Architecture: Structured Representations without Regularization

    Authors: Felix Leeb, Guilia Lanzillotta, Yashas Annadani, Michel Besserve, Stefan Bauer, Bernhard Schölkopf

    Abstract: We study the problem of self-supervised structured representation learning using autoencoders for downstream tasks such as generative modeling. Unlike most methods which rely on matching an arbitrary, relatively unstructured, prior distribution for sampling, we propose a sampling technique that relies solely on the independence of latent variables, thereby avoiding the trade-off between reconstruc… ▽ More

    Submitted 15 February, 2024; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2023

  22. arXiv:2005.04034  [pdf, ps, other

    stat.ME q-bio.NC stat.AP

    From univariate to multivariate coupling between continuous signals and point processes: a mathematical framework

    Authors: Shervin Safavi, Nikos K. Logothetis, Michel Besserve

    Abstract: Time series datasets often contain heterogeneous signals, composed of both continuously changing quantities and discretely occurring events. The coupling between these measurements may provide insights into key underlying mechanisms of the systems under study. To better extract this information, we investigate the asymptotic statistical properties of coupling measures between continuous signals an… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: 50 pages

  23. arXiv:2004.00184  [pdf, other

    cs.LG cs.CV stat.ML

    A theory of independent mechanisms for extrapolation in generative models

    Authors: Michel Besserve, Rémy Sun, Dominik Janzing, Bernhard Schölkopf

    Abstract: Generative models can be trained to emulate complex empirical data, but are they useful to make predictions in the context of previously unobserved environments? An intuitive idea to promote such extrapolation capabilities is to have the architecture of such model reflect a causal graph of the true data generating process, such that one can intervene on each node independently of the others. Howev… ▽ More

    Submitted 31 December, 2021; v1 submitted 31 March, 2020; originally announced April 2020.

    Comments: 21 pages

  24. arXiv:1903.02456   

    stat.ML cs.LG

    Orthogonal Structure Search for Efficient Causal Discovery from Observational Data

    Authors: Anant Raj, Luigi Gresele, Michel Besserve, Bernhard Schölkopf, Stefan Bauer

    Abstract: The problem of inferring the direct causal parents of a response variable among a large set of explanatory variables is of high practical importance in many disciplines. Recent work exploits stability of regression coefficients or invariance properties of models across different experimental conditions for reconstructing the full causal graph. These approaches generally do not scale well with the… ▽ More

    Submitted 6 July, 2020; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: first author uploaded a new version as "Causal Feature Selection via Orthogonal Search"

  25. arXiv:1812.03253  [pdf, other

    cs.LG stat.ML

    Counterfactuals uncover the modular structure of deep generative models

    Authors: Michel Besserve, Arash Mehrjou, Rémy Sun, Bernhard Schölkopf

    Abstract: Deep generative models can emulate the perceptual properties of complex image datasets, providing a latent representation of the data. However, manipulating such representation to perform meaningful and controllable transformations in the data space remains challenging without some form of supervision. While previous work has focused on exploiting statistical independence to disentangle latent fac… ▽ More

    Submitted 12 December, 2019; v1 submitted 7 December, 2018; originally announced December 2018.

    Comments: 26 pages, 17 figures

  26. arXiv:1803.06247  [pdf, ps, other

    cs.GT stat.ML

    Coordinating users of shared facilities via data-driven predictive assistants and game theory

    Authors: Philipp Geiger, Michel Besserve, Justus Winkelmann, Claudius Proissl, Bernhard Schölkopf

    Abstract: We study data-driven assistants that provide congestion forecasts to users of shared facilities (roads, cafeterias, etc.), to support coordination between them, and increase efficiency of such collective systems. Key questions are: (1) when and how much can (accurate) predictions help for coordination, and (2) which assistant algorithms reach optimal predictions? First we lay conceptual ground f… ▽ More

    Submitted 29 July, 2021; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: Extended version, including supplement, of a paper at the 35th Conference on Uncertainty in Artificial Intelligence, 2019

  27. arXiv:1707.06819  [pdf, ps, other

    math.PR

    A central limit like theorem for Fourier sums

    Authors: Dominik Janzing, Naji Shajarisales, Michel Besserve

    Abstract: We consider the probability distributions of values in the complex plane attained by Fourier sums of the form \sum_{j=1}^n a_j exp(-2πi j nu) /sqrt{n} when the frequency nu is drawn uniformly at random from an interval of length 1. If the coefficients a_j are i.i.d. drawn with finite third moment, the distance of these distributions to an isotropic two-dimensional Gaussian on C converges in probab… ▽ More

    Submitted 21 July, 2017; originally announced July 2017.

    Comments: 7 pages

    MSC Class: 60Fxx

  28. arXiv:1705.02212  [pdf, other

    stat.ML cs.AI cs.LG math.ST

    Group invariance principles for causal generative models

    Authors: Michel Besserve, Naji Shajarisales, Bernhard Schölkopf, Dominik Janzing

    Abstract: The postulate of independence of cause and mechanism (ICM) has recently led to several new causal discovery algorithms. The interpretation of independence and the way it is utilized, however, varies across these methods. Our aim in this paper is to propose a group theoretic framework for ICM to unify and generalize these approaches. In our setting, the cause-mechanism relationship is assessed by c… ▽ More

    Submitted 5 May, 2017; originally announced May 2017.

    Comments: 16 pages, 6 figures

    ACM Class: I.2.6; I.2.10; G.3; I.5.3

  29. arXiv:1503.01299  [pdf, ps, other

    cs.AI

    Telling cause from effect in deterministic linear dynamical systems

    Authors: Naji Shajarisales, Dominik Janzing, Bernhard Shoelkopf, Michel Besserve

    Abstract: Inferring a cause from its effect using observed time series data is a major challenge in natural and social sciences. Assuming the effect is generated by the cause trough a linear system, we propose a new approach based on the hypothesis that nature chooses the "cause" and the "mechanism that generates the effect from the cause" independent of each other. We therefore postulate that the power spe… ▽ More

    Submitted 4 March, 2015; originally announced March 2015.

    Comments: This article is under review for a peer-reviewed conference

  30. arXiv:1209.5549  [pdf, other

    q-bio.NC cs.LG stat.ML

    Towards a learning-theoretic analysis of spike-timing dependent plasticity

    Authors: David Balduzzi, Michel Besserve

    Abstract: This paper suggests a learning-theoretic perspective on how synaptic plasticity benefits global brain functioning. We introduce a model, the selectron, that (i) arises as the fast time constant limit of leaky integrate-and-fire neurons equipped with spiking timing dependent plasticity (STDP) and (ii) is amenable to theoretical analysis. We show that the selectron encodes reward estimates into spik… ▽ More

    Submitted 25 September, 2012; originally announced September 2012.

    Comments: To appear in Adv. Neural Inf. Proc. Systems

  31. arXiv:1202.4482  [pdf, other

    q-bio.NC cs.LG nlin.AO

    Metabolic cost as an organizing principle for cooperative learning

    Authors: David Balduzzi, Pedro A Ortega, Michel Besserve

    Abstract: This paper investigates how neurons can use metabolic cost to facilitate learning at a population level. Although decision-making by individual neurons has been extensively studied, questions regarding how neurons should behave to cooperate effectively remain largely unaddressed. Under assumptions that capture a few basic features of cortical neurons, we show that constraining reward maximization… ▽ More

    Submitted 9 February, 2013; v1 submitted 20 February, 2012; originally announced February 2012.

    Comments: 14 pages, 2 figures, to appear in Advances in Complex Systems