Skip to main content

Showing 1–16 of 16 results for author: Marx, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.20546  [pdf, other

    stat.ML cs.LG stat.ME

    Regression-Based Estimation of Causal Effects in the Presence of Selection Bias and Confounding

    Authors: Marlies Hafer, Alexander Marx

    Abstract: We consider the problem of estimating the expected causal effect $E[Y|do(X)]$ for a target variable $Y$ when treatment $X$ is set by intervention, focusing on continuous random variables. In settings without selection bias or confounding, $E[Y|do(X)] = E[Y|X]$, which can be estimated using standard regression methods. However, regression fails when systematic missingness induced by selection bias,… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 13 pages plus appendix

  2. arXiv:2502.15568  [pdf, other

    cs.LG cs.AI

    A Cautionary Tale About "Neutrally" Informative AI Tools Ahead of the 2025 Federal Elections in Germany

    Authors: Ina Dormuth, Sven Franke, Marlies Hafer, Tim Katzke, Alexander Marx, Emmanuel Müller, Daniel Neider, Markus Pauly, Jérôme Rutinowski

    Abstract: In this study, we examine the reliability of AI-based Voting Advice Applications (VAAs) and large language models (LLMs) in providing objective political information. Our analysis is based upon a comparison with party responses to 38 statements of the Wahl-O-Mat, a well-established German online tool that helps inform voters by comparing their views with political party positions. For the LLMs, we… ▽ More

    Submitted 7 April, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

  3. arXiv:2412.13223  [pdf, other

    q-bio.QM cs.AI cs.LG

    Generative modeling of protein ensembles guided by crystallographic electron densities

    Authors: Sai Advaith Maddipatla, Nadav Bojan Sellam, Sanketh Vedula, Ailie Marx, Alex Bronstein

    Abstract: Proteins are dynamic, adopting ensembles of conformations. The nature of this conformational heterogenity is imprinted in the raw electron density measurements obtained from X-ray crystallography experiments. Fitting an ensemble of protein structures to these measurements is a challenging, ill-posed inverse problem. We propose a non-i.i.d. ensemble guidance approach to solve this problem using exi… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  4. arXiv:2405.18848  [pdf, other

    cs.LG cs.AI

    Anomaly Detection by Context Contrasting

    Authors: Alain Ryser, Thomas M. Sutter, Alexander Marx, Julia E. Vogt

    Abstract: Anomaly detection focuses on identifying samples that deviate from the norm. When working with high-dimensional data such as images, a crucial requirement for detecting anomalous patterns is learning lower-dimensional representations that capture concepts of normality. Recent advances in self-supervised learning have shown great promise in this regard. However, many successful self-supervised anom… ▽ More

    Submitted 14 October, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2310.10240  [pdf, other

    stat.ML cs.IT cs.LG

    On the Properties and Estimation of Pointwise Mutual Information Profiles

    Authors: Paweł Czyż, Frederic Grabowski, Julia E. Vogt, Niko Beerenwinkel, Alexander Marx

    Abstract: The pointwise mutual information profile, or simply profile, is the distribution of pointwise mutual information for a given pair of random variables. One of its important properties is that its expected value is precisely the mutual information between these random variables. In this paper, we analytically describe the profiles of multivariate normal distributions and introduce a novel family of… ▽ More

    Submitted 29 May, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: The accompanying code is accessible on GitHub: https://github.com/cbg-ethz/bmi

  6. arXiv:2310.07518  [pdf, other

    cs.LG

    Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning

    Authors: Mirco Mutti, Riccardo De Santi, Marcello Restelli, Alexander Marx, Giorgia Ramponi

    Abstract: Posterior sampling allows exploitation of prior knowledge on the environment's transition dynamics to improve the sample efficiency of reinforcement learning. The prior is typically specified as a class of parametric distributions, the design of which can be cumbersome in practice, often resulting in the choice of uninformative priors. In this work, we propose a novel posterior sampling approach i… ▽ More

    Submitted 8 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  7. arXiv:2306.11078  [pdf, other

    stat.ML cs.IT cs.LG

    Beyond Normal: On the Evaluation of Mutual Information Estimators

    Authors: Paweł Czyż, Frederic Grabowski, Julia E. Vogt, Niko Beerenwinkel, Alexander Marx

    Abstract: Mutual information is a general statistical dependency measure which has found applications in representation learning, causality, domain generalization and computational biology. However, mutual information estimators are typically evaluated on simple families of probability distributions, namely multivariate normal distribution and selected distributions with one-dimensional random variables. In… ▽ More

    Submitted 16 October, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2023. Code available at https://github.com/cbg-ethz/bmi

  8. arXiv:2303.09166  [pdf, other

    cs.LG stat.ML

    Identifiability Results for Multimodal Contrastive Learning

    Authors: Imant Daunhawer, Alice Bizeul, Emanuele Palumbo, Alexander Marx, Julia E. Vogt

    Abstract: Contrastive learning is a cornerstone underlying recent progress in multi-view and multimodal learning, e.g., in representation learning with image/caption pairs. While its effectiveness is not yet fully understood, a line of recent work reveals that contrastive learning can invert the data generating process and recover ground truth latent factors shared between views. In this work, we present ne… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 camera-ready version

  9. arXiv:2210.09054  [pdf, other

    stat.ML cs.AI cs.LG

    On the Identifiability and Estimation of Causal Location-Scale Noise Models

    Authors: Alexander Immer, Christoph Schultheiss, Julia E. Vogt, Bernhard Schölkopf, Peter Bühlmann, Alexander Marx

    Abstract: We study the class of location-scale or heteroscedastic noise models (LSNMs), in which the effect $Y$ can be written as a function of the cause $X$ and a noise source $N$ independent of $X$, which may be scaled by a positive function $g$ over the cause, i.e., $Y = f(X) + g(X)N$. Despite the generality of the model class, we show the causal direction is identifiable up to some pathological cases. T… ▽ More

    Submitted 1 June, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: ICML 2023

  10. arXiv:2110.13883  [pdf, other

    cs.IT

    Estimating Mutual Information via Geodesic $k$NN

    Authors: Alexander Marx, Jonas Fischer

    Abstract: Estimating mutual information (MI) between two continuous random variables $X$ and $Y$ allows to capture non-linear dependencies between them, non-parametrically. As such, MI estimation lies at the core of many data science applications. Yet, robustly estimating MI for high-dimensional $X$ and $Y$ is still an open research question. In this paper, we formulate this problem through the lens of ma… ▽ More

    Submitted 18 January, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at SIAM SDM'22

  11. arXiv:2105.01902  [pdf, other

    cs.IT

    Formally Justifying MDL-based Inference of Cause and Effect

    Authors: Alexander Marx, Jilles Vreeken

    Abstract: The algorithmic independence of conditionals, which postulates that the causal mechanism is algorithmically independent of the cause, has recently inspired many highly successful approaches to distinguish cause from effect given only observational data. Most popular among these is the idea to approximate algorithmic independence via two-part Minimum Description Length (MDL). Although intuitively s… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

  12. arXiv:2101.05009  [pdf, other

    cs.IT stat.AP

    Estimating Conditional Mutual Information for Discrete-Continuous Mixtures using Multi-Dimensional Adaptive Histograms

    Authors: Alexander Marx, Lincen Yang, Matthijs van Leeuwen

    Abstract: Estimating conditional mutual information (CMI) is an essential yet challenging step in many machine learning and data mining tasks. Estimating CMI from data that contains both discrete and continuous variables, or even discrete-continuous mixture variables, is a particularly hard problem. In this paper, we show that CMI for such mixture variables, defined based on the Radon-Nikodym derivate, can… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Extended version including supplementary material for main paper which is (will be) published in: Proceedings of the SIAM International Conference on Data Mining (SDM'21)

  13. arXiv:2010.14265  [pdf, other

    stat.ML cs.AI cs.LG

    A Weaker Faithfulness Assumption based on Triple Interactions

    Authors: Alexander Marx, Arthur Gretton, Joris M. Mooij

    Abstract: One of the core assumptions in causal discovery is the faithfulness assumption, i.e., assuming that independencies found in the data are due to separations in the true causal graph. This assumption can, however, be violated in many ways, including xor connections, deterministic functions or cancelling paths. In this work, we propose a weaker assumption that we call $2$-adjacency faithfulness. In c… ▽ More

    Submitted 4 August, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Accepted for the 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021)

    Journal ref: Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, PMLR 161:451-460, 2021

  14. arXiv:1903.04829  [pdf, other

    stat.ML cs.LG

    Testing Conditional Independence on Discrete Data using Stochastic Complexity

    Authors: Alexander Marx, Jilles Vreeken

    Abstract: Testing for conditional independence is a core aspect of constraint-based causal discovery. Although commonly used tests are perfect in theory, they often fail to reject independence in practice, especially when conditioning on multiple variables. We focus on discrete data and propose a new test based on the notion of algorithmic independence that we instantiate using stochastic complexity. Amon… ▽ More

    Submitted 12 March, 2019; originally announced March 2019.

    Comments: 18 pages, accepted at AISTATS'19, the proposed test was released in the R package SCCI

  15. arXiv:1808.06356  [pdf, other

    stat.ML cs.LG

    Causal Discovery by Telling Apart Parents and Children

    Authors: Alexander Marx, Jilles Vreeken

    Abstract: We consider the problem of inferring the directed, causal graph from observational data, assuming no hidden confounders. We take an information theoretic approach, and make three main contributions. First, we show how through algorithmic information theory we can obtain SCI, a highly robust, effective and computationally efficient test for conditional independence---and show it outperforms the s… ▽ More

    Submitted 6 September, 2018; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: 11 pages, results section changed slightly

  16. arXiv:1702.06385  [pdf, other

    stat.ML cs.LG

    Causal Inference on Multivariate and Mixed-Type Data

    Authors: Alexander Marx, Jilles Vreeken

    Abstract: Given data over the joint distribution of two random variables $X$ and $Y$, we consider the problem of inferring the most likely causal direction between $X$ and $Y$. In particular, we consider the general case where both $X$ and $Y$ may be univariate or multivariate, and of the same or mixed data types. We take an information theoretic approach, based on Kolmogorov complexity, from which it follo… ▽ More

    Submitted 16 October, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

    Comments: 9 pages, submitted to sdm