Showing 1–22 of 22 results for author: Maathuis, M H

  1. arXiv:2410.12151  [pdf, ps, other]

    stat.ME

    Root cause discovery via permutations and Cholesky decomposition

    Authors: Jinzhou Li, Benjamin B. Chu, Ines F. Scheller, Julien Gagneur, Marloes H. Maathuis

    Abstract: This work is motivated by the following problem: Can we identify the disease-causing gene in a patient affected by a monogenic disorder? This problem is an instance of root cause discovery. In particular, we aim to identify the intervened variable in one interventional sample using a set of observational samples as reference. We consider a linear structural equation model where the causal ordering…

    Submitted 1 July, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: add funding information

  2. arXiv:2312.02717  [pdf, other]

    stat.ME

    A Graphical Approach to Treatment Effect Estimation with Observational Network Data

    Authors: Meta-Lina Spohn, Leonard Henckel, Marloes H. Maathuis

    Abstract: We propose an easy-to-use adjustment estimator for the effect of a treatment based on observational data from a single (social) network of units. The approach allows for interactions among units within the network, called interference, and for observed confounding. We define a simplified causal graph that does not differentiate between units, called generic graph. Using valid adjustment sets deter…

    Submitted 5 December, 2023; originally announced December 2023.

  3. arXiv:2212.12822  [pdf, other]

    stat.ME

    Simultaneous false discovery proportion bounds via knockoffs and closed testing

    Authors: Jinzhou Li, Marloes H. Maathuis, Jelle J. Goeman

    Abstract: We propose new methods to obtain simultaneous false discovery proportion bounds for knockoff-based approaches. We first investigate an approach based on Janson and Su's $k$-familywise error rate control method and interpolation. We then generalize it by considering a collection of $k$ values, and show that the bound of Katsevich and Ramdas is a special case of this method and can be uniformly impr…

    Submitted 25 February, 2024; v1 submitted 24 December, 2022; originally announced December 2022.

  4. arXiv:2110.06627  [pdf, other]

    stat.ME math.ST

    Estimation and Inference of Extremal Quantile Treatment Effects for Heavy-Tailed Distributions

    Authors: David Deuber, Jinzhou Li, Sebastian Engelke, Marloes H. Maathuis

    Abstract: Causal inference for extreme events has many potential applications in fields such as climate science, medicine and economics. We study the extremal quantile treatment effect of a binary treatment on a continuous, heavy-tailed outcome. Existing methods are limited to the case where the quantile of interest is within the range of the observations. For applications in risk assessment, however, the m…

    Submitted 5 July, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

  5. arXiv:2103.06328  [pdf, other]

    stat.ME

    Profiling Compliers in Instrumental Variables Designs

    Authors: Dominik Hangartner, Moritz Marbach, Leonard Henckel, Marloes H. Maathuis, Rachel R. Kelz, Luke Keele

    Abstract: Instrumental variable (IV) analyses are becoming common in health services research and epidemiology. IV analyses can be used both to analyze randomized trials with noncompliance and as a form of natural experiment. In these analyses, investigators often adopt a monotonicity assumption, which implies that the relevant effect only applies to a subset of the study population known as compliers. Sinc…

    Submitted 10 March, 2021; originally announced March 2021.

  6. arXiv:2006.15387  [pdf, other]

    stat.ME

    Evaluation of Causal Structure Learning Algorithms via Risk Estimation

    Authors: Marco F. Eigenmann, Sach Mukherjee, Marloes H. Maathuis

    Abstract: Recent years have seen many advances in methods for causal structure learning from data. The empirical assessment of such methods, however, is much less developed. Motivated by this gap, we pose the following question: how can one assess, in a given problem setting, the practical efficacy of one or more causal structure learning methods? We formalize the problem in a decision-theoretic framework,…

    Submitted 27 June, 2020; originally announced June 2020.

  7. arXiv:1908.11611  [pdf, other]

    stat.ME

    GGM knockoff filter: False Discovery Rate Control for Gaussian Graphical Models

    Authors: Jinzhou Li, Marloes H. Maathuis

    Abstract: We propose a new method to learn the structure of a Gaussian graphical model with finite sample false discovery rate control. Our method builds on the knockoff framework of Barber and Candès for linear models. We extend their approach to the graphical model setting by using a local (node-based) and a global (graph-based) step: we construct knockoffs and feature statistics for each node locally, an…

    Submitted 19 April, 2021; v1 submitted 30 August, 2019; originally announced August 2019.

  8. arXiv:1708.01151  [pdf, ps, other]

    stat.ME

    Robust causal structure learning with some hidden variables

    Authors: Benjamin Frot, Preetam Nandy, Marloes H. Maathuis

    Abstract: We introduce a new method to estimate the Markov equivalence class of a directed acyclic graph (DAG) in the presence of hidden variables, in settings where the underlying DAG among the observed variables is sparse, and there are a few hidden variables that have a direct effect on many of the observed ones. Building on the so-called low rank plus sparse framework, we suggest a two-stage approach wh…

    Submitted 4 August, 2018; v1 submitted 3 August, 2017; originally announced August 2017.

  9. arXiv:1707.07560  [pdf, other]

    stat.ME

    Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges

    Authors: Marco F. Eigenmann, Preetam Nandy, Marloes H. Maathuis

    Abstract: We consider structure learning of linear Gaussian structural equation models with weak edges. Since the presence of weak edges can lead to a loss of edge orientations in the true underlying CPDAG, we define a new graphical object that can contain more edge orientations. We show that this object can be recovered from observational data under a type of strong faithfulness assumption. We present a ne…

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: 18 pages, 17 figures, UAI 2017

  10. arXiv:1706.09141  [pdf, other]

    stat.ME

    Causal Structure Learning

    Authors: Christina Heinze-Deml, Marloes H. Maathuis, Nicolai Meinshausen

    Abstract: Graphical models can represent a multivariate distribution in a convenient and accessible form as a graph. Causal models can be viewed as a special class of graphical models that not only represent the distribution of the observed system but also the distributions under external interventions. They hence enable predictions under hypothetical interventions, which is important for decision making. T…

    Submitted 28 June, 2017; originally announced June 2017.

    Comments: to appear in `Annual Review of Statistics and Its Application', 30 pages

  11. arXiv:1606.02359  [pdf, other]

    stat.ME stat.ML

    Structure Learning in Graphical Modeling

    Authors: Mathias Drton, Marloes H. Maathuis

    Abstract: A graphical model is a statistical model that is associated to a graph whose nodes correspond to variables of interest. The edges of the graph reflect allowed conditional dependencies among the variables. Graphical models admit computationally convenient factorization properties and have long been a valuable tool for tractable modeling of multivariate distributions. More recently, applications suc…

    Submitted 7 June, 2016; originally announced June 2016.

  12. arXiv:1508.01717  [pdf, other]

    stat.ML

    Distributional Equivalence and Structure Learning for Bow-free Acyclic Path Diagrams

    Authors: Christopher Nowzohour, Marloes H. Maathuis, Robin J. Evans, Peter Bühlmann

    Abstract: We consider the problem of structure learning for bow-free acyclic path diagrams (BAPs). BAPs can be viewed as a generalization of linear Gaussian DAG models that allow for certain hidden variables. We present a first method for this problem using a greedy score-based search algorithm. We also prove some necessary and some sufficient conditions for distributional equivalence of BAPs which are used…

    Submitted 2 December, 2017; v1 submitted 7 August, 2015; originally announced August 2015.

  13. arXiv:1507.02608  [pdf, other]

    math.ST stat.ME

    High-dimensional consistency in score-based and hybrid structure learning

    Authors: Preetam Nandy, Alain Hauser, Marloes H. Maathuis

    Abstract: Main approaches for learning Bayesian networks can be classified as constraint-based, score-based or hybrid methods. Although high-dimensional consistency results are available for constraint-based methods like the PC algorithm, such results have not been proved for score-based or hybrid methods, and most of the hybrid methods have not even been shown to be consistent in the classical setting where the…

    Submitted 3 February, 2018; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: 37 pages, 5 figures, 41 pages supplement (available as an ancillary file)

  14. arXiv:1506.07669  [pdf, other]

    stat.ME

    A review of some recent advances in causal inference

    Authors: Marloes H. Maathuis, Preetam Nandy

    Abstract: We give a selective review of some recent developments in causal inference, intended for researchers who are not familiar with graphical models and causality, and with a focus on methods that are applicable to large data sets. We mainly address the problem of estimating causal effects from observational data. For example, one can think of estimating the effect of single or multiple gene knockouts…

    Submitted 25 June, 2015; originally announced June 2015.

    Comments: 23 pages, 4 figures, To appear in the "Handbook of Big Data", Chapman and Hall

    MSC Class: 62-09; 62H12; 62P10

  15. arXiv:1407.2451  [pdf, other]

    stat.ME

    Estimating the effect of joint interventions from observational data in sparse high-dimensional settings

    Authors: Preetam Nandy, Marloes H. Maathuis, Thomas S. Richardson

    Abstract: We consider the estimation of joint causal effects from observational data. In particular, we propose new methods to estimate the effect of multiple simultaneous interventions (e.g., multiple gene knockouts), under the assumption that the observational data come from an unknown linear structural equation model with independent errors. We derive asymptotic variances of our estimators when the under…

    Submitted 9 March, 2016; v1 submitted 9 July, 2014; originally announced July 2014.

    Comments: 30 pages, 3 figures, 45 pages supplement

    MSC Class: 62M99; 62H12; 62P10

  16. arXiv:1307.5636  [pdf, ps, other]

    stat.ME cs.AI

    A generalized back-door criterion

    Authors: Marloes H. Maathuis, Diego Colombo

    Abstract: We generalize Pearl's back-door criterion for directed acyclic graphs (DAGs) to more general types of graphs that describe Markov equivalence classes of DAGs and/or allow for arbitrarily many hidden variables. We also give easily checkable necessary and sufficient graphical criteria for the existence of a set of variables that satisfies our generalized back-door criterion, when considering a singl…

    Submitted 3 June, 2015; v1 submitted 22 July, 2013; originally announced July 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOS1295 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1295

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 3, 1060-1088

  17. arXiv:1211.3295  [pdf, other]

    stat.ML cs.LG

    Order-independent constraint-based causal structure learning

    Authors: Diego Colombo, Marloes H. Maathuis

    Abstract: We consider constraint-based methods for causal structure learning, such as the PC-, FCI-, RFCI- and CCD- algorithms (Spirtes et al. (2000, 1993), Richardson (1996), Colombo et al. (2012), Claassen et al. (2013)). The first step of all these algorithms consists of the PC-algorithm. This algorithm is known to be order-dependent, in the sense that the output can depend on the order in which the vari…

    Submitted 27 September, 2013; v1 submitted 14 November, 2012; originally announced November 2012.

  18. arXiv:1104.5617  [pdf, ps, other]

    stat.ME cs.LG math.ST

    Learning high-dimensional directed acyclic graphs with latent and selection variables

    Authors: Diego Colombo, Marloes H. Maathuis, Markus Kalisch, Thomas S. Richardson

    Abstract: We consider the problem of learning causal information between random variables in directed acyclic graphs (DAGs) when allowing arbitrarily many latent and selection variables. The FCI (Fast Causal Inference) algorithm has been explicitly designed to infer conditional independence and causal information in such settings. However, FCI is computationally infeasible for large graphs. We therefore pro…

    Submitted 29 May, 2012; v1 submitted 29 April, 2011; originally announced April 2011.

    Comments: Published at http://dx.doi.org/10.1214/11-AOS940 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS940

    Journal ref: Annals of Statistics 2012, Vol. 40, No. 1, 294-321

  19. Nonparametric inference for competing risks current status data with continuous, discrete or grouped observation times

    Authors: Marloes H. Maathuis, Michael G. Hudgens

    Abstract: New methods and theory have recently been developed to nonparametrically estimate cumulative incidence functions for competing risks survival data subject to current status censoring. In particular, the limiting distribution of the nonparametric maximum likelihood estimator and a simplified "naive estimator" have been established under certain smoothness conditions. In this paper, we establish the…

    Submitted 20 December, 2010; v1 submitted 26 September, 2009; originally announced September 2009.

    Comments: 16 pages, 3 figures

    Journal ref: Biometrika 2011, Vol. 98, No. 2, 325-340

  20. Reduction algorithm for the NPMLE for the distribution function of bivariate interval censored data

    Authors: Marloes H. Maathuis

    Abstract: We study computational aspects of the nonparametric maximum likelihood estimator (NPMLE) for the distribution function of bivariate interval censored data. The computation of the NPMLE consists of two steps: a parameter reduction step and an optimization step. In this paper we focus on the reduction step. We introduce two new reduction algorithms: the Tree algorithm and the HeightMap algorithm.…

    Submitted 17 June, 2009; originally announced June 2009.

    Comments: 12 pages, 3 figures

    Journal ref: Journal of Computational and Graphical Statistics 2005, Vol. 14, No. 2, 352-362

  21. Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm

    Authors: Peter Bühlmann, Markus Kalisch, Marloes H. Maathuis

    Abstract: We consider variable selection in high-dimensional linear models where the number of covariates greatly exceeds the sample size. We introduce the new concept of partial faithfulness and use it to infer associations between the covariates and the response. Under partial faithfulness, we develop a simplified version of the PC algorithm (Spirtes et al., 2000), the PC-simple algorithm, which is comp…

    Submitted 7 October, 2009; v1 submitted 17 June, 2009; originally announced June 2009.

    Comments: 20 pages, 3 figures

    Journal ref: Biometrika 2010, Vol. 97, No. 2, 261-278

  22. Estimating high-dimensional intervention effects from observational data

    Authors: Marloes H. Maathuis, Markus Kalisch, Peter Bühlmann

    Abstract: We assume that we have observational data generated from an unknown underlying directed acyclic graph (DAG) model. A DAG is typically not identifiable from observational data, but it is possible to consistently estimate the equivalence class of a DAG. Moreover, for any given DAG, causal effects can be estimated using intervention calculus. In this paper, we combine these two parts. For each DAG…

    Submitted 2 September, 2009; v1 submitted 23 October, 2008; originally announced October 2008.

    Comments: Published at http://dx.doi.org/10.1214/09-AOS685 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS685

    MSC Class: 62-09; 62H99 (Primary)

    Journal ref: Annals of Statistics 2009, Vol. 37, No. 6A, 3133-3164