Search | arXiv e-print repository

Learning Probabilities of Causation from Finite Population Data

Authors: Shuai Wang, Song Jiang, Yizhou Sun, Judea Pearl, Ang Li

Abstract: Probabilities of causation play a crucial role in modern decision-making. This paper addresses the challenge of predicting probabilities of causation for subpopulations with \textbf{insufficient} data using machine learning models. Tian and Pearl first defined and derived tight bounds for three fundamental probabilities of causation: the probability of necessity and sufficiency (PNS), the probabil… ▽ More Probabilities of causation play a crucial role in modern decision-making. This paper addresses the challenge of predicting probabilities of causation for subpopulations with \textbf{insufficient} data using machine learning models. Tian and Pearl first defined and derived tight bounds for three fundamental probabilities of causation: the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN). However, estimating these probabilities requires both experimental and observational distributions specific to each subpopulation, which are often unavailable or impractical to obtain with limited population-level data. Therefore, for most subgroups, the amount of data they have is not enough to guarantee the accuracy of their probabilities. Hence, to estimate these probabilities for subpopulations with \textbf{insufficient} data, we propose using machine learning models that draw insights from subpopulations with sufficient data. Our evaluation of multiple machine learning models indicates that, given the population-level data and an appropriate choice of machine learning model and activation function, PNS can be effectively predicted. Through simulation studies on multiple Structured Causal Models (SCMs), we show that our multilayer perceptron (MLP) model with the Mish activation function achieves a mean absolute error (MAE) of approximately $0.02$ in predicting PNS for $32,768$ subpopulations across most SCMs using data from only $2,000$ subpopulations with known PNS values. △ Less

Submitted 21 May, 2025; originally announced May 2025.

Comments: arXiv admin note: text overlap with arXiv:2502.08858

arXiv:2310.04317 [pdf, other]

A perspective on neuroscience data standardization with Neurodata Without Borders

Authors: Andrea Pierré, Tuan Pham, Jonah Pearl, Sandeep Robert Datta, Jason T. Ritt, Alexander Fleischmann

Abstract: Neuroscience research has evolved to generate increasingly large and complex experimental data sets, and advanced data science tools are taking on central roles in neuroscience research. Neurodata Without Borders (NWB), a standard language for neurophysiology data, has recently emerged as a powerful solution for data management, analysis, and sharing. We here discuss our efforts to implement NWB d… ▽ More Neuroscience research has evolved to generate increasingly large and complex experimental data sets, and advanced data science tools are taking on central roles in neuroscience research. Neurodata Without Borders (NWB), a standard language for neurophysiology data, has recently emerged as a powerful solution for data management, analysis, and sharing. We here discuss our efforts to implement NWB data science pipelines. We describe general principles and specific use cases that illustrate successes, challenges, and non-trivial decisions in software engineering. We hope that our experience can provide guidance for the neuroscience community and help bridge the gap between experimental neuroscience and data science. △ Less

Submitted 22 January, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

Comments: 32 pages, 9 figures

arXiv:2301.12022 [pdf, ps, other]

Epsilon-Identifiability of Causal Quantities

Authors: Ang Li, Scott Mueller, Judea Pearl

Abstract: Identifying the effects of causes and causes of effects is vital in virtually every scientific field. Often, however, the needed probabilities may not be fully identifiable from the data sources available. This paper shows how partial identifiability is still possible for several probabilities of causation. We term this epsilon-identifiability and demonstrate its usefulness in cases where the beha… ▽ More Identifying the effects of causes and causes of effects is vital in virtually every scientific field. Often, however, the needed probabilities may not be fully identifiable from the data sources available. This paper shows how partial identifiability is still possible for several probabilities of causation. We term this epsilon-identifiability and demonstrate its usefulness in cases where the behavior of certain subpopulations can be restricted to within some narrow bounds. In particular, we show how unidentifiable causal effects and counterfactual probabilities can be narrowly bounded when such allowances are made. Often those allowances are easily measured and reasonably assumed. Finally, epsilon-identifiability is applied to the unit selection problem. △ Less

Submitted 27 January, 2023; originally announced January 2023.

arXiv:2210.08874 [pdf, other]

Probabilities of Causation: Role of Observational Data

Authors: Ang Li, Judea Pearl

Abstract: Probabilities of causation play a crucial role in modern decision-making. Pearl defined three binary probabilities of causation, the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN). These probabilities were then bounded by Tian and Pearl using a combination of experimental and observational data. However, observational data… ▽ More Probabilities of causation play a crucial role in modern decision-making. Pearl defined three binary probabilities of causation, the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN). These probabilities were then bounded by Tian and Pearl using a combination of experimental and observational data. However, observational data are not always available in practice; in such a case, Tian and Pearl's Theorem provided valid but less effective bounds using pure experimental data. In this paper, we discuss the conditions that observational data are worth considering to improve the quality of the bounds. More specifically, we defined the expected improvement of the bounds by assuming the observational distributions are uniformly distributed on their feasible interval. We further applied the proposed theorems to the unit selection problem defined by Li and Pearl. △ Less

Submitted 17 October, 2022; originally announced October 2022.

arXiv:2210.08453 [pdf, other]

Learning Probabilities of Causation from Finite Population Data

Authors: Ang Li, Song Jiang, Yizhou Sun, Judea Pearl

Abstract: This paper deals with the problem of learning the probabilities of causation of subpopulations given finite population data. The tight bounds of three basic probabilities of causation, the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN), were derived by Tian and Pearl. However, obtaining the bounds for each subpopulation re… ▽ More This paper deals with the problem of learning the probabilities of causation of subpopulations given finite population data. The tight bounds of three basic probabilities of causation, the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN), were derived by Tian and Pearl. However, obtaining the bounds for each subpopulation requires experimental and observational distributions of each subpopulation, which is usually impractical to estimate given finite population data. We propose a machine learning model that helps to learn the bounds of the probabilities of causation for subpopulations given finite population data. We further show by a simulated study that the machine learning model is able to learn the bounds of PNS for 32768 subpopulations with only knowing roughly 500 of them from the finite population data. △ Less

Submitted 16 October, 2022; originally announced October 2022.

arXiv:2210.08203 [pdf, other]

Unit Selection: Learning Benefit Function from Finite Population Data

Authors: Ang Li, Song Jiang, Yizhou Sun, Judea Pearl

Abstract: The unit selection problem is to identify a group of individuals who are most likely to exhibit a desired mode of behavior, for example, selecting individuals who would respond one way if incentivized and a different way if not. The unit selection problem consists of evaluation and search subproblems. Li and Pearl defined the "benefit function" to evaluate the average payoff of selecting a certain… ▽ More The unit selection problem is to identify a group of individuals who are most likely to exhibit a desired mode of behavior, for example, selecting individuals who would respond one way if incentivized and a different way if not. The unit selection problem consists of evaluation and search subproblems. Li and Pearl defined the "benefit function" to evaluate the average payoff of selecting a certain individual with given characteristics. The search subproblem is then to design an algorithm to identify the characteristics that maximize the above benefit function. The hardness of the search subproblem arises due to the large number of characteristics available for each individual and the sparsity of the data available in each cell of characteristics. In this paper, we present a machine learning framework that uses the bounds of the benefit function that are estimable from the finite population data to learn the bounds of the benefit function for each cell of characteristics. Therefore, we could easily obtain the characteristics that maximize the benefit function. △ Less

Submitted 15 October, 2022; originally announced October 2022.

arXiv:2210.05030 [pdf, ps, other]

Unit Selection: Case Study and Comparison with A/B Test Heuristic

Authors: Ang Li, Judea Pearl

Abstract: The unit selection problem defined by Li and Pearl identifies individuals who have desired counterfactual behavior patterns, for example, individuals who would respond positively if encouraged and would not otherwise. Li and Pearl showed by example that their unit selection model is beyond the A/B test heuristics. In this paper, we reveal the essence of the A/B test heuristics, which are exception… ▽ More The unit selection problem defined by Li and Pearl identifies individuals who have desired counterfactual behavior patterns, for example, individuals who would respond positively if encouraged and would not otherwise. Li and Pearl showed by example that their unit selection model is beyond the A/B test heuristics. In this paper, we reveal the essence of the A/B test heuristics, which are exceptional cases of the benefit function defined by Li and Pearl. Furthermore, We provided more simulated use cases of Li-Pearl's unit selection model to help decision-makers apply their model correctly, explaining that A/B test heuristics are generally problematic. △ Less

Submitted 10 October, 2022; originally announced October 2022.

arXiv:2210.05027 [pdf, other]

Probabilities of Causation: Adequate Size of Experimental and Observational Samples

Authors: Ang Li, Ruirui Mao, Judea Pearl

Abstract: The probabilities of causation are commonly used to solve decision-making problems. Tian and Pearl derived sharp bounds for the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN) using experimental and observational data. The assumption is that one is in possession of a large enough sample to permit an accurate estimation of t… ▽ More The probabilities of causation are commonly used to solve decision-making problems. Tian and Pearl derived sharp bounds for the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN) using experimental and observational data. The assumption is that one is in possession of a large enough sample to permit an accurate estimation of the experimental and observational distributions. In this study, we present a method for determining the sample size needed for such estimation, when a given confidence interval (CI) is specified. We further show by simulation that the proposed sample size delivered stable estimations of the bounds of PNS. △ Less

Submitted 10 October, 2022; originally announced October 2022.

arXiv:2209.11876 [pdf, other]

doi 10.3847/PSJ/ac93f2

Predicting asteroid material properties from a DART-like kinetic impact

Authors: Kathryn M. Kumamoto, J. Michael Owen, Megan Bruck Syal, Jason Pearl, Cody Raskin, Wendy K. Caldwell, Emma Rainey, Angela Stickle, R. Terik Daly, Olivier Barnouin

Abstract: NASA's Double Asteroid Redirection Test (DART) mission is the first full-scale test of the kinetic impactor method for asteroid deflection, in which a spacecraft intentionally impacts an asteroid to change its trajectory. DART represents an important first step for planetary defense technology demonstration, providing a realistic assessment of the effectiveness of the kinetic impact approach on a… ▽ More NASA's Double Asteroid Redirection Test (DART) mission is the first full-scale test of the kinetic impactor method for asteroid deflection, in which a spacecraft intentionally impacts an asteroid to change its trajectory. DART represents an important first step for planetary defense technology demonstration, providing a realistic assessment of the effectiveness of the kinetic impact approach on a near-Earth asteroid. The momentum imparted to the asteroid is transferred from the impacting spacecraft and enhanced by the momentum of material ejected from the impact site. However, the magnitude of the ejecta contribution is dependent on the material properties of the target. These properties, such as strength and shear modulus, are unknown for the DART target asteroid, Dimorphos, as well as most asteroids since such properties are difficult to characterize remotely. This study examines how hydrocode simulations can be used to estimate material properties from information available post-impact, specifically the asteroid size and shape, the velocity and properties of the impacting spacecraft, and the final velocity change imparted to the asteroid. Across >300 three-dimensional simulations varying seven material parameters describing the asteroid, we found many combinations of properties could reproduce a particular asteroid velocity. Additional observations, such as asteroid mass or crater size, are required to further constrain properties like asteroid strength or outcomes like the momentum enhancement provided by impact ejecta. Our results demonstrate the vital importance of having as much knowledge as possible prior to an impact mission, with key material parameters being the asteroid's mass, porosity, strength, and elastic properties. △ Less

Submitted 23 September, 2022; originally announced September 2022.

Comments: Accepted to the Planetary Science Journal

Report number: LLNL-JRNL-837536; LA-UR-22-28492

arXiv:2208.09569 [pdf, other]

Unit Selection with Nonbinary Treatment and Effect

Authors: Ang Li, Judea Pearl

Abstract: The unit selection problem aims to identify a set of individuals who are most likely to exhibit a desired mode of behavior, for example, selecting individuals who would respond one way if encouraged and a different way if not encouraged. Using a combination of experimental and observational data, Li and Pearl derived tight bounds on the "benefit function", which is the payoff/cost associated with… ▽ More The unit selection problem aims to identify a set of individuals who are most likely to exhibit a desired mode of behavior, for example, selecting individuals who would respond one way if encouraged and a different way if not encouraged. Using a combination of experimental and observational data, Li and Pearl derived tight bounds on the "benefit function", which is the payoff/cost associated with selecting an individual with given characteristics. This paper extends the benefit function to the general form such that the treatment and effect are not restricted to binary. We propose an algorithm to test the identifiability of the nonbinary benefit function and an algorithm to compute the bounds of the nonbinary benefit function using experimental and observational data. △ Less

Submitted 19 August, 2022; originally announced August 2022.

arXiv:2208.09568 [pdf, other]

Probabilities of Causation with Nonbinary Treatment and Effect

Authors: Ang Li, Judea Pearl

Abstract: This paper deals with the problem of estimating the probabilities of causation when treatment and effect are not binary. Tian and Pearl derived sharp bounds for the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN) using experimental and observational data. In this paper, we provide theoretical bounds for all types of probabi… ▽ More This paper deals with the problem of estimating the probabilities of causation when treatment and effect are not binary. Tian and Pearl derived sharp bounds for the probability of necessity and sufficiency (PNS), the probability of sufficiency (PS), and the probability of necessity (PN) using experimental and observational data. In this paper, we provide theoretical bounds for all types of probabilities of causation to multivalued treatments and effects. We further discuss examples where our bounds guide practical decisions and use simulation studies to evaluate how informative the bounds are for various combinations of data. △ Less

Submitted 19 August, 2022; originally announced August 2022.

arXiv:2208.09558 [pdf, ps, other]

Personalized Decision Making -- A Conceptual Introduction

Authors: Scott Mueller, Judea Pearl

Abstract: Personalized decision making targets the behavior of a specific individual, while population-based decision making concerns a sub-population resembling that individual. This paper clarifies the distinction between the two and explains why the former leads to more informed decisions. We further show that by combining experimental and observational studies we can obtain valuable information about in… ▽ More Personalized decision making targets the behavior of a specific individual, while population-based decision making concerns a sub-population resembling that individual. This paper clarifies the distinction between the two and explains why the former leads to more informed decisions. We further show that by combining experimental and observational studies we can obtain valuable information about individual behavior and, consequently, improve decisions over those obtained from experimental studies alone. △ Less

Submitted 19 August, 2022; originally announced August 2022.

arXiv:2109.07556 [pdf, other]

Unit Selection with Causal Diagram

Authors: Ang Li, Judea Pearl

Abstract: The unit selection problem aims to identify a set of individuals who are most likely to exhibit a desired mode of behavior, for example, selecting individuals who would respond one way if encouraged and a different way if not encouraged. Using a combination of experimental and observational data, Li and Pearl derived tight bounds on the "benefit function" - the payoff/cost associated with selectin… ▽ More The unit selection problem aims to identify a set of individuals who are most likely to exhibit a desired mode of behavior, for example, selecting individuals who would respond one way if encouraged and a different way if not encouraged. Using a combination of experimental and observational data, Li and Pearl derived tight bounds on the "benefit function" - the payoff/cost associated with selecting an individual with given characteristics. This paper shows that these bounds can be narrowed significantly (enough to change decisions) when structural information is available in the form of a causal model. We address the problem of estimating the benefit function using observational and experimental data when specific graphical criteria are assumed to hold. △ Less

Submitted 15 September, 2021; originally announced September 2021.

arXiv:2106.12121 [pdf, other]

Bounds on Causal Effects and Application to High Dimensional Data

Authors: Ang Li, Judea Pearl

Abstract: This paper addresses the problem of estimating causal effects when adjustment variables in the back-door or front-door criterion are partially observed. For such scenarios, we derive bounds on the causal effects by solving two non-linear optimization problems, and demonstrate that the bounds are sufficient. Using this optimization method, we propose a framework for dimensionality reduction that al… ▽ More This paper addresses the problem of estimating causal effects when adjustment variables in the back-door or front-door criterion are partially observed. For such scenarios, we derive bounds on the causal effects by solving two non-linear optimization problems, and demonstrate that the bounds are sufficient. Using this optimization method, we propose a framework for dimensionality reduction that allows one to trade bias for estimation power, and demonstrate its performance using simulation studies. △ Less

Submitted 22 June, 2021; originally announced June 2021.

arXiv:2104.13730 [pdf, other]

Causes of Effects: Learning individual responses from population data

Authors: Scott Mueller, Ang Li, Judea Pearl

Abstract: The problem of individualization is recognized as crucial in almost every field. Identifying causes of effects in specific events is likewise essential for accurate decision making. However, such estimates invoke counterfactual relationships, and are therefore indeterminable from population data. For example, the probability of benefiting from a treatment concerns an individual having a favorable… ▽ More The problem of individualization is recognized as crucial in almost every field. Identifying causes of effects in specific events is likewise essential for accurate decision making. However, such estimates invoke counterfactual relationships, and are therefore indeterminable from population data. For example, the probability of benefiting from a treatment concerns an individual having a favorable outcome if treated and an unfavorable outcome if untreated. Experiments conditioning on fine-grained features are fundamentally inadequate because we can't test both possibilities for an individual. Tian and Pearl provided bounds on this and other probabilities of causation using a combination of experimental and observational data. Even though those bounds were proven tight, narrower bounds, sometimes significantly so, can be achieved when structural information is available in the form of a causal model. This has the power to solve central problems, such as explainable AI, legal responsibility, and personalized medicine, all of which demand counterfactual logic. We analyze and expand on existing research by applying bounds to the probability of necessity and sufficiency (PNS) along with graphical criteria and practical applications. △ Less

Submitted 2 May, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

arXiv:1909.05434 [pdf, other]

doi 10.22331/q-2021-08-05-518

Classical causal models cannot faithfully explain Bell nonlocality or Kochen-Specker contextuality in arbitrary scenarios

Authors: J. C. Pearl, E. G. Cavalcanti

Abstract: In a recent work, it was shown by one of us (EGC) that Bell-Kochen-Specker inequality violations in phenomena satisfying the no-disturbance condition (a generalisation of the no-signalling condition) cannot in general be explained with a faithful classical causal model -- that is, a classical causal model that satisfies the assumption of no fine-tuning. The proof of that claim however was restrict… ▽ More In a recent work, it was shown by one of us (EGC) that Bell-Kochen-Specker inequality violations in phenomena satisfying the no-disturbance condition (a generalisation of the no-signalling condition) cannot in general be explained with a faithful classical causal model -- that is, a classical causal model that satisfies the assumption of no fine-tuning. The proof of that claim however was restricted to Bell scenarios involving 2 parties or Kochen-Specker-contextuality scenarios involving 2 measurements per context. Here we show that the result holds in the general case of arbitrary numbers of parties or measurements per context; it is not an artefact of the simplest scenarios. This result unifies, in full generality, Bell nonlocality and Kochen-Specker contextuality as violations of a fundamental principle of classical causality. We identify, however, an implicit assumption in the former proof, making it explicit here: that certain operational symmetries of the phenomenon are reflected in the model, rather than requiring fine-tuned choices of model parameters. This clarifies a subtle but important distinction between Bell nonlocality and Kochen-Specker contextuality. △ Less

Submitted 28 July, 2021; v1 submitted 11 September, 2019; originally announced September 2019.

Comments: 13 pages, 10 figures

Journal ref: Quantum 5, 518 (2021)

arXiv:1801.04016 [pdf, ps, other]

Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution

Authors: Judea Pearl

Abstract: Current machine learning systems operate, almost exclusively, in a statistical, or model-free mode, which entails severe theoretical limits on their power and performance. Such systems cannot reason about interventions and retrospection and, therefore, cannot serve as the basis for strong AI. To achieve human level intelligence, learning machines need the guidance of a model of reality, similar to… ▽ More Current machine learning systems operate, almost exclusively, in a statistical, or model-free mode, which entails severe theoretical limits on their power and performance. Such systems cannot reason about interventions and retrospection and, therefore, cannot serve as the basis for strong AI. To achieve human level intelligence, learning machines need the guidance of a model of reality, similar to the ones used in causal inference tasks. To demonstrate the essential role of such models, I will present a summary of seven tasks which are beyond reach of current machine learning systems and which have been accomplished using the tools of causal modeling. △ Less

Submitted 11 January, 2018; originally announced January 2018.

Comments: 8 pages, 3 figures

Report number: R-475

arXiv:1801.03583 [pdf, other]

Graphical Models for Processing Missing Data

Authors: Karthika Mohan, Judea Pearl

Abstract: This paper reviews recent advances in missing data research using graphical models to represent multivariate dependencies. We first examine the limitations of traditional frameworks from three different perspectives: \textit{transparency, estimability and testability}. We then show how procedures based on graphical models can overcome these limitations and provide meaningful performance guarantees… ▽ More This paper reviews recent advances in missing data research using graphical models to represent multivariate dependencies. We first examine the limitations of traditional frameworks from three different perspectives: \textit{transparency, estimability and testability}. We then show how procedures based on graphical models can overcome these limitations and provide meaningful performance guarantees even when data are Missing Not At Random (MNAR). In particular, we identify conditions that guarantee consistent estimation in broad categories of missing data problems, and derive procedures for implementing this estimation. Finally we derive testable implications for missing data models in both MAR (Missing At Random) and MNAR categories. △ Less

Submitted 13 November, 2019; v1 submitted 10 January, 2018; originally announced January 2018.

Comments: 34 pages, 5 figures

Report number: r473-L

arXiv:1708.00235 [pdf, other]

doi 10.1016/j.pss.2017.10.005

Scientific rationale for Uranus and Neptune in situ explorations

Authors: O. Mousis, D. H. Atkinson, T. Cavalié, L. N. Fletcher, M. J. Amato, S. Aslam, F. Ferri, J. -B. Renard, T. Spilker, E. Venkatapathy, P. Wurz, K. Aplin, A. Coustenis, M. Deleuil, M. Dobrijevic, T. Fouchet, T. Guillot, P. Hartogh, T. Hewagama, M. D. Hofstadter, V. Hue, R. Hueso, J. -P. Lebreton, E. Lellouch, J. Moses , et al. (31 additional authors not shown)

Abstract: The ice giants Uranus and Neptune are the least understood class of planets in our solar system but the most frequently observed type of exoplanets. Presumed to have a small rocky core, a deep interior comprising ~70% heavy elements surrounded by a more dilute outer envelope of H2 and He, Uranus and Neptune are fundamentally different from the better-explored gas giants Jupiter and Saturn. Because… ▽ More The ice giants Uranus and Neptune are the least understood class of planets in our solar system but the most frequently observed type of exoplanets. Presumed to have a small rocky core, a deep interior comprising ~70% heavy elements surrounded by a more dilute outer envelope of H2 and He, Uranus and Neptune are fundamentally different from the better-explored gas giants Jupiter and Saturn. Because of the lack of dedicated exploration missions, our knowledge of the composition and atmospheric processes of these distant worlds is primarily derived from remote sensing from Earth-based observatories and space telescopes. As a result, Uranus's and Neptune's physical and atmospheric properties remain poorly constrained and their roles in the evolution of the Solar System not well understood. Exploration of an ice giant system is therefore a high-priority science objective as these systems (including the magnetosphere, satellites, rings, atmosphere, and interior) challenge our understanding of planetary formation and evolution. Here we describe the main scientific goals to be addressed by a future in situ exploration of an ice giant. An atmospheric entry probe targeting the 10-bar level, about 5 scale heights beneath the tropopause, would yield insight into two broad themes: i) the formation history of the ice giants and, in a broader extent, that of the Solar System, and ii) the processes at play in planetary atmospheres. The probe would descend under parachute to measure composition, structure, and dynamics, with data returned to Earth using a Carrier Relay Spacecraft as a relay station. In addition, possible mission concepts and partnerships are presented, and a strawman ice-giant probe payload is described. An ice-giant atmospheric probe could represent a significant ESA contribution to a future NASA ice-giant flagship mission. △ Less

Submitted 1 August, 2017; originally announced August 2017.

Comments: Submitted to Planetary and Space Science

arXiv:1511.02995 [pdf, other]

Incorporating Knowledge into Structural Equation Models using Auxiliary Variables

Authors: Bryant Chen, Judea Pearl, Elias Bareinboim

Abstract: In this paper, we extend graph-based identification methods by allowing background knowledge in the form of non-zero parameter values. Such information could be obtained, for example, from a previously conducted randomized experiment, from substantive understanding of the domain, or even an identification technique. To incorporate such information systematically, we propose the addition of auxilia… ▽ More In this paper, we extend graph-based identification methods by allowing background knowledge in the form of non-zero parameter values. Such information could be obtained, for example, from a previously conducted randomized experiment, from substantive understanding of the domain, or even an identification technique. To incorporate such information systematically, we propose the addition of auxiliary variables to the model, which are constructed so that certain paths will be conveniently cancelled. This cancellation allows the auxiliary variables to help conventional methods of identification (e.g., single-door criterion, instrumental variables, half-trek criterion), as well as model testing (e.g., d-separation, over-identification). Moreover, by iteratively alternating steps of identification and adding auxiliary variables, we can improve the power of existing identification methods via a bootstrapping approach that does not require external knowledge. We operationalize this method for simple instrumental sets (a generalization of instrumental variables) and show that the resulting method is able to identify at least as many models as the most general identification method for linear systems known to date. We further discuss the application of auxiliary variables to the tasks of model testing and z-identification. △ Less

Submitted 2 May, 2016; v1 submitted 10 November, 2015; originally announced November 2015.

arXiv:1503.01603 [pdf, ps, other]

doi 10.1214/14-STS486

External Validity: From Do-Calculus to Transportability Across Populations

Authors: Judea Pearl, Elias Bareinboim

Abstract: The generalizability of empirical findings to new environments, settings or populations, often called "external validity," is essential in most scientific explorations. This paper treats a particular problem of generalizability, called "transportability," defined as a license to transfer causal effects learned in experimental studies to a new population, in which only observational studies can be… ▽ More The generalizability of empirical findings to new environments, settings or populations, often called "external validity," is essential in most scientific explorations. This paper treats a particular problem of generalizability, called "transportability," defined as a license to transfer causal effects learned in experimental studies to a new population, in which only observational studies can be conducted. We introduce a formal representation called "selection diagrams" for expressing knowledge about differences and commonalities between populations of interest and, using this representation, we reduce questions of transportability to symbolic derivations in the do-calculus. This reduction yields graph-based procedures for deciding, prior to observing any data, whether causal effects in the target population can be inferred from experimental findings in the study population. When the answer is affirmative, the procedures identify what experimental and observational findings need be obtained from the two populations, and how they can be combined to ensure bias-free transport. △ Less

Submitted 5 March, 2015; originally announced March 2015.

Comments: Published in at http://dx.doi.org/10.1214/14-STS486 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: text overlap with arXiv:1312.7485

Report number: IMS-STS-STS486

Journal ref: Statistical Science 2014, Vol. 29, No. 4, 579-595

arXiv:1411.7014 [pdf, other]

Efficient Algorithms for Bayesian Network Parameter Learning from Incomplete Data

Authors: Guy Van den Broeck, Karthika Mohan, Arthur Choi, Judea Pearl

Abstract: We propose an efficient family of algorithms to learn the parameters of a Bayesian network from incomplete data. In contrast to textbook approaches such as EM and the gradient method, our approach is non-iterative, yields closed form parameter estimates, and eliminates the need for inference in a Bayesian network. Our approach provides consistent parameter estimates for missing data problems that… ▽ More We propose an efficient family of algorithms to learn the parameters of a Bayesian network from incomplete data. In contrast to textbook approaches such as EM and the gradient method, our approach is non-iterative, yields closed form parameter estimates, and eliminates the need for inference in a Bayesian network. Our approach provides consistent parameter estimates for missing data problems that are MCAR, MAR, and in some cases, MNAR. Empirically, our approach is orders of magnitude faster than EM (as our approach requires no inference). Given sufficient data, we learn parameters that can be orders of magnitude more accurate. △ Less

Submitted 25 November, 2014; originally announced November 2014.

arXiv:1408.1479 [pdf]

Logarithmic-Time Updates and Queries in Probabilistic Networks

Authors: Arthur L. Delcher, Adam J. Grove, Simon Kasif, Judea Pearl

Abstract: In this paper we propose a dynamic data structure that supports efficient algorithms for updating and querying singly connected Bayesian networks (causal trees and polytrees). In the conventional algorithms, new evidence in absorbed in time O(1) and queries are processed in time O(N), where N is the size of the network. We propose a practical algorithm which, after a preprocessing phase, allows… ▽ More In this paper we propose a dynamic data structure that supports efficient algorithms for updating and querying singly connected Bayesian networks (causal trees and polytrees). In the conventional algorithms, new evidence in absorbed in time O(1) and queries are processed in time O(N), where N is the size of the network. We propose a practical algorithm which, after a preprocessing phase, allows us to answer queries in time O(log N) at the expense of O(logn N) time per evidence absorption. The usefulness of sub-linear processing time manifests itself in applications requiring (near) real-time response over large probabilistic databases. △ Less

Submitted 7 August, 2014; originally announced August 2014.

Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

Report number: UAI-P-1995-PG-116-124

arXiv:1312.7485 [pdf]

doi 10.1515/jci-2012-0004

A General Algorithm for Deciding Transportability of Experimental Results

Authors: Elias Bareinboim, Judea Pearl

Abstract: Generalizing empirical findings to new environments, settings, or populations is essential in most scientific explorations. This article treats a particular problem of generalizability, called "transportability", defined as a license to transfer information learned in experimental studies to a different population, on which only observational studies can be conducted. Given a set of assumptions co… ▽ More Generalizing empirical findings to new environments, settings, or populations is essential in most scientific explorations. This article treats a particular problem of generalizability, called "transportability", defined as a license to transfer information learned in experimental studies to a different population, on which only observational studies can be conducted. Given a set of assumptions concerning commonalities and differences between the two populations, Pearl and Bareinboim (2011) derived sufficient conditions that permit such transfer to take place. This article summarizes their findings and supplements them with an effective procedure for deciding when and how transportability is feasible. It establishes a necessary and sufficient condition for deciding when causal effects in the target population are estimable from both the statistical information available and the causal information transferred from the experiments. The article further provides a complete algorithm for computing the transport formula, that is, a way of combining observational and experimental information to synthesize bias-free estimate of the desired causal relation. Finally, the article examines the differences between transportability and other variants of generalizability. △ Less

Submitted 28 December, 2013; originally announced December 2013.

Journal ref: Journal of Causal Inference, 2013; 1(1): 107-134

arXiv:1304.3422 [pdf]

A Constraint Propagation Approach to Probabilistic Reasoning

Authors: Judea Pearl

Abstract: The paper demonstrates that strict adherence to probability theory does not preclude the use of concurrent, self-activated constraint-propagation mechanisms for managing uncertainty. Maintaining local records of sources-of-belief allows both predictive and diagnostic inferences to be activated simultaneously and propagate harmoniously towards a stable equilibrium. The paper demonstrates that strict adherence to probability theory does not preclude the use of concurrent, self-activated constraint-propagation mechanisms for managing uncertainty. Maintaining local records of sources-of-belief allows both predictive and diagnostic inferences to be activated simultaneously and propagate harmoniously towards a stable equilibrium. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the First Conference on Uncertainty in Artificial Intelligence (UAI1985)

Report number: UAI-P-1985-PG-31-42

arXiv:1304.3103 [pdf]

Learning Link-Probabilities in Causal Trees

Authors: Igor Roizer, Judea Pearl

Abstract: A learning algorithm is presented which given the structure of a causal tree, will estimate its link probabilities by sequential measurements on the leaves only. Internal nodes of the tree represent conceptual (hidden) variables inaccessible to observation. The method described is incremental, local, efficient, and remains robust to measurement imprecisions. A learning algorithm is presented which given the structure of a causal tree, will estimate its link probabilities by sequential measurements on the leaves only. Internal nodes of the tree represent conceptual (hidden) variables inaccessible to observation. The method described is incremental, local, efficient, and remains robust to measurement imprecisions. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)

Report number: UAI-P-1986-PG-211-214

arXiv:1304.3102 [pdf]

Distributed Revision of Belief Commitment in Multi-Hypothesis Interpretations

Authors: Judea Pearl

Abstract: This paper extends the applications of belief-networks to include the revision of belief commitments, i.e., the categorical acceptance of a subset of hypotheses which, together, constitute the most satisfactory explanation of the evidence at hand. A coherent model of non-monotonic reasoning is established and distributed algorithms for belief revision are presented. We show that, in singly connect… ▽ More This paper extends the applications of belief-networks to include the revision of belief commitments, i.e., the categorical acceptance of a subset of hypotheses which, together, constitute the most satisfactory explanation of the evidence at hand. A coherent model of non-monotonic reasoning is established and distributed algorithms for belief revision are presented. We show that, in singly connected networks, the most satisfactory explanation can be found in linear time by a message-passing algorithm similar to the one used in belief updating. In multiply-connected networks, the problem may be exponentially hard but, if the network is sparse, topological considerations can be used to render the interpretation task tractable. In general, finding the most probable combination of hypotheses is no more complex than computing the degree of belief for any individual hypothesis. Applications to medical diagnosis are illustrated. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)

Report number: UAI-P-1986-PG-201-210

arXiv:1304.2736 [pdf]

The Recovery of Causal Poly-Trees from Statistical Data

Authors: George Rebane, Judea Pearl

Abstract: Poly-trees are singly connected causal networks in which variables may arise from multiple causes. This paper develops a method of recovering ply-trees from empirically measured probability distributions of pairs of variables. The method guarantees that, if the measured distributions are generated by a causal process structured as a ply-tree then the topological structure of such tree can be recov… ▽ More Poly-trees are singly connected causal networks in which variables may arise from multiple causes. This paper develops a method of recovering ply-trees from empirically measured probability distributions of pairs of variables. The method guarantees that, if the measured distributions are generated by a causal process structured as a ply-tree then the topological structure of such tree can be recovered precisely and, in addition, the causal directionality of the branches can be determined up to the maximum extent possible. The method also pinpoints the minimum (if any) external semantics required to determine the causal relationships among the variables considered. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Third Conference on Uncertainty in Artificial Intelligence (UAI1987)

Report number: UAI-P-1987-PG-222-228

arXiv:1304.2730 [pdf]

Structuring Causal Tree Models with Continuous Variables

Authors: Lei Xu, Judea Pearl

Abstract: This paper considers the problem of invoking auxiliary, unobservable variables to facilitate the structuring of causal tree models for a given set of continuous variables. Paralleling the treatment of bi-valued variables in [Pearl 1986], we show that if a collection of coupled variables are governed by a joint normal distribution and a tree-structured representation exists, then both the topology… ▽ More This paper considers the problem of invoking auxiliary, unobservable variables to facilitate the structuring of causal tree models for a given set of continuous variables. Paralleling the treatment of bi-valued variables in [Pearl 1986], we show that if a collection of coupled variables are governed by a joint normal distribution and a tree-structured representation exists, then both the topology and all internal relationships of the tree can be uncovered by observing pairwise dependencies among the observed variables (i.e., the leaves of the tree). Furthermore, the conditions for normally distributed variables are less restrictive than those governing bi-valued variables. The result extends the applications of causal tree models which were found useful in evidential reasoning tasks. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Third Conference on Uncertainty in Artificial Intelligence (UAI1987)

Report number: UAI-P-1987-PG-170-179

arXiv:1304.2716 [pdf]

Do We Need Higher-Order Probabilities and, If So, What Do They Mean?

Authors: Judea Pearl

Abstract: The apparent failure of individual probabilistic expressions to distinguish uncertainty about truths from uncertainty about probabilistic assessments have prompted researchers to seek formalisms where the two types of uncertainties are given notational distinction. This paper demonstrates that the desired distinction is already a built-in feature of classical probabilistic models, thus, specialize… ▽ More The apparent failure of individual probabilistic expressions to distinguish uncertainty about truths from uncertainty about probabilistic assessments have prompted researchers to seek formalisms where the two types of uncertainties are given notational distinction. This paper demonstrates that the desired distinction is already a built-in feature of classical probabilistic models, thus, specialized notations are unnecessary. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Third Conference on Uncertainty in Artificial Intelligence (UAI1987)

Report number: UAI-P-1987-PG-47-60

arXiv:1304.2379 [pdf]

Causal Networks: Semantics and Expressiveness

Authors: Tom S. Verma, Judea Pearl

Abstract: Dependency knowledge of the form "x is independent of y once z is known" invariably obeys the four graphoid axioms, examples include probabilistic and database dependencies. Often, such knowledge can be represented efficiently with graphical structures such as undirected graphs and directed acyclic graphs (DAGs). In this paper we show that the graphical criterion called d-separation is a sound r… ▽ More Dependency knowledge of the form "x is independent of y once z is known" invariably obeys the four graphoid axioms, examples include probabilistic and database dependencies. Often, such knowledge can be represented efficiently with graphical structures such as undirected graphs and directed acyclic graphs (DAGs). In this paper we show that the graphical criterion called d-separation is a sound rule for reading independencies from any DAG based on a causal input list drawn from a graphoid. The rule may be extended to cover DAGs that represent functional dependencies as well as conditional dependencies. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Fourth Conference on Uncertainty in Artificial Intelligence (UAI1988)

Report number: UAI-P-1988-PG-352-359

arXiv:1304.2355 [pdf]

On the Logic of Causal Models

Authors: Dan Geiger, Judea Pearl

Abstract: This paper explores the role of Directed Acyclic Graphs (DAGs) as a representation of conditional independence relationships. We show that DAGs offer polynomially sound and complete inference mechanisms for inferring conditional independence relationships from a given causal set of such relationships. As a consequence, d-separation, a graphical criterion for identifying independencies in a DAG,… ▽ More This paper explores the role of Directed Acyclic Graphs (DAGs) as a representation of conditional independence relationships. We show that DAGs offer polynomially sound and complete inference mechanisms for inferring conditional independence relationships from a given causal set of such relationships. As a consequence, d-separation, a graphical criterion for identifying independencies in a DAG, is shown to uncover more valid independencies then any other criterion. In addition, we employ the Armstrong property of conditional independence to show that the dependence relationships displayed by a DAG are inherently consistent, i.e. for every DAG D there exists some probability distribution P that embodies all the conditional independencies displayed in D and none other. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Fourth Conference on Uncertainty in Artificial Intelligence (UAI1988)

Report number: UAI-P-1988-PG-136-147

arXiv:1304.1507 [pdf]

Deciding Consistency of Databases Containing Defeasible and Strict Information

Authors: Moises Goldszmidt, Judea Pearl

Abstract: We propose a norm of consistency for a mixed set of defeasible and strict sentences, based on a probabilistic semantics. This norm establishes a clear distinction between knowledge bases depicting exceptions and those containing outright contradictions. We then define a notion of entailment based also on probabilistic considerations and provide a characterization of the relation between consiste… ▽ More We propose a norm of consistency for a mixed set of defeasible and strict sentences, based on a probabilistic semantics. This norm establishes a clear distinction between knowledge bases depicting exceptions and those containing outright contradictions. We then define a notion of entailment based also on probabilistic considerations and provide a characterization of the relation between consistency and entailment. We derive necessary and sufficient conditions for consistency, and provide a simple decision procedure for testing consistency and deciding whether a sentence is entailed by a database. Finally, it is shown that if al1 sentences are Horn clauses, consistency and entailment can be tested in polynomial time. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Fifth Conference on Uncertainty in Artificial Intelligence (UAI1989)

Report number: UAI-P-1989-PG-134-141

arXiv:1304.1505 [pdf]

d-Separation: From Theorems to Algorithms

Authors: Dan Geiger, Tom S. Verma, Judea Pearl

Abstract: An efficient algorithm is developed that identifies all independencies implied by the topology of a Bayesian network. Its correctness and maximality stems from the soundness and completeness of d-separation with respect to probability theory. The algorithm runs in time O (l E l) where E is the number of edges in the network. An efficient algorithm is developed that identifies all independencies implied by the topology of a Bayesian network. Its correctness and maximality stems from the soundness and completeness of d-separation with respect to probability theory. The algorithm runs in time O (l E l) where E is the number of edges in the network. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Fifth Conference on Uncertainty in Artificial Intelligence (UAI1989)

Report number: UAI-P-1989-PG-118-125

arXiv:1304.1108 [pdf]

On the Equivalence of Causal Models

Authors: Tom S. Verma, Judea Pearl

Abstract: Scientists often use directed acyclic graphs (days) to model the qualitative structure of causal theories, allowing the parameters to be estimated from observational data. Two causal models are equivalent if there is no experiment which could distinguish one from the other. A canonical representation for causal models is presented which yields an efficient graphical criterion for deciding equiva… ▽ More Scientists often use directed acyclic graphs (days) to model the qualitative structure of causal theories, allowing the parameters to be estimated from observational data. Two causal models are equivalent if there is no experiment which could distinguish one from the other. A canonical representation for causal models is presented which yields an efficient graphical criterion for deciding equivalence, and provides a theoretical basis for extracting causal structures from empirical data. This representation is then extended to the more general case of an embedded causal model, that is, a dag in which only a subset of the variables are observable. The canonical representation presented here yields an efficient algorithm for determining when two embedded causal models reflect the same dependency information. This algorithm leads to a model theoretic definition of causation in terms of statistical dependencies. △ Less

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

Report number: UAI-P-1990-PG-220-227

arXiv:1303.5435 [pdf]

An Algorithm for Deciding if a Set of Observed Independencies Has a Causal Explanation

Authors: Tom S. Verma, Judea Pearl

Abstract: In a previous paper [Pearl and Verma, 1991] we presented an algorithm for extracting causal influences from independence information, where a causal influence was defined as the existence of a directed arc in all minimal causal models consistent with the data. In this paper we address the question of deciding whether there exists a causal model that explains ALL the observed dependencies and inde… ▽ More In a previous paper [Pearl and Verma, 1991] we presented an algorithm for extracting causal influences from independence information, where a causal influence was defined as the existence of a directed arc in all minimal causal models consistent with the data. In this paper we address the question of deciding whether there exists a causal model that explains ALL the observed dependencies and independencies. Formally, given a list M of conditional independence statements, it is required to decide whether there exists a directed acyclic graph (dag) D that is perfectly consistent with M, namely, every statement in M, and no other, is reflected via dseparation in D. We present and analyze an effective algorithm that tests for the existence of such a day, and produces one, if it exists. △ Less

Submitted 13 March, 2013; originally announced March 2013.

Comments: Appears in Proceedings of the Eighth Conference on Uncertainty in Artificial Intelligence (UAI1992)

Report number: UAI-P-1992-PG-323-330

arXiv:1303.5406 [pdf]

Reasoning With Qualitative Probabilities Can Be Tractable

Authors: Moises Goldszmidt, Judea Pearl

Abstract: We recently described a formalism for reasoning with if-then rules that re expressed with different levels of firmness [18]. The formalism interprets these rules as extreme conditional probability statements, specifying orders of magnitude of disbelief, which impose constraints over possible rankings of worlds. It was shown that, once we compute a priority function Z+ on the rules, the degree to… ▽ More We recently described a formalism for reasoning with if-then rules that re expressed with different levels of firmness [18]. The formalism interprets these rules as extreme conditional probability statements, specifying orders of magnitude of disbelief, which impose constraints over possible rankings of worlds. It was shown that, once we compute a priority function Z+ on the rules, the degree to which a given query is confirmed or denied can be computed in O(log n`) propositional satisfiability tests, where n is the number of rules in the knowledge base. In this paper, we show that computing Z+ requires O(n2 X log n) satisfiability tests, not an exponential number as was conjectured in [18], which reduces to polynomial complexity in the case of Horn expressions. We also show how reasoning with imprecise observations can be incorporated in our formalism and how the popular notions of belief revision and epistemic entrenchment are embodied naturally and tractably. △ Less

Submitted 13 March, 2013; originally announced March 2013.

Comments: Appears in Proceedings of the Eighth Conference on Uncertainty in Artificial Intelligence (UAI1992)

Report number: UAI-P-1992-PG-112-120

arXiv:1303.1501 [pdf]

Deciding Morality of Graphs is NP-complete

Authors: Tom S. Verma, Judea Pearl

Abstract: In order to find a causal explanation for data presented in the form of covariance and concentration matrices it is necessary to decide if the graph formed by such associations is a projection of a directed acyclic graph (dag). We show that the general problem of deciding whether such a dag exists is NP-complete. In order to find a causal explanation for data presented in the form of covariance and concentration matrices it is necessary to decide if the graph formed by such associations is a projection of a directed acyclic graph (dag). We show that the general problem of deciding whether such a dag exists is NP-complete. △ Less

Submitted 6 March, 2013; originally announced March 2013.

Comments: Appears in Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence (UAI1993)

Report number: UAI-P-1993-PG-391-399

arXiv:1303.1455 [pdf]

From Conditional Oughts to Qualitative Decision Theory

Authors: Judea Pearl

Abstract: The primary theme of this investigation is a decision theoretic account of conditional ought statements (e.g., "You ought to do A, if C") that rectifies glaring deficiencies in classical deontic logic. The resulting account forms a sound basis for qualitative decision theory, thus providing a framework for qualitative planning under uncertainty. In particular, we show that adding causal relation… ▽ More The primary theme of this investigation is a decision theoretic account of conditional ought statements (e.g., "You ought to do A, if C") that rectifies glaring deficiencies in classical deontic logic. The resulting account forms a sound basis for qualitative decision theory, thus providing a framework for qualitative planning under uncertainty. In particular, we show that adding causal relationships (in the form of a single graph) as part of an epistemic state is sufficient to facilitate the analysis of action sequences, their consequences, their interaction with observations, their expected utilities and, hence, the synthesis of plans and strategies under uncertainty. △ Less

Submitted 6 March, 2013; originally announced March 2013.

Comments: Appears in Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence (UAI1993)

Report number: UAI-P-1993-PG-12-20

arXiv:1302.6835 [pdf]

A Probabilistic Calculus of Actions

Authors: Judea Pearl

Abstract: We present a symbolic machinery that admits both probabilistic and causal information about a given domain and produces probabilistic statements about the effect of actions and the impact of observations. The calculus admits two types of conditioning operators: ordinary Bayes conditioning, P(y|X = x), which represents the observation X = x, and causal conditioning, P(y|do(X = x)), read the probab… ▽ More We present a symbolic machinery that admits both probabilistic and causal information about a given domain and produces probabilistic statements about the effect of actions and the impact of observations. The calculus admits two types of conditioning operators: ordinary Bayes conditioning, P(y|X = x), which represents the observation X = x, and causal conditioning, P(y|do(X = x)), read the probability of Y = y conditioned on holding X constant (at x) by deliberate action. Given a mixture of such observational and causal sentences, together with the topology of the causal graph, the calculus derives new conditional probabilities of both types, thus enabling one to quantify the effects of actions (and policies) from partially specified knowledge bases, such as Bayesian networks in which some conditional probabilities may not be available. △ Less

Submitted 27 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

Report number: UAI-P-1994-PG-454-462

arXiv:1302.6809 [pdf]

On Testing Whether an Embedded Bayesian Network Represents a Probability Model

Authors: Dan Geiger, Azaria Paz, Judea Pearl

Abstract: Testing the validity of probabilistic models containing unmeasured (hidden) variables is shown to be a hard task. We show that the task of testing whether models are structurally incompatible with the data at hand, requires an exponential number of independence evaluations, each of the form: "X is conditionally independent of Y, given Z." In contrast, a linear number of such evaluations is requi… ▽ More Testing the validity of probabilistic models containing unmeasured (hidden) variables is shown to be a hard task. We show that the task of testing whether models are structurally incompatible with the data at hand, requires an exponential number of independence evaluations, each of the form: "X is conditionally independent of Y, given Z." In contrast, a linear number of such evaluations is required to test a standard Bayesian network (one per vertex). On the positive side, we show that if a network with hidden variables G has a tree skeleton, checking whether G represents a given probability model P requires the polynomial number of such independence evaluations. Moreover, we provide an algorithm that efficiently constructs a tree-structured Bayesian network (with hidden variables) that represents P if such a network exists, and further recognizes when such a network does not exist. △ Less

Submitted 27 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

Report number: UAI-P-1994-PG-244-252

arXiv:1302.6784 [pdf]

Counterfactual Probabilities: Computational Methods, Bounds and Applications

Authors: Alexander Balke, Judea Pearl

Abstract: Evaluation of counterfactual queries (e.g., "If A were true, would C have been true?") is important to fault diagnosis, planning, and determination of liability. In this paper we present methods for computing the probabilities of such queries using the formulation proposed in [Balke and Pearl, 1994], where the antecedent of the query is interpreted as an external action that forces the propositio… ▽ More Evaluation of counterfactual queries (e.g., "If A were true, would C have been true?") is important to fault diagnosis, planning, and determination of liability. In this paper we present methods for computing the probabilities of such queries using the formulation proposed in [Balke and Pearl, 1994], where the antecedent of the query is interpreted as an external action that forces the proposition A to be true. When a prior probability is available on the causal mechanisms governing the domain, counterfactual probabilities can be evaluated precisely. However, when causal knowledge is specified as conditional probabilities on the observables, only bounds can computed. This paper develops techniques for evaluating these bounds, and demonstrates their use in two applications: (1) the determination of treatment efficacy from studies in which subjects may choose their own treatment, and (2) the determination of liability in product-safety litigation. △ Less

Submitted 27 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

Report number: UAI-P-1994-PG-46-54

arXiv:1302.4977 [pdf]

Probabilistic Evaluation of Sequential Plans from Causal Models with Hidden Variables

Authors: Judea Pearl, James M. Robins

Abstract: The paper concerns the probabilistic evaluation of plans in the presence of unmeasured variables, each plan consisting of several concurrent or sequential actions. We establish a graphical criterion for recognizing when the effects of a given plan can be predicted from passive observations on measured variables only. When the criterion is satisfied, a closed-form expression is provided for the p… ▽ More The paper concerns the probabilistic evaluation of plans in the presence of unmeasured variables, each plan consisting of several concurrent or sequential actions. We establish a graphical criterion for recognizing when the effects of a given plan can be predicted from passive observations on measured variables only. When the criterion is satisfied, a closed-form expression is provided for the probability that the plan will achieve a specified goal. △ Less

Submitted 20 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

Report number: UAI-P-1995-PG-444-453

arXiv:1302.4976 [pdf]

On the Testability of Causal Models with Latent and Instrumental Variables

Authors: Judea Pearl

Abstract: Certain causal models involving unmeasured variables induce no independence constraints among the observed variables but imply, nevertheless, inequality contraints on the observed distribution. This paper derives a general formula for such instrumental variables, that is, exogenous variables that directly affect some variables but not all. With the help of this formula, it is possible to test wh… ▽ More Certain causal models involving unmeasured variables induce no independence constraints among the observed variables but imply, nevertheless, inequality contraints on the observed distribution. This paper derives a general formula for such instrumental variables, that is, exogenous variables that directly affect some variables but not all. With the help of this formula, it is possible to test whether a model involving instrumental variables may account for the data, or, conversely, whether a given variables can be deemed instrumental. △ Less

Submitted 20 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

Report number: UAI-P-1995-PG-435-443

arXiv:1302.4948 [pdf]

Testing Identifiability of Causal Effects

Authors: David Galles, Judea Pearl

Abstract: This paper concerns the probabilistic evaluation of the effects of actions in the presence of unmeasured variables. We show that the identification of causal effect between a singleton variable X and a set of variables Y can be accomplished systematically, in time polynomial in the number of variables in the graph. When the causal effect is identifiable, a closed-form expression can be obtained… ▽ More This paper concerns the probabilistic evaluation of the effects of actions in the presence of unmeasured variables. We show that the identification of causal effect between a singleton variable X and a set of variables Y can be accomplished systematically, in time polynomial in the number of variables in the graph. When the causal effect is identifiable, a closed-form expression can be obtained for the probability that the action will achieve a specified goal, or a set of goals. △ Less

Submitted 20 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

Report number: UAI-P-1995-PG-185-195

arXiv:1302.4929 [pdf]

Counterfactuals and Policy Analysis in Structural Models

Authors: Alexander Balke, Judea Pearl

Abstract: Evaluation of counterfactual queries (e.g., "If A were true, would C have been true?") is important to fault diagnosis, planning, determination of liability, and policy analysis. We present a method of revaluating counterfactuals when the underlying causal model is represented by structural models - a nonlinear generalization of the simultaneous equations models commonly used in econometrics and… ▽ More Evaluation of counterfactual queries (e.g., "If A were true, would C have been true?") is important to fault diagnosis, planning, determination of liability, and policy analysis. We present a method of revaluating counterfactuals when the underlying causal model is represented by structural models - a nonlinear generalization of the simultaneous equations models commonly used in econometrics and social sciences. This new method provides a coherent means for evaluating policies involving the control of variables which, prior to enacting the policy were influenced by other variables in the system. △ Less

Submitted 20 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

Report number: UAI-P-1995-PG-11-18

arXiv:1302.3595 [pdf]

Identifying Independencies in Causal Graphs with Feedback

Authors: Judea Pearl, Rina Dechter

Abstract: We show that the d -separation criterion constitutes a valid test for conditional independence relationships that are induced by feedback systems involving discrete variables. We show that the d -separation criterion constitutes a valid test for conditional independence relationships that are induced by feedback systems involving discrete variables. △ Less

Submitted 13 February, 2013; originally announced February 2013.

Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

Report number: UAI-P-1996-PG-420-426

arXiv:1301.3898 [pdf]

Probabilities of Causation: Bounds and Identification

Authors: Jin Tian, Judea Pearl

Abstract: This paper deals with the problem of estimating the probability that one event was a cause of another in a given scenario. Using structural-semantical definitions of the probabilities of necessary or sufficient causation (or both), we show how to optimally bound these quantities from data obtained in experimental and observational studies, making minimal assumptions concerning the data-generating… ▽ More This paper deals with the problem of estimating the probability that one event was a cause of another in a given scenario. Using structural-semantical definitions of the probabilities of necessary or sufficient causation (or both), we show how to optimally bound these quantities from data obtained in experimental and observational studies, making minimal assumptions concerning the data-generating process. In particular, we strengthen the results of Pearl (1999) by weakening the data-generation assumptions and deriving theoretically sharp bounds on the probabilities of causation. These results delineate precisely how empirical data can be used both in settling questions of attribution and in solving attribution-related problems of decision making. △ Less

Submitted 16 January, 2013; originally announced January 2013.

Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

Report number: UAI-P-2000-PG-589-598

arXiv:1301.2312 [pdf]

Causal Discovery from Changes

Authors: Jin Tian, Judea Pearl

Abstract: We propose a new method of discovering causal structures, based on the detection of local, spontaneous changes in the underlying data-generating model. We analyze the classes of structures that are equivalent relative to a stream of distributions produced by local changes, and devise algorithms that output graphical representations of these equivalence classes. We present experimental results, usi… ▽ More We propose a new method of discovering causal structures, based on the detection of local, spontaneous changes in the underlying data-generating model. We analyze the classes of structures that are equivalent relative to a stream of distributions produced by local changes, and devise algorithms that output graphical representations of these equivalence classes. We present experimental results, using simulated data, and examine the errors associated with detection of changes and recovery of structures. △ Less

Submitted 10 January, 2013; originally announced January 2013.

Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

Report number: UAI-P-2001-PG-512-521

arXiv:1301.2300 [pdf]

Direct and Indirect Effects

Authors: Judea Pearl

Abstract: The direct effect of one eventon another can be defined and measured byholding constant all intermediate variables between the two.Indirect effects present conceptual andpractical difficulties (in nonlinear models), because they cannot be isolated by holding certain variablesconstant. This paper shows a way of defining any path-specific effectthat does not invoke blocking the remainingpaths.This p… ▽ More The direct effect of one eventon another can be defined and measured byholding constant all intermediate variables between the two.Indirect effects present conceptual andpractical difficulties (in nonlinear models), because they cannot be isolated by holding certain variablesconstant. This paper shows a way of defining any path-specific effectthat does not invoke blocking the remainingpaths.This permits the assessment of a more naturaltype of direct and indirect effects, one thatis applicable in both linear and nonlinear models. The paper establishesconditions under which such assessments can be estimated consistentlyfrom experimental and nonexperimental data,and thus extends path-analytic techniques tononlinear and nonparametric models. △ Less

Submitted 10 January, 2013; originally announced January 2013.

Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

Report number: UAI-P-2001-PG-411-420

Showing 1–50 of 67 results for author: Pearl, J