Showing 1–50 of 65 results for author: Heckerman, D

  1. arXiv:2412.02893  [pdf, other]

    cs.CL cs.AI cs.LG stat.AP stat.ME

    Removing Spurious Correlation from Neural Network Interpretations

    Authors: Milad Fotouhi, Mohammad Taha Bahadori, Oluwaseyi Feyisetan, Payman Arabshahi, David Heckerman

    Abstract: The existing algorithms for identification of neurons responsible for undesired and harmful behaviors do not consider the effects of confounders such as the topic of the conversation. In this work, we show that confounders can create spurious correlations and propose a new causal mediation approach that controls the impact of the topic. In experiments with two large language models, we study the local…

    Submitted 3 December, 2024; originally announced December 2024.

  2. arXiv:2408.11852  [pdf, other]

    cs.CL cs.AI cs.LG

    Fast Training Dataset Attribution via In-Context Learning

    Authors: Milad Fotouhi, Mohammad Taha Bahadori, Oluwaseyi Feyisetan, Payman Arabshahi, David Heckerman

    Abstract: We investigate the use of in-context learning and prompt engineering to estimate the contributions of training data in the outputs of instruction-tuned large language models (LLMs). We propose two novel approaches: (1) a similarity-based approach that measures the difference between LLM outputs with and without provided context, and (2) a mixture distribution model approach that frames the problem…

    Submitted 18 March, 2025; v1 submitted 14 August, 2024; originally announced August 2024.

  3. arXiv:2404.08839  [pdf, other]

    stat.ME cs.LG econ.EM stat.ML

    Multiply-Robust Causal Change Attribution

    Authors: Victor Quintas-Martinez, Mohammad Taha Bahadori, Eduardo Santiago, Jeff Mu, Dominik Janzing, David Heckerman

    Abstract: Comparing two samples of data, we observe a change in the distribution of an outcome variable. In the presence of multiple explanatory variables, how much of the change can be explained by each possible cause? We develop a new estimation strategy that, given a causal model, combines regression and re-weighting methods to quantify the contribution of each causal mechanism. Our proposed methodology…

    Submitted 5 September, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  4. arXiv:2302.05449  [pdf, other]

    cs.AI cs.GL

    Heckerthoughts

    Authors: David Heckerman

    Abstract: This manuscript is a technical memoir about my work at Stanford and Microsoft Research. Included are fundamental concepts central to machine learning and artificial intelligence, applications of these concepts, and stories behind their creation.

    Submitted 7 January, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Fixed typos around Equation 1 (thank you Xinlong Du), and added a philosophical note at the end of Section 3.5 about the perception that consciousness is unitary

    MSC Class: 68T01 ACM Class: I.2.0

  5. arXiv:2107.13068  [pdf, other]

    cs.LG stat.ME stat.ML

    End-to-End Balancing for Causal Continuous Treatment-Effect Estimation

    Authors: Mohammad Taha Bahadori, Eric Tchetgen Tchetgen, David E. Heckerman

    Abstract: We study the problem of observational causal inference with continuous treatments in the framework of inverse propensity-score weighting. To obtain stable weights, we design a new algorithm based on entropy balancing that learns weights to directly maximize causal inference accuracy using end-to-end optimization. In the process of optimization, these weights are automatically tuned to the specific…

    Submitted 10 July, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: To be presented in ICML 2022

    MSC Class: 62D20 ACM Class: I.2.6

  6. arXiv:2105.06241  [pdf, ps, other]

    cs.LG stat.ML

    Likelihoods and Parameter Priors for Bayesian Networks

    Authors: David Heckerman, Dan Geiger

    Abstract: We develop simple methods for constructing likelihoods and parameter priors for learning about the parameters and structure of a Bayesian network. In particular, we introduce several assumptions that permit the construction of likelihoods and parameter priors for a large number of Bayesian-network structures from a small set of assessments. The most notable assumption is that of likelihood equival…

    Submitted 29 June, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: This version has improved pointers to the literature

    ACM Class: I.2; G.3

  7. arXiv:2105.03248  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Parameter Priors for Directed Acyclic Graphical Models and the Characterization of Several Probability Distributions

    Authors: Dan Geiger, David Heckerman

    Abstract: We develop simple methods for constructing parameter priors for model choice among Directed Acyclic Graphical (DAG) models. In particular, we introduce several assumptions that permit the construction of parameter priors for a large number of DAG models from a small set of assessments. We then present a method for directly computing the marginal likelihood of every DAG model given a random sample…

    Submitted 29 June, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: This version has improved pointers to the literature. arXiv admin note: substantial text overlap with arXiv:1301.6697

    ACM Class: I.2; G.3

    Journal ref: The Annals of Statistics, 30: 1412-1440, 2002

  8. arXiv:2007.11500  [pdf, other]

    cs.LG stat.ML

    Debiasing Concept-based Explanations with Causal Analysis

    Authors: Mohammad Taha Bahadori, David E. Heckerman

    Abstract: The concept-based explanation approach is a popular model interpretability tool because it expresses the reasons for a model's predictions in terms of concepts that are meaningful for the domain experts. In this work, we study the problem of the concepts being correlated with confounding information in the features. We propose a new causal prior graph for modeling the impacts of unobserved variables a…

    Submitted 22 May, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: Accepted in ICLR 2021

  9. arXiv:2002.00269  [pdf, other]

    cs.LG cs.AI stat.ML

    A Tutorial on Learning With Bayesian Networks

    Authors: David Heckerman

    Abstract: A Bayesian network is a graphical model that encodes probabilistic relationships among variables of interest. When used in conjunction with statistical techniques, the graphical model has several advantages for data analysis. One, because the model encodes dependencies among all variables, it readily handles situations where some data entries are missing. Two, a Bayesian network can be used to lea…

    Submitted 10 January, 2022; v1 submitted 1 February, 2020; originally announced February 2020.

    Comments: Added a note on averaging causal models

    ACM Class: I.2; G.3

    Journal ref: Original version published in Learning in Graphical Models, M. Jordan, ed., MIT Press, Cambridge, MA, 1999

  10. arXiv:1911.06263  [pdf]

    cs.AI math.OC

    Probabilistic Similarity Networks

    Authors: David Heckerman

    Abstract: Normative expert systems have not become commonplace because they have been difficult to build and use. Over the past decade, however, researchers have developed the influence diagram, a graphical representation of a decision maker's beliefs, alternatives, and preferences that serves as the knowledge base of a normative expert system. Most people who have seen the representation find it intuitive…

    Submitted 6 November, 2019; originally announced November 2019.

    Report number: ISBN 0-262-01114-X MSC Class: 68T01 ACM Class: I.2.1; G.3

    Journal ref: Probabilistic Similarity Networks. MIT Press, Cambridge, MA, 1991

  11. arXiv:1910.09715  [pdf]

    stat.ML cs.AI cs.LG

    Embedded Bayesian Network Classifiers

    Authors: David Heckerman, Chris Meek

    Abstract: Low-dimensional probability models for local distribution functions in a Bayesian network include decision trees, decision graphs, and causal independence models. We describe a new probability model for discrete Bayesian networks, which we call an embedded Bayesian network classifier or EBNC. The model for a node $Y$ given parents $\mathbf{X}$ is obtained from a (usually different) Bayesian network for…

    Submitted 21 October, 2019; originally announced October 2019.

    Report number: Microsoft Research Technical Report MS-TR-97-06, March 1997 MSC Class: 68T99

  12. arXiv:1801.00727  [pdf, other]

    cs.AI stat.AP

    Accounting for hidden common causes when inferring cause and effect from observational data

    Authors: David Heckerman

    Abstract: Identifying causal relationships from observational data is difficult, in large part, due to the presence of hidden common causes. In some cases, where just the right patterns of conditional independence and dependence lie in the data (for example, Y-structures), it is possible to identify cause and effect. In other cases, the analyst deliberately makes an uncertain assumption that hidden common ca…

    Submitted 3 January, 2018; v1 submitted 2 January, 2018; originally announced January 2018.

    Comments: Presented at the NIPS workshop on causal inference (NIPS 2017), Long Beach, CA, USA

  13. arXiv:1611.02126  [pdf, ps, other]

    cs.AI math.CO math.PR

    Dependence and Relevance: A probabilistic view

    Authors: Dan Geiger, David Heckerman

    Abstract: We examine three probabilistic concepts related to the sentence "two variables have no bearing on each other". We explore the relationships between these three concepts and establish their relevance to the process of constructing similarity networks, a tool for acquiring probabilistic knowledge from human experts. We also establish a precise relationship between connectedness in Bayesian networks…

    Submitted 27 October, 2016; originally announced November 2016.

  14. arXiv:1407.7281  [pdf]

    cs.AI

    Modular Belief Updates and Confusion about Measures of Certainty in Artificial Intelligence Research

    Authors: Eric J. Horvitz, David Heckerman

    Abstract: Over the last decade, there has been growing interest in the use of measures of change in belief for reasoning with uncertainty in artificial intelligence research. An important characteristic of several methodologies that reason with changes in belief, or belief updates, is a property that we term modularity. We call updates that satisfy this property modular updates. Whereas probabilistic measure…

    Submitted 27 July, 2014; originally announced July 2014.

    Comments: Appears in Proceedings of the First Conference on Uncertainty in Artificial Intelligence (UAI1985)

    Report number: UAI-P-1985-PG-283-286

  15. arXiv:1304.3851   

    cs.AI

    Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence (1993)

    Authors: David Heckerman, E. Mamdani

    Abstract: This is the Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence, which was held in Washington, DC, July 9-11, 1993

    Submitted 13 April, 2013; originally announced April 2013.

    Report number: UAI1993

  16. arXiv:1304.3419  [pdf]

    cs.AI

    Probabilistic Interpretations for MYCIN's Certainty Factors

    Authors: David Heckerman

    Abstract: This paper examines the quantities used by MYCIN to reason with uncertainty, called certainty factors. It is shown that the original definition of certainty factors is inconsistent with the functions used in MYCIN to combine the quantities. This inconsistency is used to argue for a redefinition of certainty factors in terms of the intuitively appealing desiderata associated with the combining func…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the First Conference on Uncertainty in Artificial Intelligence (UAI1985)

    Report number: UAI-P-1985-PG-9-20

  17. arXiv:1304.3107  [pdf]

    cs.AI

    A Backwards View for Assessment

    Authors: Ross D. Shachter, David Heckerman

    Abstract: Much artificial intelligence research focuses on the problem of deducing the validity of unobservable propositions or hypotheses from observable evidence. Many of the knowledge representation techniques designed for this problem encode the relationship between evidence and hypothesis in a directed manner. Moreover, the direction in which evidence is stored is typically from evidence to hypothesis…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)

    Report number: UAI-P-1986-PG-237-242

  18. arXiv:1304.3091  [pdf]

    cs.AI

    An Axiomatic Framework for Belief Updates

    Authors: David Heckerman

    Abstract: In the 1940's, a physicist named Cox provided the first formal justification for the axioms of probability based on the subjective or Bayesian interpretation. He showed that if a measure of belief satisfies several fundamental properties, then the measure must be some monotonic transformation of a probability. In this paper, measures of change in belief or belief updates are examined. In the spiri…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)

    Report number: UAI-P-1986-PG-123-128

  19. arXiv:1304.3090  [pdf]

    cs.AI

    The Myth of Modularity in Rule-Based Systems

    Authors: David Heckerman, Eric J. Horvitz

    Abstract: In this paper, we examine the concept of modularity, an often cited advantage of the rule-based representation methodology. We argue that the notion of modularity consists of two distinct concepts which we call syntactic modularity and semantic modularity. We argue that when reasoning under certainty, it is reasonable to regard the rule-based approach as both syntactically and semantically modula…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)

    Report number: UAI-P-1986-PG-115-122

  20. arXiv:1304.2747  [pdf]

    cs.AI

    The Role of Calculi in Uncertain Inference Systems

    Authors: Michael P. Wellman, David Heckerman

    Abstract: Much of the controversy about methods for automated decision making has focused on specific calculi for combining beliefs or propagating uncertainty. We broaden the debate by (1) exploring the constellation of secondary tasks surrounding any primary decision problem, and (2) identifying knowledge engineering concerns that present additional representational tradeoffs. We argue on pragmatic grounds…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Third Conference on Uncertainty in Artificial Intelligence (UAI1987)

    Report number: UAI-P-1987-PG-321-331

  21. arXiv:1304.2724  [pdf]

    cs.AI

    A Perspective on Confidence and Its Use in Focusing Attention During Knowledge Acquisition

    Authors: David Heckerman, Holly B. Jimison

    Abstract: We present a representation of partial confidence in belief and preference that is consistent with the tenets of decision-theory. The fundamental insight underlying the representation is that if a person is not completely confident in a probability or utility assessment, additional modeling of the assessment may improve decisions to which it is relevant. We show how a traditional decision-analytic…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Third Conference on Uncertainty in Artificial Intelligence (UAI1987)

    Report number: UAI-P-1987-PG-123-131

  22. arXiv:1304.2357  [pdf, other]

    cs.AI

    An Empirical Comparison of Three Inference Methods

    Authors: David Heckerman

    Abstract: In this paper, an empirical evaluation of three inference methods for uncertain reasoning is presented in the context of Pathfinder, a large expert system for the diagnosis of lymph-node pathology. The inference procedures evaluated are (1) Bayes' theorem, assuming evidence is conditionally independent given each hypothesis; (2) odds-likelihood updating, assuming evidence is conditionally independ…

    Submitted 24 January, 2023; v1 submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Fourth Conference on Uncertainty in Artificial Intelligence (UAI1988). LaTeX errors corrected in this version

    Report number: UAI-P-1988-PG-158-169 MSC Class: 68T37 ACM Class: I.2.1

  23. arXiv:1304.1511  [pdf, other]

    cs.AI

    A Tractable Inference Algorithm for Diagnosing Multiple Diseases

    Authors: David Heckerman

    Abstract: We examine a probabilistic model for the diagnosis of multiple diseases. In the model, diseases and findings are represented as binary variables. Also, diseases are marginally independent, features are conditionally independent given disease instances, and diseases interact to produce findings via a noisy OR-gate. An algorithm for computing the posterior probability of each disease, given a set of…

    Submitted 5 December, 2022; v1 submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Fifth Conference on Uncertainty in Artificial Intelligence (UAI1989)

    Report number: UAI-P-1989-PG-174-181

  24. arXiv:1304.1510  [pdf]

    cs.AI

    The Compilation of Decision Models

    Authors: David Heckerman, John S. Breese, Eric J. Horvitz

    Abstract: We introduce and analyze the problem of the compilation of decision models from a decision-theoretic perspective. The techniques described allow us to evaluate various configurations of compiled knowledge given the nature of evidential relationships in a domain, the utilities associated with alternative actions, the costs of run-time delays, and the costs of memory. We describe procedures for sele…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Fifth Conference on Uncertainty in Artificial Intelligence (UAI1989)

    Report number: UAI-P-1989-PG-162-173

  25. arXiv:1304.1145  [pdf]

    cs.AI

    Separable and transitive graphoids

    Authors: Dan Geiger, David Heckerman

    Abstract: We examine three probabilistic formulations of the sentence a and b are totally unrelated with respect to a given set of variables U. First, two variables a and b are totally independent if they are independent given any value of any subset of the variables in U. Second, two variables are totally uncoupled if U can be partitioned into two marginally independent sets containing a and b respectively…

    Submitted 16 May, 2015; v1 submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-538-545

  26. arXiv:1304.1114  [pdf]

    cs.AI

    A Combination of Cutset Conditioning with Clique-Tree Propagation in the Pathfinder System

    Authors: Jaap Suermondt, Gregory F. Cooper, David Heckerman

    Abstract: Cutset conditioning and clique-tree propagation are two popular methods for performing exact probabilistic inference in Bayesian belief networks. Cutset conditioning is based on decomposition of a subset of network nodes, whereas clique-tree propagation depends on aggregation of nodes. We describe a means to combine cutset conditioning and clique-tree propagation in an approach called aggregati…

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-273-280

  27. arXiv:1304.1091  [pdf]

    cs.AI

    Problem Formulation as the Reduction of a Decision Model

    Authors: David Heckerman, Eric J. Horvitz

    Abstract: In this paper, we extend the QMRDT probabilistic model for the domain of internal medicine to include decisions about treatments. In addition, we describe how we can use the comprehensive decision model to construct a simpler decision model for a specific patient. In so doing, we transform the task of problem formulation to that of narrowing of a larger problem.

    Submitted 16 May, 2015; v1 submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-82-89

  28. arXiv:1304.1085  [pdf]

    cs.AI

    Similarity Networks for the Construction of Multiple-Faults Belief Networks

    Authors: David Heckerman

    Abstract: A similarity network is a tool for constructing belief networks for the diagnosis of a single fault. In this paper, we examine modifications to the similarity-network representation that facilitate the construction of belief networks for the diagnosis of multiple coexisting faults.

    Submitted 16 May, 2015; v1 submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-32-39

  29. arXiv:1303.5720  [pdf]

    cs.AI

    An Approximate Nonmyopic Computation for Value of Information

    Authors: David Heckerman, Eric J. Horvitz, Blackford Middleton

    Abstract: Value-of-information analyses provide a straightforward means for selecting the best next observation to make, and for determining whether it is better to gather additional information or to act immediately. Determining the next best test to perform, given a state of uncertainty about the world, requires a consideration of the value of making all possible sequences of observations. In practice, de…

    Submitted 16 May, 2015; v1 submitted 20 March, 2013; originally announced March 2013.

    Comments: Appears in Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence (UAI1991)

    Report number: UAI-P-1991-PG-135-141

  30. arXiv:1303.5718  [pdf]

    cs.AI

    Advances in Probabilistic Reasoning

    Authors: Dan Geiger, David Heckerman

    Abstract: This paper discusses multiple Bayesian network representation paradigms for encoding asymmetric independence assertions. We offer three contributions: (1) an inference mechanism that makes explicit use of asymmetric independence to speed up computations, (2) a simplified definition of similarity networks and extensions of their theory, and (3) a generalized representation scheme that encodes more…

    Submitted 16 May, 2015; v1 submitted 20 March, 2013; originally announced March 2013.

    Comments: Appears in Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence (UAI1991)

    Report number: UAI-P-1991-PG-118-126

  31. arXiv:1303.1493  [pdf]

    cs.AI

    Inference Algorithms for Similarity Networks

    Authors: Dan Geiger, David Heckerman

    Abstract: We examine two types of similarity networks each based on a distinct notion of relevance. For both types of similarity networks we present an efficient inference algorithm that works under the assumption that every event has a nonzero probability of occurrence. Another inference algorithm is developed for type 1 similarity networks that works under no restriction, albeit less efficiently.

    Submitted 16 May, 2015; v1 submitted 6 March, 2013; originally announced March 2013.

    Comments: Appears in Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence (UAI1993)

    Report number: UAI-P-1993-PG-326-334

  32. arXiv:1303.1468  [pdf]

    cs.AI

    Causal Independence for Knowledge Acquisition and Inference

    Authors: David Heckerman

    Abstract: I introduce a temporal belief-network representation of causal independence that a knowledge engineer can use to elicit probabilistic models. Like the current, atemporal belief-network representation of causal independence, the new representation makes knowledge acquisition tractable. Unlike the atemporal representation, however, the temporal representation can simplify inference, and does not req…

    Submitted 16 May, 2015; v1 submitted 6 March, 2013; originally announced March 2013.

    Comments: Appears in Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence (UAI1993)

    Report number: UAI-P-1993-PG-122-127

  33. arXiv:1303.1463  [pdf]

    cs.AI

    Diagnosis of Multiple Faults: A Sensitivity Analysis

    Authors: David Heckerman, Michael Shwe

    Abstract: We compare the diagnostic accuracy of three diagnostic inference models: the simple Bayes model, the multimembership Bayes model, which is isomorphic to the parallel combination function in the certainty-factor model, and a model that incorporates the noisy OR-gate interaction. The comparison is done on 20 clinicopathological conference (CPC) cases from the American Journal of Medicine-challenging…

    Submitted 16 May, 2015; v1 submitted 6 March, 2013; originally announced March 2013.

    Comments: Appears in Proceedings of the Ninth Conference on Uncertainty in Artificial Intelligence (UAI1993)

    Report number: UAI-P-1993-PG-80-87

  34. arXiv:1302.6816  [pdf]

    cs.AI

    A Decision-Based View of Causality

    Authors: David Heckerman, Ross D. Shachter

    Abstract: Most traditional models of uncertainty have focused on the associational relationship among variables as captured by conditional dependence. In order to successfully manage intelligent systems for decision making, however, we must be able to predict the effects of actions. In this paper, we attempt to unite two branches of research that address such predictions: causal modeling and decision analys…

    Submitted 16 May, 2015; v1 submitted 27 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

    Report number: UAI-P-1994-PG-302-310

  35. arXiv:1302.6815  [pdf]

    cs.AI

    Learning Bayesian Networks: The Combination of Knowledge and Statistical Data

    Authors: David Heckerman, Dan Geiger, David Maxwell Chickering

    Abstract: We describe algorithms for learning Bayesian networks from a combination of user knowledge and statistical data. The algorithms have two components: a scoring metric and a search procedure. The scoring metric takes a network structure, statistical data, and a user's prior knowledge, and returns a score proportional to the posterior probability of the network structure given the data. The search pr…

    Submitted 16 May, 2015; v1 submitted 27 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

    Report number: UAI-P-1994-PG-293-301

  36. arXiv:1302.6814  [pdf]

    cs.AI

    A New Look at Causal Independence

    Authors: David Heckerman, John S. Breese

    Abstract: Heckerman (1993) defined causal independence in terms of a set of temporal conditional independence statements. These statements formalized certain types of causal interaction where (1) the effect is independent of the order that causes are introduced and (2) the impact of a single cause on the effect does not depend on what other causes have previously been applied. In this paper, we introduce an…

    Submitted 16 May, 2015; v1 submitted 27 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

    Report number: UAI-P-1994-PG-286-292

  37. arXiv:1302.6808  [pdf, other]

    cs.AI cs.LG stat.ML

    Learning Gaussian Networks

    Authors: Dan Geiger, David Heckerman

    Abstract: We describe algorithms for learning Bayesian networks from a combination of user knowledge and statistical data. The algorithms have two components: a scoring metric and a search procedure. The scoring metric takes a network structure, statistical data, and a user's prior knowledge, and returns a score proportional to the posterior probability of the network structure given the data. The search pr…

    Submitted 27 June, 2021; v1 submitted 27 February, 2013; originally announced February 2013.

    Comments: This version has improved pointers to the literature

    Report number: UAI-P-1994-PG-235-243 ACM Class: I.2; G.3

  38. arXiv:1302.4958  [pdf]

    cs.AI

    A Bayesian Approach to Learning Causal Networks

    Authors: David Heckerman

    Abstract: Whereas acausal Bayesian networks represent probabilistic independence, causal Bayesian networks represent causal relationships. In this paper, we examine Bayesian methods for learning both types of networks. Bayesian methods for learning acausal networks are fairly well developed. These methods often employ assumptions to facilitate the construction of priors, including the assumptions of paramet…

    Submitted 16 May, 2015; v1 submitted 20 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

    Report number: UAI-P-1995-PG-285-295

  39. arXiv:1302.4957  [pdf, other]

    cs.AI

    Learning Bayesian Networks: A Unification for Discrete and Gaussian Domains

    Authors: David Heckerman, Dan Geiger

    Abstract: We examine Bayesian methods for learning Bayesian networks from a combination of prior knowledge and statistical data. In particular, we unify the approaches we presented at last year's conference for discrete and Gaussian domains. We derive a general Bayesian scoring metric, appropriate for both domains. We then use this metric in combination with well-known statistical facts about the Dirichlet…

    Submitted 29 June, 2021; v1 submitted 20 February, 2013; originally announced February 2013.

    Comments: This version has improved pointers to the literature

    Report number: UAI-P-1995-PG-274-284 ACM Class: I.2; G.3

  40. arXiv:1302.4956  [pdf]

    cs.AI

    A Definition and Graphical Representation for Causality

    Authors: David Heckerman, Ross D. Shachter

    Abstract: We present a precise definition of cause and effect in terms of a fundamental notion called unresponsiveness. Our definition is based on Savage's (1954) formulation of decision theory and departs from the traditional view of causation in that our causal assertions are made relative to a set of decisions. An important consequence of this departure is that we can reason about cause locally, not requ…

    Submitted 16 May, 2015; v1 submitted 20 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

    Report number: UAI-P-1995-PG-262-273

  41. arXiv:1302.4949  [pdf]

    cs.AI cs.LG

    A Characterization of the Dirichlet Distribution with Application to Learning Bayesian Networks

    Authors: Dan Geiger, David Heckerman

    Abstract: We provide a new characterization of the Dirichlet distribution. This characterization implies that under assumptions made by several previous authors for learning belief networks, a Dirichlet prior on the parameters is inevitable.

    Submitted 20 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence (UAI1995)

    Report number: UAI-P-1995-PG-196-207

  42. arXiv:1302.3580  [pdf]

    cs.LG cs.AI stat.ML

    Asymptotic Model Selection for Directed Networks with Hidden Variables

    Authors: Dan Geiger, David Heckerman, Christopher Meek

    Abstract: We extend the Bayesian Information Criterion (BIC), an asymptotic approximation for the marginal likelihood, to Bayesian networks with hidden variables. This approximation can be used to select models given large samples of data. The standard BIC as well as our extension punishes the complexity of a model according to the dimension of its parameters. We argue that the dimension of a Bayesian netwo…

    Submitted 16 May, 2015; v1 submitted 13 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

    Report number: UAI-P-1996-PG-283-290

  43. arXiv:1302.3567  [pdf]

    cs.LG cs.AI stat.ML

    Efficient Approximations for the Marginal Likelihood of Incomplete Data Given a Bayesian Network

    Authors: David Maxwell Chickering, David Heckerman

    Abstract: We discuss Bayesian methods for learning Bayesian networks when data sets are incomplete. In particular, we examine asymptotic approximations for the marginal likelihood of incomplete data given a Bayesian network. We consider the Laplace approximation and the less accurate but more efficient BIC/MDL approximation. We also consider approximations proposed by Draper (1993) and Cheeseman and Stutz (…

    Submitted 16 May, 2015; v1 submitted 13 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

    Report number: UAI-P-1996-PG-158-168

  44. arXiv:1302.3563  [pdf]

    cs.AI

    Decision-Theoretic Troubleshooting: A Framework for Repair and Experiment

    Authors: John S. Breese, David Heckerman

    Abstract: We develop and extend existing decision-theoretic methods for troubleshooting a nonfunctioning device. Traditionally, diagnosis with Bayesian networks has focused on belief updating: determining the probabilities of various faults given current observations. In this paper, we extend this paradigm to include taking actions. In particular, we consider three classes of actions: (1) we can make obser…

    Submitted 17 May, 2015; v1 submitted 13 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI1996)

    Report number: UAI-P-1996-PG-124-132

  45. arXiv:1302.1561  [pdf]

    cs.AI cs.LG

    Structure and Parameter Learning for Causal Independence and Causal Interaction Models

    Authors: Christopher Meek, David Heckerman

    Abstract: This paper discusses causal independence models and a generalization of these models called causal interaction models. Causal interaction models are models that have independent mechanisms where a mechanism can have several causes. In addition to introducing several particular types of causal interaction models, we show how we can apply the Bayesian approach to learning causal interaction models o…

    Submitted 16 May, 2015; v1 submitted 6 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

    Report number: UAI-P-1997-PG-366-375

  46. arXiv:1302.1545  [pdf]

    cs.LG stat.ML

    Models and Selection Criteria for Regression and Classification

    Authors: David Heckerman, Christopher Meek

    Abstract: When performing regression or classification, we are interested in the conditional probability distribution for an outcome or class variable Y given a set of explanatory or input variables X. We consider Bayesian models for this task. In particular, we examine a special class of models, which we call Bayesian regression/classification (BRC) models, that can be factored into independent conditiona…

    Submitted 6 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

    Report number: UAI-P-1997-PG-223-228

  47. arXiv:1302.1528  [pdf]

    cs.LG cs.AI stat.ML

    A Bayesian Approach to Learning Bayesian Networks with Local Structure

    Authors: David Maxwell Chickering, David Heckerman, Christopher Meek

    Abstract: Recently several researchers have investigated techniques for using data to learn Bayesian networks containing compact representations for the conditional probability distributions (CPDs) stored at each node. The majority of this work has concentrated on using decision-tree representations for the CPDs. In addition, researchers typically apply non-Bayesian (or asymptotically Bayesian) scoring func…

    Submitted 16 May, 2015; v1 submitted 6 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

    Report number: UAI-P-1997-PG-80-89

  48. arXiv:1301.7415  [pdf]

    cs.LG cs.AI stat.ML

    Learning Mixtures of DAG Models

    Authors: Bo Thiesson, Christopher Meek, David Maxwell Chickering, David Heckerman

    Abstract: We describe computationally efficient methods for learning mixtures in which each component is a directed acyclic graphical model (mixtures of DAGs or MDAGs). We argue that simple search-and-score algorithms are infeasible for a variety of problems, and introduce a feasible approach in which parameter and structure search is interleaved and expected data is treated as real data. Our approach can b…

    Submitted 16 May, 2015; v1 submitted 30 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

    Report number: UAI-P-1998-PG-504-513

  49. arXiv:1301.7401  [pdf]

    cs.LG stat.ML

    An Experimental Comparison of Several Clustering and Initialization Methods

    Authors: Marina Meila, David Heckerman

    Abstract: We examine methods for clustering in high dimensions. In the first part of the paper, we perform an experimental comparison between three batch clustering algorithms: the Expectation-Maximization (EM) algorithm, a winner-take-all version of the EM algorithm reminiscent of the K-means algorithm, and model-based hierarchical agglomerative clustering. We learn naive-Bayes models with a hidden root no…

    Submitted 16 May, 2015; v1 submitted 30 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

    Report number: UAI-P-1998-PG-386-395

  50. arXiv:1301.7385  [pdf]

    cs.AI cs.HC

    The Lumiere Project: Bayesian User Modeling for Inferring the Goals and Needs of Software Users

    Authors: Eric J. Horvitz, John S. Breese, David Heckerman, David Hovel, Koos Rommelse

    Abstract: The Lumiere Project centers on harnessing probability and utility to provide assistance to computer software users. We review work on Bayesian user models that can be employed to infer a user's needs by considering a user's background, actions, and queries. Several problems were tackled in Lumiere research, including (1) the construction of Bayesian models for reasoning about the time-varying goal…

    Submitted 30 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

    Report number: UAI-P-1998-PG-256-265