Skip to main content

Showing 1–18 of 18 results for author: Draper, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2409.04729  [pdf, other

    stat.ME cond-mat.stat-mech physics.comp-ph stat.CO

    A Unified Framework for Cluster Methods with Tensor Networks

    Authors: Erdong Guo, David Draper

    Abstract: Markov Chain Monte Carlo (MCMC), and Tensor Networks (TN) are two powerful frameworks for numerically investigating many-body systems, each offering distinct advantages. MCMC, with its flexibility and theoretical consistency, is well-suited for simulating arbitrary systems by sampling. TN, on the other hand, provides a powerful tensor-based language for capturing the entanglement properties intrin… ▽ More

    Submitted 7 September, 2024; originally announced September 2024.

    Comments: Dedicated to the memory of Prof. David Draper

  2. arXiv:2302.07779  [pdf, ps, other

    stat.ME

    Discussion of Martingale Posterior Distributions by E. Fong, C. Holmes, and S. G. Walker

    Authors: David Draper, Erdong Guo

    Abstract: In this discussion note, we respond to the fascinating paper "Martingale Posterior Distributions" by E. Fong, C. Holmes, and S. G. Walker with a couple of comments. On the basis of previous research, a theorem is stated regarding the relationship between frequentist bootstrap and stick-breaking process.

    Submitted 14 January, 2023; originally announced February 2023.

    Comments: 2 pages. A written contribution to the RSS discussion meeting for JRSS-B

  3. arXiv:2212.13621  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Annealing Double-Head: An Architecture for Online Calibration of Deep Neural Networks

    Authors: Erdong Guo, David Draper, Maria De Iorio

    Abstract: Model calibration, which is concerned with how frequently the model predicts correctly, not only plays a vital part in statistical model design, but also has substantial practical applications, such as optimal decision-making in the real world. However, it has been discovered that modern deep neural networks are generally poorly calibrated due to the overestimation (or underestimation) of predicti… ▽ More

    Submitted 15 January, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: Revised Preprint. 19 pages, 10 figures, 4 tables. Typos fixed, and references added

  4. arXiv:2111.14046  [pdf, other

    stat.ML cs.LG cs.NE quant-ph

    Neural Tangent Kernel of Matrix Product States: Convergence and Applications

    Authors: Erdong Guo, David Draper

    Abstract: In this work, we study the Neural Tangent Kernel (NTK) of Matrix Product States (MPS) and the convergence of its NTK in the infinite bond dimensional limit. We prove that the NTK of MPS asymptotically converges to a constant matrix during the gradient descent (training) process (and also the initialization phase) as the bond dimensions of MPS go to infinity by the observation that the variation of… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: 19 pages, 1 figure

  5. arXiv:2111.14040  [pdf, other

    math.PR math.ST stat.AP stat.ME

    A Simple Necessary Condition For Independence of Real-Valued Random Variables

    Authors: David Draper, Erdong Guo, Robert Lund, Jon Woody

    Abstract: The standard method to check for the independence of two real-valued random variables -- demonstrating that the bivariate joint distribution factors into the product of its marginals -- is both necessary and sufficient. Here we present a simple necessary condition based on the support sets of the random variables, which -- if not satisfied -- avoids the need to extract the marginals from the joint… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: 25 pages, 5 figures

  6. arXiv:2111.12267  [pdf, other

    stat.OT math.ST stat.AP stat.ME

    The Practical Scope of the Central Limit Theorem

    Authors: David Draper, Erdong Guo

    Abstract: The \textit{Central Limit Theorem (CLT)} is at the heart of a great deal of applied problem-solving in statistics and data science, but the theorem is silent on an important implementation issue: \textit{how much data do you need for the CLT to give accurate answers to practical questions?} Here we examine several approaches to addressing this issue -- along the way reviewing the history of this p… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 47 pages, 17 figures

  7. arXiv:2103.08277  [pdf, ps, other

    stat.ML cs.LG cs.NE quant-ph

    Representation Theorem for Matrix Product States

    Authors: Erdong Guo, David Draper

    Abstract: In this work, we investigate the universal representation capacity of the Matrix Product States (MPS) from the perspective of boolean functions and continuous functions. We show that MPS can accurately realize arbitrary boolean functions by providing a construction method of the corresponding MPS structure for an arbitrarily given boolean gate. Moreover, we prove that the function space of MPS wit… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 19 pages

  8. arXiv:2101.02333  [pdf, ps, other

    stat.ML cs.LG cs.NE

    Infinitely Wide Tensor Networks as Gaussian Process

    Authors: Erdong Guo, David Draper

    Abstract: Gaussian Process is a non-parametric prior which can be understood as a distribution on the function space intuitively. It is known that by introducing appropriate prior to the weights of the neural networks, Gaussian Process can be obtained by taking the infinite-width limit of the Bayesian neural networks from a Bayesian perspective. In this paper, we explore the infinitely wide Tensor Networks… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: 20 pages, 4 figures

  9. arXiv:2101.00245  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    The Bayesian Method of Tensor Networks

    Authors: Erdong Guo, David Draper

    Abstract: Bayesian learning is a powerful learning framework which combines the external information of the data (background information) with the internal information (training data) in a logically consistent way in inference and prediction. By Bayes rule, the external information (prior distribution) and the internal information (training data likelihood) are combined coherently, and the posterior distrib… ▽ More

    Submitted 1 January, 2021; originally announced January 2021.

    Comments: 13 pages, 4 figures

  10. arXiv:1712.00849  [pdf, other

    stat.CO

    Comment: A brief survey of the current state of play for Bayesian computation in data science at Big-Data scale

    Authors: David Draper, Alexander Terenin

    Abstract: We wish to contribute to the discussion of "Comparing Consensus Monte Carlo Strategies for Distributed Bayesian Computation" by offering our views on the current best methods for Bayesian computation, both at big-data scale and with smaller data sets, as summarized in Table 1. This table is certainly an over-simplification of a highly complicated area of research in constant (present and likely fu… ▽ More

    Submitted 14 December, 2017; v1 submitted 3 December, 2017; originally announced December 2017.

    Journal ref: Brazilian Journal of Probability and Statistics 31(4):686-691, 2017

  11. Pólya Urn Latent Dirichlet Allocation: a doubly sparse massively parallel sampler

    Authors: Alexander Terenin, Måns Magnusson, Leif Jonsson, David Draper

    Abstract: Latent Dirichlet Allocation (LDA) is a topic model widely used in natural language processing and machine learning. Most approaches to training the model rely on iterative algorithms, which makes it difficult to run LDA on big corpora that are best analyzed in parallel and distributed computational environments. Indeed, current approaches to parallel inference either don't converge to the correct… ▽ More

    Submitted 22 October, 2020; v1 submitted 11 April, 2017; originally announced April 2017.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 41(7):1709-1719, 2019

  12. GPU-accelerated Gibbs sampling: a case study of the Horseshoe Probit model

    Authors: Alexander Terenin, Shawfeng Dong, David Draper

    Abstract: Gibbs sampling is a widely used Markov chain Monte Carlo (MCMC) method for numerically approximating integrals of interest in Bayesian statistics and other mathematical sciences. Many implementations of MCMC methods do not extend easily to parallel computing environments, as their inherently sequential nature incurs a large synchronization cost. In the case study illustrated by this paper, we show… ▽ More

    Submitted 21 March, 2018; v1 submitted 15 August, 2016; originally announced August 2016.

    Journal ref: Statistics and Computing 29(2):301-310, 2019

  13. arXiv:1509.08999  [pdf, other

    stat.CO

    Asynchronous Gibbs Sampling

    Authors: Alexander Terenin, Daniel Simpson, David Draper

    Abstract: Gibbs sampling is a Markov Chain Monte Carlo (MCMC) method often used in Bayesian learning. MCMC methods can be difficult to deploy on parallel and distributed systems due to their inherently sequential nature. We study asynchronous Gibbs sampling, which achieves parallelism by simply ignoring sequential requirements. This method has been shown to produce good empirical results for some hierarchic… ▽ More

    Submitted 29 February, 2020; v1 submitted 29 September, 2015; originally announced September 2015.

    Journal ref: Artificial Intelligence and Statistics, 2020

  14. arXiv:1509.03940  [pdf, other

    stat.AP

    Causal Inference in Repeated Observational Studies: A Case Study of eBay Product Releases

    Authors: Vadim von Brzeski, Matt Taddy, David Draper

    Abstract: Causal inference in observational studies is notoriously difficult, due to the fact that the experimenter is not in charge of the treatment assignment mechanism. Many potential con- founding factors (PCFs) exist in such a scenario, and if one seeks to estimate the causal effect of the treatment on a response, one needs to control for such factors. Identifying all relevant PCFs may be difficult (or… ▽ More

    Submitted 13 September, 2015; originally announced September 2015.

  15. arXiv:1507.06597   

    math.ST stat.ME

    Cox's Theorem and the Jaynesian Interpretation of Probability

    Authors: Alexander Terenin, David Draper

    Abstract: There are multiple proposed interpretations of probability theory: one such interpretation is true-false logic under uncertainty. Cox's Theorem is a representation theorem that states, under a certain set of axioms describing the meaning of uncertainty, that every true-false logic under uncertainty is isomorphic to conditional probability theory. This result was used by Jaynes to develop a philoso… ▽ More

    Submitted 8 February, 2020; v1 submitted 23 July, 2015; originally announced July 2015.

    Comments: This work is withdrawn due to a critical error which we are unable to repair without completely changing the framework. The first author deeply regrets this error, which was committed when he was still obtaining his master's degree and had yet to learn a proper degree of carefulness needed when devising theoretical arguments

  16. arXiv:1412.8563  [pdf, other

    stat.AP

    A nonparametric Bayesian analysis of heterogeneous treatment effects in digital experimentation

    Authors: Matt Taddy, Matt Gardner, Liyun Chen, David Draper

    Abstract: Randomized controlled trials play an important role in how Internet companies predict the impact of policy decisions and product changes. In these `digital experiments', different units (people, devices, products) respond differently to the treatment. This article presents a fast and scalable Bayesian nonparametric analysis of such heterogeneous treatment effects and their measurement in relation… ▽ More

    Submitted 18 December, 2015; v1 submitted 29 December, 2014; originally announced December 2014.

  17. Power-Expected-Posterior Priors for Variable Selection in Gaussian Linear Models

    Authors: Dimitris Fouskakis, Ioannis Ntzoufras, David Draper

    Abstract: In the context of the expected-posterior prior (EPP) approach to Bayesian variable selection in linear models, we combine ideas from power-prior and unit-information-prior methodologies to simultaneously produce a minimally-informative prior and diminish the effect of training samples. The result is that in practice our power-expected-posterior (PEP) methodology is sufficiently insensitive to the… ▽ More

    Submitted 20 May, 2014; v1 submitted 9 July, 2013; originally announced July 2013.

    Journal ref: Bayesian Anal. Volume 10, Number 1 (2015), 75-107

  18. Bayesian variable selection using cost-adjusted BIC, with application to cost-effective measurement of quality of health care

    Authors: D. Fouskakis, I. Ntzoufras, D. Draper

    Abstract: In the field of quality of health care measurement, one approach to assessing patient sickness at admission involves a logistic regression of mortality within 30 days of admission on a fairly large number of sickness indicators (on the order of 100) to construct a sickness scale, employing classical variable selection methods to find an ``optimal'' subset of 10--20 indicators. Such ``benefit-onl… ▽ More

    Submitted 17 August, 2009; originally announced August 2009.

    Comments: Published in at http://dx.doi.org/10.1214/08-AOAS207 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS207

    Journal ref: Annals of Applied Statistics 2009, Vol. 3, No. 2, 663-690