Showing 1–28 of 28 results for author: Stephens, D A

  1. arXiv:2410.01151  [pdf, other]

    stat.AP

    The Effects of Air Pollution on Health: A Longitudinal Study of Los Angeles County Accounting for Measurement Error

    Authors: Yanfei Qu, David A. Stephens

    Abstract: This study develops a Bayesian hierarchical model to explore the effects of air pollution on respiratory and cardiovascular mortality in Los Angeles County. The model takes into account various pollutants such as PM2.5, PM10, CO, SO2, NO2 and O3, as well as a related meteorological factor: temperature. The objective is to identify the significant factors affecting selected health outcomes without…

    Submitted 27 January, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

  2. arXiv:2402.08877  [pdf, other]

    stat.ME

    Computational Considerations for the Linear Model of Coregionalization

    Authors: Renaud Alie, David A. Stephens, Alexandra M. Schmidt

    Abstract: In the last two decades, the linear model of coregionalization (LMC) has been widely used to model multivariate spatial processes. However, it can be a challenging task to conduct likelihood-based inference for such models because of the cubic cost associated with Gaussian likelihood evaluations. Starting from an analogy with matrix normal models, we propose a reformulation of the LMC likelihood t…

    Submitted 2 December, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2306.11908  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Generalized Random Forests using Fixed-Point Trees

    Authors: David Fleischer, David A. Stephens, Archer Y. Yang

    Abstract: We propose a computationally efficient alternative to generalized random forests (GRFs) for estimating heterogeneous effects in large dimensions. While GRFs rely on a gradient-based splitting criterion, which in large dimensions is computationally expensive and unstable, our method introduces a fixed-point approximation that eliminates the need for Jacobian estimation. This gradient-free approach…

    Submitted 16 June, 2025; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 44 pages, 17 figures

  4. arXiv:2304.12548  [pdf, other]

    stat.ME

    The impact of directly observed therapy on the efficacy of Tuberculosis treatment: A Bayesian multilevel approach

    Authors: Widemberg S. Nobre, Alexandra M. Schmidt, Erica E. M. Moodie, David A. Stephens

    Abstract: We propose and discuss a Bayesian procedure to estimate the average treatment effect (ATE) for multilevel observations in the presence of confounding. We focus on situations where the confounders may be latent (e.g., spatial latent effects). This work is motivated by an interest in determining the causal impact of directly observed therapy (DOT) on the successful treatment of Tuberculosis (TB); th…

    Submitted 24 April, 2023; originally announced April 2023.

  5. arXiv:2303.15281  [pdf, other]

    stat.ME

    Bayesian inference for optimal dynamic treatment regimes in practice

    Authors: Daniel Rodriguez Duque, Erica E. M. Moodie, David A. Stephens

    Abstract: In this work, we examine recently developed methods for Bayesian inference of optimal dynamic treatment regimes (DTRs). DTRs are a set of treatment decision rules aimed at tailoring patient care to patient-specific characteristics, thereby falling within the realm of precision medicine. In this field, researchers seek to tailor therapy with the intention of improving health outcomes; therefore, th…

    Submitted 27 March, 2023; originally announced March 2023.

  6. arXiv:2303.08735  [pdf, other]

    stat.ME stat.AP

    A Bayesian Non-Stationary Heteroskedastic Time Series Model for Multivariate Critical Care Data

    Authors: Zayd Omar, David A. Stephens, Alexandra M. Schmidt, David L. Buckeridge

    Abstract: We propose a multivariate GARCH model for non-stationary health time series by modifying the variance of the observations of the standard state space model. The proposed model provides an intuitive way of dealing with heteroskedastic data using the conditional nature of state space models. We follow the Bayesian paradigm to perform the inference procedure. In particular, we use Markov chain Monte…

    Submitted 15 March, 2023; originally announced March 2023.

  7. arXiv:2301.03710  [pdf, other]

    stat.ME

    A time-dependent Poisson-Gamma model for recruitment forecasting in multicenter studies

    Authors: Armando Turchetta, Nicolas Savy, David A. Stephens, Erica E. M. Moodie, Marina B. Klein

    Abstract: Forecasting recruitments is a key component of the monitoring phase of multicenter studies. One of the most popular techniques in this field is the Poisson-Gamma recruitment model, a Bayesian technique built on a doubly stochastic Poisson process. This approach is based on the modeling of enrollments as a Poisson process where the recruitment rates are assumed to be constant over time and to follo…

    Submitted 9 January, 2023; originally announced January 2023.

  8. arXiv:2204.09862  [pdf, ps, other]

    stat.ME

    Targeting functional parameters with semiparametric Bayesian inference

    Authors: Vivian Y. Meng, David A. Stephens

    Abstract: Typical Bayesian inference requires parameter identification via likelihood parameterization, which has invited criticism for being less flexible than the Frequentist framework and subject to misspecification. Though misspecification may be avoided by functional parameter inference under a nonparametric model space, there does not exist a flexible Bayesian semiparametric model that would allow ful…

    Submitted 25 November, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  9. arXiv:2204.02231  [pdf, ps, other]

    stat.OT

    Causal inference: critical developments, past and future

    Authors: Erica EM Moodie, David A Stephens

    Abstract: Causality is a subject of philosophical debate and a central scientific issue with a long history. In the statistical domain, the study of cause and effect based on the notion of 'fairness' in comparisons dates back several hundred years, and yet statistical concepts and developments that form the area of causal inference are only decades old. In this paper, we review core tenets and methods of ca…

    Submitted 5 April, 2022; originally announced April 2022.

  10. arXiv:2203.06743  [pdf, other]

    stat.ME

    Bayesian Analysis of Sigmoidal Gaussian Cox Processes via Data Augmentation

    Authors: Renaud Alie, David A. Stephens, Alexandra M. Schmidt

    Abstract: Many models for point process data are defined through a thinning procedure where locations of a base process (often Poisson) are either kept (observed) or discarded (thinned). In this paper, we go back to the fundamentals of the distribution theory for point processes to establish a link between the base thinning mechanism and the joint density of thinned and observed locations in any of such mod…

    Submitted 10 December, 2024; v1 submitted 13 March, 2022; originally announced March 2022.

  11. arXiv:2201.12831  [pdf, ps, other]

    stat.ME

    Causal inference under mis-specification: adjustment based on the propensity score

    Authors: David A. Stephens, Widemberg S. Nobre, Erica E. M. Moodie, Alexandra M. Schmidt

    Abstract: We study Bayesian approaches to causal inference via propensity score regression. Much of the Bayesian literature on propensity score methods has relied on approaches that cannot be viewed as fully Bayesian in the context of conventional 'likelihood times prior' posterior inference; in addition, most methods rely on parametric and distributional assumptions, and presumed correct specification. We…

    Submitted 30 January, 2022; originally announced January 2022.

  12. arXiv:2108.01041  [pdf, ps, other]

    stat.ME

    Bayesian Sample Size Calculations for SMART Studies

    Authors: Armando Turchetta, Erica E. M. Moodie, David A. Stephens, Sylvie D. Lambert

    Abstract: In the management of most chronic conditions characterized by the lack of universally effective treatments, adaptive treatment strategies (ATSs) have been growing in popularity as they offer a more individualized approach, and sequential multiple assignment randomized trials (SMARTs) have gained attention as the most suitable clinical trial design to formalize the study of these strategies. While…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Main article 16 pages, 3 figures, 2 tables. Appendix 11 pages, 10 tables. Submitted to Biometrics

  13. Bayesian inference for continuous-time hidden Markov models with an unknown number of states

    Authors: Yu Luo, David A. Stephens

    Abstract: We consider the modeling of data generated by a latent continuous-time Markov jump process with a state space of finite but unknown dimensions. Typically in such models, the number of states has to be pre-specified, and Bayesian inference for a fixed number of states has not been studied until recently. In addition, although approaches to address the problem for discrete-time models have been deve…

    Submitted 20 June, 2021; originally announced June 2021.

    MSC Class: 62F15; 62M05; 60J27

    Journal ref: Statistics and Computing (2021), 31

  14. arXiv:2105.12259  [pdf, other]

    stat.ME

    Estimation of Optimal Dynamic Treatment Regimes using Gaussian Process Emulation

    Authors: Daniel Rodriguez Duque, David A. Stephens, Erica E. M. Moodie

    Abstract: In precision medicine, identifying optimal sequences of decision rules, termed dynamic treatment regimes (DTRs), is an important undertaking. One approach investigators may take to infer about optimal DTRs is via Bayesian dynamic Marginal Structural Models (MSMs). These models represent the expected outcome under adherence to a DTR for DTRs in a family indexed by a parameter $\psi$; the function map…

    Submitted 7 June, 2022; v1 submitted 25 May, 2021; originally announced May 2021.

  15. arXiv:2103.12293  [pdf, other]

    math.OC cs.LG stat.ML

    Stochastic Reweighted Gradient Descent

    Authors: Ayoub El Hanchi, David A. Stephens

    Abstract: Despite the strong theoretical guarantees that variance-reduced finite-sum optimization algorithms enjoy, their applicability remains limited to cases where the memory overhead they introduce (SAG/SAGA), or the periodic full gradient computation they require (SVRG/SARAH) are manageable. A promising approach to achieving variance reduction while avoiding these drawbacks is the use of importance sam…

    Submitted 23 March, 2021; originally announced March 2021.

  16. arXiv:2103.12243  [pdf, other]

    cs.LG math.OC stat.ML

    Adaptive Importance Sampling for Finite-Sum Optimization and Sampling with Decreasing Step-Sizes

    Authors: Ayoub El Hanchi, David A. Stephens

    Abstract: Reducing the variance of the gradient estimator is known to improve the convergence rate of stochastic gradient-based optimization and sampling algorithms. One way of achieving variance reduction is to design importance sampling strategies. Recently, the problem of designing such schemes was formulated as an online learning problem with bandit feedback, and algorithms with sub-linear static regret…

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: Advances in Neural Information Processing Systems, Dec 2020, Vancouver, Canada

  17. arXiv:2103.04086  [pdf, other]

    stat.ME

    Assessing the validity of Bayesian inference using loss functions

    Authors: Yu Luo, David A. Stephens, Daniel J. Graham, Emma J. McCoy

    Abstract: In the usual Bayesian setting, a full probabilistic model is required to link the data and parameters, and the form of this model and the inference and prediction mechanisms are specified via de Finetti's representation. In general, such a formulation is not robust to model mis-specification of its component parts. An alternative approach is to draw inference based on loss functions, where the qua…

    Submitted 9 February, 2023; v1 submitted 6 March, 2021; originally announced March 2021.

  18. arXiv:2006.01799  [pdf, ps, other]

    stat.ME math.ST

    The role of exchangeability in causal inference

    Authors: Olli Saarela, David A. Stephens, Erica E. M. Moodie

    Abstract: Though the notion of exchangeability has been discussed in the causal inference literature under various guises, it has rarely taken its original meaning as a symmetry property of probability distributions. As this property is a standard component of Bayesian inference, we argue that in Bayesian causal inference it is natural to link the causal model, including the notion of confounding and defini…

    Submitted 15 December, 2022; v1 submitted 2 June, 2020; originally announced June 2020.

    Journal ref: Statistical Science. 2023 Aug; 38(3): 369-385

  19. arXiv:1906.10252  [pdf, other]

    stat.ME stat.AP stat.CO

    Bayesian Clustering for Continuous-Time Hidden Markov Models

    Authors: Yu Luo, David A. Stephens, David L. Buckeridge

    Abstract: We develop clustering procedures for longitudinal trajectories based on a continuous-time hidden Markov model (CTHMM) and a generalized linear observation model. Specifically in this paper, we carry out finite and infinite mixture model-based clustering for a CTHMM and achieve inference using Markov chain Monte Carlo (MCMC). For a finite mixture model with prior on the number of components, we imp…

    Submitted 26 March, 2021; v1 submitted 24 June, 2019; originally announced June 2019.

    MSC Class: 62F15; 91C20

    Journal ref: Canadian Journal of Statistics (2021)

  20. arXiv:1904.09394  [pdf, other]

    math.ST stat.AP

    Estimating Sparse Networks with Hubs

    Authors: Annaliza McGillivray, Abbas Khalili, David A. Stephens

    Abstract: Graphical modelling techniques based on sparse selection have been applied to infer complex networks in many fields, including biology and medicine, engineering, finance, and social sciences. One structural feature of some of the networks in such applications that poses a challenge for statistical inference is the presence of a small number of strongly interconnected nodes in a network which are c…

    Submitted 1 March, 2020; v1 submitted 19 April, 2019; originally announced April 2019.

    MSC Class: 62H12; 62F12; 62J07

  21. arXiv:1708.09443  [pdf, other]

    stat.AP q-bio.QM

    Transmission clusters in the HIV-1 epidemic among men who have sex with men in Montreal, Quebec, Canada

    Authors: Luc Villandré, Aurélie Labbe, Ruxandra-Ilinca Ibanescu, Bluma Brenner, Michel Roger, David A Stephens

    Abstract: Background. Several studies have used phylogenetics to investigate Human Immunodeficiency Virus (HIV) transmission among Men who have Sex with Men (MSMs) in Montreal, Quebec, Canada, revealing many transmission clusters. The Quebec HIV genotyping program sequence database now includes viral sequences from close to 4,000 HIV-positive individuals classified as MSMs. In this paper, we investigate clu…

    Submitted 30 August, 2017; originally announced August 2017.

  22. arXiv:1708.02648  [pdf, ps, other]

    stat.ME

    DM-PhyClus: A Bayesian phylogenetic algorithm for infectious disease transmission cluster inference

    Authors: Luc Villandré, Aurélie Labbe, Bluma Brenner, Michel Roger, David A. Stephens

    Abstract: Background. Conventional phylogenetic clustering approaches rely on arbitrary cutpoints applied a posteriori to phylogenetic estimates. Although in practice, Bayesian and bootstrap-based clustering tend to lead to similar estimates, they often produce conflicting measures of confidence in clusters. The current study proposes a new Bayesian phylogenetic clustering algorithm, which we refer to as DM…

    Submitted 8 August, 2017; originally announced August 2017.

  23. arXiv:1707.08354  [pdf, other]

    stat.AP q-bio.PE

    A hierarchical Bayesian model for predicting ecological interactions using scaled evolutionary relationships

    Authors: Mohamad Elmasri, Maxwell J. Farrell, T. Jonathan Davies, David A. Stephens

    Abstract: Identifying undocumented or potential future interactions among species is a challenge facing modern ecologists. Recent link prediction methods rely on trait data, however large species interaction databases are typically sparse and covariates are limited to only a fraction of species. On the other hand, evolutionary relationships, encoded as phylogenetic trees, can act as proxies for underlying t…

    Submitted 19 September, 2019; v1 submitted 26 July, 2017; originally announced July 2017.

    Comments: To appear in the Annals of Applied Statistics

  24. arXiv:1704.08229  [pdf, ps, other]

    stat.ME

    Generalized G-estimation and Model Selection

    Authors: M. P. Wallace, E. E. M. Moodie, D. A. Stephens

    Abstract: Dynamic treatment regimes (DTRs) aim to formalize personalized medicine by tailoring treatment decisions to individual patient characteristics. G-estimation for DTR identification targets the parameters of a structural nested mean model known as the blip function from which the optimal DTR is derived. Despite considerable work deriving such estimation methods, there has been little focus on extend…

    Submitted 26 April, 2017; originally announced April 2017.

  25. A Bayesian view of doubly robust causal inference

    Authors: Olli Saarela, Léo R. Belzile, David A. Stephens

    Abstract: In causal inference confounding may be controlled either through regression adjustment in an outcome model, or through propensity score adjustment or inverse probability of treatment weighting, or both. The latter approaches, which are based on modelling of the treatment assignment mechanism and their doubly robust extensions have been difficult to motivate using formal Bayesian arguments, in prin…

    Submitted 15 January, 2017; originally announced January 2017.

    Comments: Author's original version. 21 pages, including supplementary material

    MSC Class: 62F15

    Journal ref: Biometrika (2016), 103 (3): 667-681

  26. Two-sample Bayesian Nonparametric Hypothesis Testing

    Authors: Chris C. Holmes, François Caron, Jim E. Griffin, David A. Stephens

    Abstract: In this article we describe Bayesian nonparametric procedures for two-sample hypothesis testing. Namely, given two sets of samples $\mathbf{y}^{(1)} \stackrel{iid}{\sim} F^{(1)}$ and $\mathbf{y}^{(2)} \stackrel{iid}{\sim} F^{(2)}$, with…

    Submitted 11 May, 2015; v1 submitted 27 October, 2009; originally announced October 2009.

    Comments: Published at http://dx.doi.org/10.1214/14-BA914 in the Bayesian Analysis (http://projecteuclid.org/euclid.ba) by the International Society of Bayesian Analysis (http://bayesian.org/)

    Report number: VTeX-BA-BA914

    Journal ref: Bayesian Analysis 2015, Vol. 10, No. 2, 297-320

  27. arXiv:0711.0186  [pdf, ps, other]

    stat.CO

    Population-Based Reversible Jump Markov Chain Monte Carlo

    Authors: Ajay Jasra, David A. Stephens, Chris C. Holmes

    Abstract: In this paper we present an extension of population-based Markov chain Monte Carlo (MCMC) to the trans-dimensional case. One of the main challenges in MCMC-based inference is that of simulating from high and trans-dimensional target measures. In such cases, MCMC methods may not adequately traverse the support of the target; the simulation results will be unreliable. We develop population methods…

    Submitted 1 November, 2007; originally announced November 2007.

  28. arXiv:0709.0139  [pdf, ps, other]

    stat.ME stat.AP

    Non-Regular Likelihood Inference for Seasonally Persistent Processes

    Authors: Emma J. McCoy, Sofia C. Olhede, David A. Stephens

    Abstract: The estimation of parameters in the frequency spectrum of a seasonally persistent stationary stochastic process is addressed. For seasonal persistence associated with a pole in the spectrum located away from frequency zero, a new Whittle-type likelihood is developed that explicitly acknowledges the location of the pole. This Whittle likelihood is a large sample approximation to the distribution…

    Submitted 2 September, 2007; originally announced September 2007.

    Comments: 57 pages, including 5 figures