Skip to main content

Showing 1–23 of 23 results for author: Paciorek, C J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2207.08649  [pdf, other

    stat.AP stat.ME

    Analyzing trends in precipitation patterns using Hidden Markov model stochastic weather generators

    Authors: Christopher J. Paciorek

    Abstract: We develop a flexible spline-based Bayesian hidden Markov model stochastic weather generator to statistically model daily precipitation over time by season at individual locations. The model naturally accounts for missing data (considered missing at random), avoiding potential sensitivity from systematic missingness patterns or from using arbitrary cutoffs to deal with missingness when computing m… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: 38 pages, 19 figures, 3 tables

  2. arXiv:2106.13359  [pdf, ps, other

    stat.CO

    A numerically stable online implementation and exploration of WAIC through variations of the predictive density, using NIMBLE

    Authors: Joshua E. Hug, Christopher J. Paciorek

    Abstract: We go through the process of crafting a robust and numerically stable online algorithm for the computation of the Watanabe-Akaike information criteria (WAIC). We implement this algorithm in the NIMBLE software. The implementation is performed in an online manner and does not require the storage in memory of the complete samples from the posterior distribution. This algorithm allows the user to spe… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: 27 pages, 9 tables. This is a preprint of the MA in Statistics thesis of Joshua Hug at the University of California, Berkeley, submitted May 2021

  3. arXiv:2101.11583  [pdf, other

    stat.ME stat.AP

    Computational strategies and estimation performance with Bayesian semiparametric Item Response Theory models

    Authors: Sally Paganin, Christopher J. Paciorek, Claudia Wehrhahn, Abel Rodriguez, Sophia Rabe-Hesketh, Perry de Valpine

    Abstract: Item response theory (IRT) models typically rely on a normality assumption for subject-specific latent traits, which is often unrealistic in practice. Semiparametric extensions based on Dirichlet process mixtures offer a more flexible representation of the unknown distribution of the latent trait. However, the use of such models in the IRT literature has been extremely limited, in good part becaus… ▽ More

    Submitted 10 August, 2022; v1 submitted 27 January, 2021; originally announced January 2021.

  4. Detected changes in precipitation extremes at their native scales derived from in situ measurements

    Authors: Mark D. Risser, Christopher J. Paciorek, Travis A. O'Brien, Michael F. Wehner, William D. Collins

    Abstract: The gridding of daily accumulated precipitation -- especially extremes -- from ground-based station observations is problematic due to the fractal nature of precipitation, and therefore estimates of long period return values and their changes based on such gridded daily data sets are generally underestimated. In this paper, we characterize high-resolution changes in observed extreme precipitation… ▽ More

    Submitted 13 August, 2019; v1 submitted 15 February, 2019; originally announced February 2019.

  5. arXiv:1807.04177  [pdf, other

    stat.AP physics.ao-ph

    A probabilistic gridded product for daily precipitation extremes over the United States

    Authors: Mark D. Risser, Christopher J. Paciorek, Michael F. Wehner, Travis A. O'Brien, William D. Collins

    Abstract: Gridded data products, for example interpolated daily measurements of precipitation from weather stations, are commonly used as a convenient substitute for direct observations because these products provide a spatially and temporally continuous and complete source of data. However, when the goal is to characterize climatological features of extreme precipitation over a spatial domain (e.g., a map… ▽ More

    Submitted 2 January, 2019; v1 submitted 11 July, 2018; originally announced July 2018.

  6. arXiv:1706.03388  [pdf, ps, other

    stat.ME

    Quantifying statistical uncertainty in the attribution of human influence on severe weather

    Authors: Christopher J. Paciorek, Dáithí A. Stone, Michael F. Wehner

    Abstract: Event attribution in the context of climate change seeks to understand the role of anthropogenic greenhouse gas emissions on extreme weather events, either specific events or classes of events. A common approach to event attribution uses climate model output under factual (real-world) and counterfactual (world that might have been without anthropogenic greenhouse gas emissions) scenarios to estima… ▽ More

    Submitted 3 February, 2018; v1 submitted 11 June, 2017; originally announced June 2017.

    Comments: 41 pages, 11 figures, 1 table

  7. arXiv:1703.10002  [pdf, other

    stat.AP

    Spatially-Dependent Multiple Testing Under Model Misspecification, with Application to Detection of Anthropogenic Influence on Extreme Climate Events

    Authors: Mark D. Risser, Christopher J. Paciorek, Daithi Stone

    Abstract: The Weather Risk Attribution Forecast (WRAF) is a forecasting tool that uses output from global climate models to make simultaneous attribution statements about whether and how greenhouse gas emissions have contributed to extreme weather across the globe. However, in conducting a large number of simultaneous hypothesis tests, the WRAF is prone to identifying false "discoveries." A common technique… ▽ More

    Submitted 14 November, 2017; v1 submitted 29 March, 2017; originally announced March 2017.

  8. arXiv:1703.06206  [pdf, other

    stat.CO

    Sequential Monte Carlo Methods in the nimble R Package

    Authors: Nicholas Michaud, Perry de Valpine, Daniel Turek, Christopher J. Paciorek, Dao Nguyen

    Abstract: nimble is an R package for constructing algorithms and conducting inference on hierarchical models. The nimble package provides a unique combination of flexible model specification and the ability to program model-generic algorithms. Specifically, the package allows users to code models in the BUGS language, and it allows users to write algorithms that can be applied to any appropriate model. In t… ▽ More

    Submitted 4 March, 2020; v1 submitted 17 March, 2017; originally announced March 2017.

    Comments: 32 pages. 5 figures. To be published in Journal of Statistical Software. The nimble R package is available on CRAN at https://CRAN.R-project.org/package=nimble

    MSC Class: 62-04 ACM Class: G.3

  9. Quantifying the effect of interannual ocean variability on the attribution of extreme climate events to human influence

    Authors: Mark D. Risser, Daithi A. Stone, Christopher J. Paciorek, Michael F. Wehner, Oliver Angelil

    Abstract: In recent years, the climate change research community has become highly interested in describing the anthropogenic influence on extreme weather events, commonly termed "event attribution." Limitations in the observational record and in computational resources motivate the use of uncoupled, atmosphere/land-only climate models with prescribed ocean conditions run over a short period, leading up to… ▽ More

    Submitted 28 September, 2016; v1 submitted 28 June, 2016; originally announced June 2016.

  10. Quantile-based bias correction and uncertainty quantification of extreme event attribution statements

    Authors: Soyoung Jeon, Christopher J. Paciorek, Michael F. Wehner

    Abstract: Extreme event attribution characterizes how anthropogenic climate change may have influenced the probability and magnitude of selected individual extreme weather and climate events. Attribution statements often involve quantification of the fraction of attributable risk (FAR) or the risk ratio (RR) and associated confidence intervals. Many such analyses use climate model output to characterize ext… ▽ More

    Submitted 12 February, 2016; originally announced February 2016.

    Comments: 28 pages, 4 figures, 3 tables

    Journal ref: Weather and Climate Extremes (2016) 12:24-32

  11. arXiv:1601.02698  [pdf, other

    stat.CO

    Efficient Markov Chain Monte Carlo Sampling for Hierarchical Hidden Markov Models

    Authors: Daniel Turek, Perry de Valpine, Christopher J. Paciorek

    Abstract: Traditional Markov chain Monte Carlo (MCMC) sampling of hidden Markov models (HMMs) involves latent states underlying an imperfect observation process, and generates posterior samples for top-level parameters concurrently with nuisance latent variables. When potentially many HMMs are embedded within a hierarchical model, this can result in prohibitively long MCMC runtimes. We study combinations of… ▽ More

    Submitted 11 January, 2016; originally announced January 2016.

  12. Statistically-estimated tree composition for the northeastern United States at the time of Euro-American settlement

    Authors: Christopher J. Paciorek, Simon J. Goring, Andrew L. Thurman, Charles V. Cogbill, John W. Williams, David J. Mladenoff, Jody A. Peters, Jun Zhu, Jason S. McLachlan

    Abstract: We present a gridded 8 km-resolution data product of the estimated composition of tree taxa at the time of Euro-American settlement of the northeastern United States and the statistical methodology used to produce the product from trees recorded by land surveyors. Composition is defined as the proportion of stems larger than approximately 20 cm diameter at breast height for 22 tree taxa, generally… ▽ More

    Submitted 3 April, 2016; v1 submitted 29 August, 2015; originally announced August 2015.

    Comments: 23 pages, 5 tables, 3 figures

    Journal ref: PLoS ONE (2016) 11(2): e0150087

  13. Programming with models: writing statistical algorithms for general model structures with NIMBLE

    Authors: Perry de Valpine, Daniel Turek, Christopher J. Paciorek, Clifford Anderson-Bergman, Duncan Temple Lang, Rastislav Bodik

    Abstract: We describe NIMBLE, a system for programming statistical algorithms for general model structures within R. NIMBLE is designed to meet three challenges: flexible model specification, a language for programming algorithms that can use different models, and a balance between high-level programmability and execution efficiency. For model specification, NIMBLE extends the BUGS language and creates mode… ▽ More

    Submitted 12 April, 2016; v1 submitted 19 May, 2015; originally announced May 2015.

    Comments: 20 pages, 2 figures

    Journal ref: Journal of Computational and Graphical Statistics (2017) 26: 403-413

  14. arXiv:1503.05621  [pdf, other

    stat.CO

    Automated Parameter Blocking for Efficient Markov-Chain Monte Carlo Sampling

    Authors: Daniel Turek, Perry de Valpine, Christopher J. Paciorek, Clifford Anderson-Bergman

    Abstract: Markov chain Monte Carlo (MCMC) sampling is an important and commonly used tool for the analysis of hierarchical models. Nevertheless, practitioners generally have two options for MCMC: utilize existing software that generates a black-box "one size fits all" algorithm, or the challenging (and time consuming) task of implementing a problem-specific MCMC algorithm. Either choice may result in ineffi… ▽ More

    Submitted 18 March, 2015; originally announced March 2015.

  15. Nonlinear predictive latent process models for integrating spatio-temporal exposure data from multiple sources

    Authors: Nikolay Bliznyuk, Christopher J. Paciorek, Joel Schwartz, Brent Coull

    Abstract: Spatio-temporal prediction of levels of an environmental exposure is an important problem in environmental epidemiology. Our work is motivated by multiple studies on the spatio-temporal distribution of mobile source, or traffic related, particles in the greater Boston area. When multiple sources of exposure information are available, a joint model that pools information across sources maximizes da… ▽ More

    Submitted 13 November, 2014; originally announced November 2014.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOAS737 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS737

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 3, 1538-1560

  16. Bayesian Estimation of Population-Level Trends in Measures of Health Status

    Authors: Mariel M. Finucane, Christopher J. Paciorek, Goodarz Danaei, Majid Ezzati

    Abstract: Improving health worldwide will require rigorous quantification of population-level trends in health status. However, global-level surveys are not available, forcing researchers to rely on fragmentary country-specific data of varying quality. We present a Bayesian model that systematically combines disparate data to make country-, region- and global-level estimates of time trends in important heal… ▽ More

    Submitted 19 May, 2014; originally announced May 2014.

    Comments: Published in at http://dx.doi.org/10.1214/13-STS427 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS427

    Journal ref: Statistical Science 2014, Vol. 29, No. 1, 18-25

  17. Parallelizing Gaussian Process Calculations in R

    Authors: Christopher J. Paciorek, Benjamin Lipshitz, Wei Zhuo, Prabhat, Cari G. Kaufman, Rollin C. Thomas

    Abstract: We consider parallel computation for Gaussian process calculations to overcome computational and memory constraints on the size of datasets that can be analyzed. Using a hybrid parallelization approach that uses both threading (shared memory) and message-passing (distributed memory), we implement the core linear algebra operations used in spatial statistics and Gaussian process regression in an R… ▽ More

    Submitted 21 May, 2013; originally announced May 2013.

    Comments: 21 pages, 8 figures

    Journal ref: Journal of Statistical Software 2015, Vol. 63, Number 10, 1-23

  18. Semiparametric Bayesian Density Estimation with Disparate Data Sources: A Meta-Analysis of Global Childhood Undernutrition

    Authors: Mariel M. Finucane, Christopher J. Paciorek, Gretchen A. Stevens, Majid Ezzati

    Abstract: Undernutrition, resulting in restricted growth, and quantified here using height-for-age z-scores, is an important contributor to childhood morbidity and mortality. Since all levels of mild, moderate and severe undernutrition are of clinical and public health importance, it is of interest to estimate the shape of the z-scores' distributions. We present a finite normal mixture model that uses dat… ▽ More

    Submitted 28 June, 2014; v1 submitted 22 January, 2013; originally announced January 2013.

    Comments: 41 total pages, 6 figures, 1 table

    Journal ref: Journal of the American Statistical Association (2015) 110: 889-901

  19. Measurement error in two-stage analyses, with application to air pollution epidemiology

    Authors: Adam A. Szpiro, Christopher J. Paciorek

    Abstract: Public health researchers often estimate health effects of exposures (e.g., pollution, diet, lifestyle) that cannot be directly measured for study subjects. A common strategy in environmental epidemiology is to use a first-stage (exposure) model to estimate the exposure based on covariates and/or spatio-temporal proximity and to use predictions from the exposure model as the covariate of interest… ▽ More

    Submitted 30 June, 2013; v1 submitted 27 October, 2012; originally announced October 2012.

    Comments: 35 pages, 4 figures, 2 tables

    Journal ref: Environmetrics (2013) 24: 501-517

  20. Spatial models for point and areal data using Markov random fields on a fine grid

    Authors: Christopher J. Paciorek

    Abstract: I consider the use of Markov random fields (MRFs) on a fine grid to represent latent spatial processes when modeling point-level and areal data, including situations with spatial misalignment. Point observations are related to the grid cell in which they reside, while areal observations are related to the (approximate) integral over the latent process within the area of interest. I review several… ▽ More

    Submitted 6 April, 2013; v1 submitted 26 April, 2012; originally announced April 2012.

    Comments: 26 pages, 10 figures

    Journal ref: Electronic Journal of Statistics, Vol. 7 (2013) 946-972

  21. The Importance of Scale for Spatial-Confounding Bias and Precision of Spatial Regression Estimators

    Authors: Christopher J. Paciorek

    Abstract: Residuals in regression models are often spatially correlated. Prominent examples include studies in environmental epidemiology to understand the chronic health effects of pollutants. I consider the effects of residual spatial structure on the bias and precision of regression coefficients, developing a simple framework in which to understand the key issues and derive informative analytic results.… ▽ More

    Submitted 4 November, 2010; originally announced November 2010.

    Comments: Published in at http://dx.doi.org/10.1214/10-STS326 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS326

    Journal ref: Statistical Science 2010, Vol. 25, No. 1, 107-125

  22. arXiv:1008.2218  [pdf, other

    stat.ME stat.AP stat.CO

    Combining spatial information sources while accounting for systematic errors in proxies

    Authors: Christopher J. Paciorek

    Abstract: Environmental research increasingly uses high-dimensional remote sensing and numerical model output to help fill space-time gaps between traditional observations. Such output is often a noisy proxy for the process of interest. Thus one needs to separate and assess the signal and noise (often called discrepancy) in the proxy given complicated spatio-temporal dependencies. Here I extend a popular tw… ▽ More

    Submitted 13 September, 2011; v1 submitted 12 August, 2010; originally announced August 2010.

    Comments: 5 figures, 2 tables

    Journal ref: Journal of the Royal Statistical Society, Series C (Applied Statistics) 61: 429-451 (2012)

  23. Practical large-scale spatio-temporal modeling of particulate matter concentrations

    Authors: Christopher J. Paciorek, Jeff D. Yanosky, Robin C. Puett, Francine Laden, Helen H. Suh

    Abstract: The last two decades have seen intense scientific and regulatory interest in the health effects of particulate matter (PM). Influential epidemiological studies that characterize chronic exposure of individuals rely on monitoring data that are sparse in space and time, so they often assign the same exposure to participants in large geographic areas and across time. We estimate monthly PM during 1… ▽ More

    Submitted 8 June, 2009; originally announced June 2009.

    Comments: Published in at http://dx.doi.org/10.1214/08-AOAS204 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS204

    Journal ref: Annals of Applied Statistics 2009, Vol. 3, No. 1, 370-397