Showing 1–50 of 54 results for author: Wikle, C K

Searching in archive stat.
  1. arXiv:2504.02960  [pdf, other]

    stat.OT

    An Anytime Valid Test for Complete Spatial Randomness

    Authors: Vaidehi Dixit, Christopher K. Wikle, Scott H. Holan

    Abstract: A relevant question when analyzing spatial point patterns is that of spatial randomness. More specifically, before any model can be fit to a point pattern a first step is to test the data for departures from complete spatial randomness (CSR). Traditional techniques employ distance- or quadrat-count-based methods to test for CSR based on batched data. In this paper, we consider the practical scenar…

    Submitted 3 April, 2025; originally announced April 2025.

  2. arXiv:2502.12334  [pdf, other]

    stat.ME

    Inference for Log-Gaussian Cox Point Processes using Bayesian Deep Learning: Application to Human Oral Microbiome Image Data

    Authors: Shuwan Wang, Christopher K. Wikle, Athanasios C. Micheas, Jessica L. Mark Welch, Jacqueline R. Starr, Kyu Ha Lee

    Abstract: It is common in nature to see aggregation of objects in space. Exploring the mechanism associated with the locations of such clustered observations can be essential to understanding the phenomenon, such as the source of spatial heterogeneity, or comparison to other event generating processes in the same domain. Log-Gaussian Cox processes (LGCPs) represent an important class of models for quantifyi…

    Submitted 18 March, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  3. arXiv:2502.04685  [pdf, other]

    physics.flu-dyn stat.AP stat.ML

    Capturing Extreme Events in Turbulence using an Extreme Variational Autoencoder (xVAE)

    Authors: Likun Zhang, Kiran Bhaganagar, Christopher K. Wikle

    Abstract: Turbulent flow fields are characterized by extreme events that are statistically intermittent and carry a significant amount of energy and physical importance. To emulate these flows, we introduce the extreme variational autoencoder (xVAE), which embeds a max-infinitely divisible process with heavy-tailed distributions into a standard VAE framework, enabling accurate modeling of extreme events. xV…

    Submitted 7 February, 2025; originally announced February 2025.

  4. arXiv:2410.10641  [pdf, other]

    cs.LG stat.ME

    Echo State Networks for Spatio-Temporal Area-Level Data

    Authors: Zhenhua Wang, Scott H. Holan, Christopher K. Wikle

    Abstract: Spatio-temporal area-level datasets play a critical role in official statistics, providing valuable insights for policy-making and regional planning. Accurate modeling and forecasting of these datasets can be extremely useful for policymakers to develop informed strategies for future planning. Echo State Networks (ESNs) are efficient methods for capturing nonlinear temporal dynamics and generating…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 23 pages, 4 figures

  5. arXiv:2410.09673  [pdf, other]

    stat.AP

    Incorporating Asymmetric Loss for Real Estate Prediction with Area-level Spatial Data

    Authors: Vaidehi Dixit, Scott H. Holan, Christopher K. Wikle

    Abstract: We investigate two asymmetric loss functions, namely LINEX loss and power divergence loss for optimal spatial prediction with area-level data. With our motivation arising from the real estate industry, namely in real estate valuation, we use the Zillow Home Value Index (ZHVI) for county-level values to show the change in prediction when the loss is different (asymmetric) from a traditional squared…

    Submitted 12 October, 2024; originally announced October 2024.

  6. arXiv:2406.17729  [pdf, other]

    physics.ao-ph cs.LG stat.ML

    Uncertainty-enabled machine learning for emulation of regional sea-level change caused by the Antarctic Ice Sheet

    Authors: Myungsoo Yoo, Giri Gopalan, Matthew J. Hoffman, Sophie Coulson, Holly Kyeore Han, Christopher K. Wikle, Trevor Hillebrand

    Abstract: Projecting sea-level change in various climate-change scenarios typically involves running forward simulations of the Earth's gravitational, rotational and deformational (GRD) response to ice mass change, which requires high computational cost and time. Here we build neural-network emulators of sea-level change at 27 coastal locations, due to the GRD effects associated with future Antarctic Ice Sh…

    Submitted 21 June, 2024; originally announced June 2024.

  7. arXiv:2312.12287  [pdf, other]

    stat.ME

    A Criterion for Aggregation Error for Multivariate Spatial Data

    Authors: Ranadeep Daw, Jonathan R. Bradley, Christopher K. Wikle, Scott H. Holan

    Abstract: The criterion for aggregation error (CAGE) is an important metric that aims to measure errors that arise in multiscale (or multi-resolution) spatial data, referred to as the modifiable areal unit problem and the ecological fallacy. Specifically, CAGE is a measure of between scale variance of eigenvectors in a Karhunen-Loève expansion (KLE), motivated by a theoretical result, referred to as the ``n…

    Submitted 10 February, 2025; v1 submitted 19 December, 2023; originally announced December 2023.

  8. arXiv:2308.04391  [pdf, other]

    physics.ao-ph stat.AP

    Calibrated Forecasts of Quasi-Periodic Climate Processes with Deep Echo State Networks and Penalized Quantile Regression

    Authors: Matthew Bonas, Christopher K. Wikle, Stefano Castruccio

    Abstract: Among the most relevant processes in the Earth system for human habitability are quasi-periodic, ocean-driven multi-year events whose dynamics are currently incompletely characterized by physical models, and hence poorly predictable. This work aims at showing how 1) data-driven, stochastic machine learning approaches provide an affordable yet flexible means to forecast these processes; 2) the asso…

    Submitted 8 August, 2023; originally announced August 2023.

  9. arXiv:2307.08079  [pdf, other]

    stat.ML cs.LG stat.ME

    Flexible and efficient emulation of spatial extremes processes via variational autoencoders

    Authors: Likun Zhang, Xiaoyu Ma, Christopher K. Wikle, Raphaël Huser

    Abstract: Many real-world processes have complex tail dependence structures that cannot be characterized using classical Gaussian processes. More flexible spatial extremes models exhibit appealing extremal dependence properties but are often exceedingly prohibitive to fit and simulate from in high dimensions. In this paper, we aim to push the boundaries on computation and modeling of high-dimensional spatia…

    Submitted 18 December, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: 30 pages, 8 figures

    MSC Class: 68T07 (Primary); 60G70; 62H11 (Secondary)

  10. arXiv:2306.04696  [pdf, other]

    stat.AP

    Bayesian Ensemble Echo State Networks for Enhancing Binary Stochastic Cellular Automata

    Authors: Nicholas Grieshop, Christopher K. Wikle

    Abstract: Binary spatio-temporal data are common in many application areas. Such data can be considered from many perspectives, including via deterministic or stochastic cellular automata, where local rules govern the transition probabilities that describe the evolution of the 0 and 1 states across space and time. One implementation of a stochastic cellular automaton for such data is with a spatio-temporal g…

    Submitted 7 June, 2023; originally announced June 2023.

  11. arXiv:2306.03214  [pdf, other]

    stat.AP

    Data-Driven Modeling of Wildfire Spread with Stochastic Cellular Automata and Latent Spatio-Temporal Dynamics

    Authors: Nicholas Grieshop, Christopher K. Wikle

    Abstract: We propose a Bayesian stochastic cellular automata modeling approach to model the spread of wildfires with uncertainty quantification. The model considers a dynamic neighborhood structure that allows neighbor states to inform transition probabilities in a multistate categorical model. Additional spatial information is captured by the use of a temporally evolving latent spatio-temporal dynamic proc…

    Submitted 5 June, 2023; originally announced June 2023.

  12. arXiv:2302.04960  [pdf, ps, other]

    stat.ME

    Using Echo State Networks to Inform Physical Models for Fire Front Propagation

    Authors: Myungsoo Yoo, Christopher K. Wikle

    Abstract: Wildfires can be devastating, causing significant damage to property, ecosystem disruption, and loss of life. Forecasting the evolution of wildfire boundaries is essential to real-time wildfire management. To this end, substantial attention in the wildfire literature has focused on the level set method, which effectively represents complicated boundaries and their change over time. Nevertheless, m…

    Submitted 9 February, 2023; originally announced February 2023.

  13. arXiv:2211.09797  [pdf, other]

    stat.ME stat.AP

    Bayesian Hierarchical Models For Multi-type Survey Data Using Spatially Correlated Covariates Measured With Error

    Authors: Saikat Nandy, Scott H. Holan, Jonathan R. Bradley, Christopher K. Wikle

    Abstract: We introduce Bayesian hierarchical models for predicting high-dimensional tabular survey data which can be distributed from one or multiple classes of distributions (e.g., Gaussian, Poisson, Binomial, etc.). We adopt a Bayesian implementation of a Hierarchical Generalized Transformation (HGT) model to deal with the non-conjugacy of non-Gaussian data models when estimated using a Latent Gaussian Pr…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 28 pages, 8 figures

  14. arXiv:2211.04682  [pdf]

    stat.CO

    REDS: Random Ensemble Deep Spatial prediction

    Authors: Ranadeep Daw, Christopher K. Wikle

    Abstract: There has been a great deal of recent interest in the development of spatial prediction algorithms for very large datasets and/or prediction domains. These methods have primarily been developed in the spatial statistics community, but there has been growing interest in the machine learning community for such methods, primarily driven by the success of deep Gaussian process regression approaches an…

    Submitted 8 November, 2022; originally announced November 2022.

  15. arXiv:2210.14978  [pdf, ps, other]

    stat.ME

    A Bayesian Spatio-Temporal Level Set Dynamic Model and Application to Fire Front Propagation

    Authors: Myungsoo Yoo, Christopher K. Wikle

    Abstract: Intense wildfires impact nature, humans, and society, causing catastrophic damage to property and the ecosystem, as well as the loss of life. Forecasting wildfire front propagation is essential in order to support fire fighting efforts and plan evacuations. The level set method has been widely used to analyze the change in surfaces, shapes, and boundaries. In particular, a signed distance function…

    Submitted 26 October, 2022; originally announced October 2022.

  16. arXiv:2210.10663  [pdf, other]

    stat.ME

    A Review of Data-Driven Discovery for Dynamic Systems

    Authors: Joshua S. North, Christopher K. Wikle, Erin M. Schliep

    Abstract: Many real-world scientific processes are governed by complex nonlinear dynamic systems that can be represented by differential equations. Recently, there has been increased interest in learning, or discovering, the forms of the equations driving these complex nonlinear dynamic systems using data-driven approaches. In this paper we review the current literature on data-driven discovery for dynamic s…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 38 pages, 1 figure, 1 table

  17. arXiv:2209.02750  [pdf, other]

    stat.ME

    A Bayesian Approach for Spatio-Temporal Data-Driven Dynamic Equation Discovery

    Authors: Joshua S. North, Christopher K. Wikle, Erin M. Schliep

    Abstract: Differential equations based on physical principles are used to represent complex dynamic systems in all fields of science and engineering. Through repeated use in both academics and industry, these equations have been shown to represent real-world dynamics well. Since the true dynamics of these complex systems are generally unknown, learning the governing equations can improve our understanding o…

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: 41 pages, 10 Tables, 5 Figures

  18. arXiv:2206.02218  [pdf, other]

    stat.ML cs.LG stat.ME

    Statistical Deep Learning for Spatial and Spatio-Temporal Data

    Authors: Christopher K. Wikle, Andrew Zammit-Mangion

    Abstract: Deep neural network models have become ubiquitous in recent years, and have been applied to nearly all areas of science, engineering, and industry. These models are particularly useful for data that have strong dependencies in space (e.g., images) and time (e.g., sequences). Indeed, deep models have also been extensively used by the statistical community to model spatial and spatio-temporal data t…

    Submitted 5 June, 2022; originally announced June 2022.

    Comments: 27 pages, 1 figure

  19. arXiv:2109.09949  [pdf, other]

    stat.AP

    A Bayesian Hidden Semi-Markov Model with Covariate-Dependent State Duration Parameters for High-Frequency Environmental Data

    Authors: Shirley Rojas-Salazar, Erin M. Schliep, Christopher K. Wikle, Emily H. Stanley, Stephen R. Carpenter, Noah R. Lottig

    Abstract: Environmental time series data observed at high frequencies can be studied with approaches such as hidden Markov and semi-Markov models (HMM and HSMM). HSMMs extend the HMM by explicitly modeling the time spent in each state. In a discrete-time HSMM, the duration in each state can be modeled with a zero-truncated Poisson distribution, where the duration parameter may be state-specific but constant…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2010.10739

  20. arXiv:2108.12354  [pdf, other]

    stat.ME

    Correcting spatial Gaussian process parameter and prediction variance estimation under informative sampling

    Authors: Erin M. Schliep, Christopher K. Wikle, Ranadeep Daw

    Abstract: Informative sampling designs can impact spatial prediction, or kriging, in two important ways. First, the sampling design can bias spatial covariance parameter estimation, which in turn can bias spatial kriging estimates. Second, even with unbiased estimates of the spatial covariance parameters, since the kriging variance is a function of the observation locations, these estimates will vary based…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: 21 pages, 7 figures

  21. arXiv:2010.10739  [pdf, other]

    stat.AP

    A Bayesian Hidden Semi-Markov Model with Covariate-Dependent State Duration Parameters for High-Frequency Data from Wearable Devices

    Authors: Shirley Rojas-Salazar, Erin M. Schliep, Christopher K. Wikle, Matthew Hawkey

    Abstract: Data collected by wearable devices in sports provide valuable information about an athlete's behavior such as their activity, performance, and ability. These time series data can be studied with approaches such as hidden Markov and semi-Markov models (HMM and HSMM) for varied purposes including activity recognition and event detection. HSMMs extend the HMM by explicitly modeling the time spent in…

    Submitted 20 October, 2020; originally announced October 2020.

  22. arXiv:2010.03985  [pdf, other]

    stat.ME stat.AP

    A higher-order singular value decomposition tensor emulator for spatio-temporal simulators

    Authors: Giri Gopalan, Christopher K. Wikle

    Abstract: We introduce methodology to construct an emulator for environmental and ecological spatio-temporal processes that uses the higher order singular value decomposition (HOSVD) as an extension of singular value decomposition (SVD) approaches to emulation. Some important advantages of the method are that it allows for the use of a combination of supervised learning methods (e.g., random forests and Gau…

    Submitted 12 July, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Journal ref: Journal of Agricultural, Biological, and Environmental Statistics 2021

  23. arXiv:2009.04003  [pdf, other]

    cs.LG stat.ML

    Bayesian Inverse Reinforcement Learning for Collective Animal Movement

    Authors: Toryn L. J. Schafer, Christopher K. Wikle, Mevin B. Hooten

    Abstract: Agent-based methods allow for defining simple rules that generate complex group behaviors. The governing rules of such models are typically set a priori and parameters are tuned from observed behavior trajectories. Instead of making simplifying assumptions across all anticipated scenarios, inverse reinforcement learning provides inference on the short-term (local) rules governing long term behavio…

    Submitted 11 June, 2022; v1 submitted 8 September, 2020; originally announced September 2020.

  24. arXiv:2003.06924  [pdf, other]

    stat.AP

    On the spatial and temporal shift in the archetypal seasonal temperature cycle as driven by annual and semi-annual harmonics

    Authors: Joshua S. North, Erin M. Schliep, Christopher K. Wikle

    Abstract: Statistical methods are required to evaluate and quantify the uncertainty in environmental processes, such as land and sea surface temperature, in a changing climate. Typically, annual harmonics are used to characterize the variation in the seasonal temperature cycle. However, an often overlooked feature of the climate seasonal cycle is the semi-annual harmonic, which can account for a significant…

    Submitted 15 March, 2020; originally announced March 2020.

    Comments: 25 pages, 11 color figures

  25. arXiv:1910.13524  [pdf, other]

    stat.ML cs.LG stat.ME

    Deep Integro-Difference Equation Models for Spatio-Temporal Forecasting

    Authors: Andrew Zammit-Mangion, Christopher K. Wikle

    Abstract: Integro-difference equation (IDE) models describe the conditional dependence between the spatial process at a future time point and the process at the present time point through an integral operator. Nonlinearity or temporal dependence in the dynamics is often captured by allowing the operator parameters to vary temporally, or by re-fitting a model with a temporally-invariant linear operator in a…

    Submitted 27 January, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: 22 pages, 10 figures

  26. A Bayesian Markov model with Pólya-Gamma sampling for estimating individual behavior transition probabilities from accelerometer classifications

    Authors: Toryn L. J. Schafer, Christopher K. Wikle, Jay A. VonBank, Bart M. Ballard, Mitch D. Weegman

    Abstract: The use of accelerometers in wildlife tracking provides a fine-scale data source for understanding animal behavior and decision-making. Current methods in movement ecology focus on behavior as a driver of movement mechanisms. Our Markov model is a flexible and efficient method for inference related to effects on behavior that considers dependence between current and past behaviors. We applied this…

    Submitted 19 May, 2020; v1 submitted 7 August, 2019; originally announced August 2019.

  27. Spatio-Temporal Change of Support Modeling with R

    Authors: Andrew M. Raim, Scott H. Holan, Jonathan R. Bradley, Christopher K. Wikle

    Abstract: Spatio-temporal change of support methods are designed for statistical analysis on spatial and temporal domains which can differ from those of the observed data. Previous work introduced a parsimonious class of Bayesian hierarchical spatio-temporal models, which we refer to as STCOS, for the case of Gaussian outcomes. Application of STCOS methodology from this literature requires a level of profic…

    Submitted 2 July, 2020; v1 submitted 26 April, 2019; originally announced April 2019.

  28. arXiv:1902.08321  [pdf, other]

    stat.ML cs.LG

    Comparison of Deep Neural Networks and Deep Hierarchical Models for Spatio-Temporal Data

    Authors: Christopher K. Wikle

    Abstract: Spatio-temporal data are ubiquitous in the agricultural, ecological, and environmental sciences, and their study is important for understanding and predicting a wide variety of processes. One of the difficulties with modeling spatial processes that change in time is the complexity of the dependence structures that must describe how such a process varies, and the presence of high-dimensional comple…

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: 26 pages, including 6 figures and references

  29. arXiv:1812.03555  [pdf, ps, other]

    stat.ME

    Spatio-Temporal Models for Big Multinomial Data using the Conditional Multivariate Logit-Beta Distribution

    Authors: Jonathan R. Bradley, Christopher K. Wikle, Scott H. Holan

    Abstract: We introduce a Bayesian approach for analyzing high-dimensional multinomial data that are referenced over space and time. In particular, the proportions associated with multinomial data are assumed to have a logit link to a latent spatio-temporal mixed effects model. This strategy allows for covariances that are nonstationary in both space and time, asymmetric, and parsimonious. We also introduc…

    Submitted 9 December, 2018; originally announced December 2018.

  30. arXiv:1811.08472  [pdf, other]

    stat.ME stat.AP

    A Hierarchical Spatio-Temporal Statistical Model Motivated by Glaciology

    Authors: Giri Gopalan, Birgir Hrafnkelsson, Christopher K. Wikle, Håvard Rue, Guðfinna Aðalgeirsdóttir, Alexander H. Jarosch, Finnur Pálsson

    Abstract: In this paper, we extend and analyze a Bayesian hierarchical spatio-temporal model for physical systems. A novelty is to model the discrepancy between the output of a computer simulator for a physical process and the actual process values with a multivariate random walk. For computational efficiency, linear algebra for bandwidth limited matrices is utilized, and first-order emulator inference allo…

    Submitted 4 June, 2019; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: Revision accepted for publication by the Journal of Agricultural, Biological, and Environmental Statistics

  31. arXiv:1806.10728  [pdf, other]

    stat.ML cs.LG

    Deep Echo State Networks with Uncertainty Quantification for Spatio-Temporal Forecasting

    Authors: Patrick L. McDermott, Christopher K. Wikle

    Abstract: Long-lead forecasting for spatio-temporal systems can often entail complex nonlinear dynamics that are difficult to specify a priori. Current statistical methodologies for modeling these processes are often highly parameterized and thus, challenging to implement from a computational perspective. One potential parsimonious solution to this problem is a method from the dynamical systems and engin…

    Submitted 3 September, 2018; v1 submitted 27 June, 2018; originally announced June 2018.

  32. arXiv:1802.02626  [pdf, other]

    stat.ME

    Interpolating Population Distributions using Public-use Data: An Application to Income Segregation using American Community Survey Data

    Authors: Matthew Simpson, Scott H. Holan, Christopher K. Wikle, Jonathan R. Bradley

    Abstract: Income segregation measures the extent to which households choose to live near other households with similar incomes. Sociologists theorize that income segregation can exacerbate the impacts of income inequality, and have developed indices to measure it at the metro area level, including the information theory index introduced in \citet{reardon2011income}, and the divergence index presented in \ci…

    Submitted 23 November, 2021; v1 submitted 7 February, 2018; originally announced February 2018.

  33. arXiv:1711.00636  [pdf, other]

    stat.ME stat.ML

    Bayesian Recurrent Neural Network Models for Forecasting and Quantifying Uncertainty in Spatial-Temporal Data

    Authors: Patrick L. McDermott, Christopher K. Wikle

    Abstract: Recurrent neural networks (RNNs) are nonlinear dynamical models commonly used in the machine learning and dynamical systems literature to represent complex dynamical or sequential relationships between variables. More recently, as deep learning models have become more common, RNNs have been used to forecast increasingly complicated systems. Dynamical spatio-temporal processes represent a class of…

    Submitted 6 February, 2018; v1 submitted 2 November, 2017; originally announced November 2017.

  34. arXiv:1708.05094  [pdf, other]

    stat.ML stat.AP

    An Ensemble Quadratic Echo State Network for Nonlinear Spatio-Temporal Forecasting

    Authors: Patrick L. McDermott, Christopher K. Wikle

    Abstract: Spatio-temporal data and processes are prevalent across a wide variety of scientific disciplines. These processes are often characterized by nonlinear time dynamics that include interactions across multiple scales of spatial and temporal variability. The data sets associated with many of these processes are increasing in size due to advances in automated data measurement, management, and numerical…

    Submitted 16 August, 2017; originally announced August 2017.

  35. Ensemble Kalman methods for high-dimensional hierarchical dynamic space-time models

    Authors: Matthias Katzfuss, Jonathan R. Stroud, Christopher K. Wikle

    Abstract: We propose a new class of filtering and smoothing methods for inference in high-dimensional, nonlinear, non-Gaussian, spatio-temporal state-space models. The main idea is to combine the ensemble Kalman filter and smoother, developed in the geophysics literature, with state-space algorithms from the statistics literature. Our algorithms address a variety of estimation scenarios, including on-line a…

    Submitted 8 August, 2018; v1 submitted 23 April, 2017; originally announced April 2017.

    Journal ref: Journal of the American Statistical Association, Theory & Methods (2019+)

  36. arXiv:1701.07506  [pdf, ps, other]

    stat.ME

    Bayesian Hierarchical Models with Conjugate Full-Conditional Distributions for Dependent Data from the Natural Exponential Family

    Authors: Jonathan R. Bradley, Scott H. Holan, Christopher K. Wikle

    Abstract: We introduce a Bayesian approach for analyzing (possibly) high-dimensional dependent data that are distributed according to a member from the natural exponential family of distributions. This problem requires extensive methodological advancements, as jointly modeling high-dimensional dependent data leads to the so-called "big n problem." The computational complexity of the "big n problem" is furth…

    Submitted 17 April, 2019; v1 submitted 25 January, 2017; originally announced January 2017.

  37. arXiv:1701.04485  [pdf, other]

    stat.ME

    A Hierarchical Spatio-Temporal Analog Forecasting Model for Count Data

    Authors: Patrick L. McDermott, Christopher K. Wikle, Joshua Millspaugh

    Abstract: 1. Analog forecasting has been successful at producing robust forecasts for a variety of ecological and physical processes. Analog forecasting is a mechanism-free nonlinear method that forecasts a system forward in time by examining how past states deemed similar to the current state moved forward. Previous work on analog forecasting has typically been presented in an empirical or heuristic contex…

    Submitted 16 January, 2017; originally announced January 2017.

  38. arXiv:1611.03835  [pdf, other]

    stat.ME stat.CO

    A Bayesian adaptive ensemble Kalman filter for sequential state and parameter estimation

    Authors: Jonathan R. Stroud, Matthias Katzfuss, Christopher K. Wikle

    Abstract: This paper proposes new methodology for sequential state and parameter estimation within the ensemble Kalman filter. The method is fully Bayesian and propagates the joint posterior density of states and parameters over time. In order to implement the method we consider two representations of the marginal posterior distribution of the parameters: a grid-based approach and a Gaussian approximation.…

    Submitted 11 November, 2016; originally announced November 2016.

    Comments: 19 pages

  39. arXiv:1512.07273  [pdf, ps, other]

    stat.ME

    Computationally Efficient Distribution Theory for Bayesian Inference of High-Dimensional Dependent Count-Valued Data

    Authors: Jonathan R. Bradley, Scott H. Holan, Christopher K. Wikle

    Abstract: We introduce a Bayesian approach for multivariate spatio-temporal prediction for high-dimensional count-valued data. Our primary interest is when there are possibly millions of data points referenced over different variables, geographic regions, and times. This problem requires extensive methodological advancements, as jointly modeling correlated data of this size leads to the so-called "big n pro…

    Submitted 22 December, 2015; originally announced December 2015.

  40. arXiv:1508.01451  [pdf, ps, other]

    stat.ME

    Spatio-Temporal Change of Support with Application to American Community Survey Multi-Year Period Estimates

    Authors: Jonathan R. Bradley, Christopher K. Wikle, Scott H. Holan

    Abstract: We present hierarchical Bayesian methodology to perform spatio-temporal change of support (COS) for survey data with Gaussian sampling errors. This methodology is motivated by the American Community Survey (ACS), which is an ongoing survey administered by the U.S. Census Bureau that provides timely information on several key demographic variables. The ACS has published 1-year, 3-year, and 5-year p…

    Submitted 24 August, 2015; v1 submitted 6 August, 2015; originally announced August 2015.

  41. Generating Partially Synthetic Geocoded Public Use Data with Decreased Disclosure Risk Using Differential Smoothing

    Authors: Harrison Quick, Scott H. Holan, Christopher K. Wikle

    Abstract: When collecting geocoded confidential data with the intent to disseminate, agencies often resort to altering the geographies prior to making data publicly available due to data privacy obligations. An alternative to releasing aggregated and/or perturbed data is to release multiply-imputed synthetic data, where sensitive values are replaced with draws from statistical models designed to capture imp…

    Submitted 20 July, 2015; originally announced July 2015.

    Journal ref: Journal of the Royal Statistical Society Series A, 181 (2018), 649-661

  42. arXiv:1506.06169  [pdf, other]

    stat.ME stat.AP

    A Model-Based Approach for Analog Spatio-Temporal Dynamic Forecasting

    Authors: Patrick L. McDermott, Christopher K. Wikle

    Abstract: Analog forecasting has been applied in a variety of fields for predicting future states of complex nonlinear systems that require flexible forecasting methods. Past analog methods have almost exclusively been used in an empirical framework without the structure of a model-based approach. We propose a Bayesian model framework for analog forecasting, building upon previous analog methods but accou…

    Submitted 12 February, 2016; v1 submitted 19 June, 2015; originally announced June 2015.

  43. Bayesian binomial mixture models for estimating abundance in ecological monitoring studies

    Authors: Guohui Wu, Scott H. Holan, Charles H. Nilon, Christopher K. Wikle

    Abstract: Investigation of species abundance has become a vital component of many ecological monitoring studies. The primary objective of these studies is to understand how specific species are distributed across the study domain, as well as quantification of the sampling efficiency for detecting these species. To achieve these goals, preselected locations are sampled during scheduled visits, in which the n…

    Submitted 11 May, 2015; originally announced May 2015.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS801 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS801

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 1-26

  44. Multivariate spatio-temporal models for high-dimensional areal data with application to Longitudinal Employer-Household Dynamics

    Authors: Jonathan R. Bradley, Scott H. Holan, Christopher K. Wikle

    Abstract: Many data sources report related variables of interest that are also referenced over geographic regions and time; however, there are relatively few general statistical methods that one can readily use that incorporate these multivariate spatio-temporal dependencies. Additionally, many multivariate spatio-temporal areal data sets are extremely high dimensional, which leads to practical issues when… ▽ More

    Submitted 29 January, 2016; v1 submitted 3 March, 2015; originally announced March 2015.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS862 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: substantial text overlap with arXiv:1407.7479

    Report number: IMS-AOAS-AOAS862

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 4, 1761-1791

  45. arXiv:1502.01974  [pdf, ps, other

    stat.ME

    Regionalization of Multiscale Spatial Processes using a Criterion for Spatial Aggregation Error

    Authors: Jonathan R. Bradley, Christopher K. Wikle, Scott H. Holan

    Abstract: The modifiable areal unit problem and the ecological fallacy are known problems that occur when modeling multiscale spatial processes. We investigate how these forms of spatial aggregation error can guide a regionalization over a spatial domain of interest. By "regionalization" we mean a specification of geographies that define the spatial support for areal data. This topic has been studied vigoro… ▽ More

    Submitted 10 December, 2015; v1 submitted 6 February, 2015; originally announced February 2015.

  46. arXiv:1408.2757  [pdf, ps, other

    stat.ME

    Bayesian Lattice Filters for Time-Varying Autoregression and Time-Frequency Analysis

    Authors: Wen-Hsi Yang, Scott H. Holan, Christopher K. Wikle

    Abstract: Modeling nonstationary processes is of paramount importance to many scientific disciplines including environmental science, ecology, and finance, among others. Consequently, flexible methodology that provides accurate estimation across a wide range of processes is a subject of ongoing interest. We propose a novel approach to model-based time-frequency estimation using time-varying autoregressive m… ▽ More

    Submitted 12 August, 2014; originally announced August 2014.

    Comments: 49 pages, 16 figures

  47. Bayesian Marked Point Process Modeling for Generating Fully Synthetic Public Use Data with Point-Referenced Geography

    Authors: Harrison Quick, Scott H. Holan, Christopher K. Wikle, Jerome P. Reiter

    Abstract: Many data stewards collect confidential data that include fine geography. When sharing these data with others, data stewards strive to disseminate data that are informative for a wide range of spatial and non-spatial analyses while simultaneously protecting the confidentiality of data subjects' identities and attributes. Typically, data stewards meet this challenge by coarsening the resolution of… ▽ More

    Submitted 29 July, 2014; originally announced July 2014.

    Journal ref: Spatial Statistics, 14 (2015), 439-451

  48. arXiv:1407.7479  [pdf, ps, other

    stat.ME

    Mixed Effects Modeling for Areal Data that Exhibit Multivariate-Spatio-Temporal Dependencies

    Authors: Jonathan R. Bradley, Scott H. Holan, Christopher K. Wikle

    Abstract: There are many data sources available that report related variables of interest that are also referenced over geographic regions and time; however, there are relatively few general statistical methods that one can readily use that incorporate these multivariate-spatio-temporal dependencies. As such, we introduce the multivariate-spatio-temporal mixed effects model (MSTM) to analyze areal data with… ▽ More

    Submitted 4 September, 2014; v1 submitted 28 July, 2014; originally announced July 2014.

  49. arXiv:1405.7227  [pdf, ps, other

    stat.AP

    Bayesian Spatial Change of Support for Count-Valued Survey Data

    Authors: Jonathan R. Bradley, Christopher K. Wikle, Scott H. Holan

    Abstract: We introduce Bayesian spatial change of support methodology for count-valued survey data with known survey variances. Our proposed methodology is motivated by the American Community Survey (ACS), an ongoing survey administered by the U.S. Census Bureau that provides timely information on several key demographic variables. Specifically, the ACS produces 1-year, 3-year, and 5-year "period-estimates,… ▽ More

    Submitted 28 October, 2014; v1 submitted 28 May, 2014; originally announced May 2014.

  50. arXiv:1405.3880  [pdf, ps, other

    stat.ME

    Bayesian Semiparametric Hierarchical Empirical Likelihood Spatial Models

    Authors: Aaron T. Porter, Scott H. Holan, Christopher K. Wikle

    Abstract: We introduce a general hierarchical Bayesian framework that incorporates a flexible nonparametric data model specification through the use of empirical likelihood methodology, which we term semiparametric hierarchical empirical likelihood (SHEL) models. Although general dependence structures can be readily accommodated, we focus on spatial modeling, a relatively underdeveloped area in the empirica… ▽ More

    Submitted 15 May, 2014; originally announced May 2014.

    Comments: 29 pages, 3 figures