-
Monitoring Deforestation Using Multivariate Bayesian Online Changepoint Detection with Outliers
Authors:
Laura J. Wendelberger,
Josh M. Gray,
Brian J. Reich,
Alyson G. Wilson
Abstract:
Near-real-time change detection is important for a variety of Earth monitoring applications and remains a high priority for remote sensing science. Data sparsity, subtle changes, seasonal trends, and the presence of outliers make detecting actual landscape changes challenging. Adams and MacKay (2007) introduced Bayesian Online Changepoint Detection (BOCPD), a computationally efficient, exact Bayesian method for change detection. Incorporation of prior information allows for relaxed dependence on dense data and an extensive stable period, making this method applicable to relatively short time series and multiple changepoint detection. In this paper we conduct BOCPD with a multivariate linear regression framework that supports seasonal trends. We introduce a mechanism to make BOCPD robust against occasional outliers without compromising either the computational efficiency of an exact posterior change distribution or the detection latency. We show via simulations that the method effectively detects change in the presence of outliers. The method is then applied to monitor deforestation in Myanmar, where we show superior performance compared to current online changepoint detection methods.
Submitted 27 December, 2021; v1 submitted 23 December, 2021;
originally announced December 2021.
-
Adaptively Sampling via Regional Variance-Based Sensitivities
Authors:
Brian W. Bush,
Joanne Wendelberger,
Rebecca Hanes
Abstract:
Inspired by the well-established variance-based methods for global sensitivity analysis, we develop a local total sensitivity index that decomposes the global total sensitivity conditions by independent variables' values. We employ this local sensitivity index in a new method of experimental design that sequentially and adaptively samples the domain of a multivariate function according to local contributions to the global variance. The method is demonstrated both on a nonlinear illustrative example that has a three-dimensional domain and a three-dimensional codomain and on a complex, high-dimensional simulation for assessing the industrial viability of the production of bioproducts from biomass.
Submitted 20 July, 2021;
originally announced July 2021.
-
Selecting Diverse Models for Scientific Insight
Authors:
Laura J. Wendelberger,
Brian J. Reich,
Alyson G. Wilson
Abstract:
Model selection often aims to choose a single model, assuming that the form of the model is correct. However, there may be multiple possible underlying explanatory patterns in a set of predictors that could explain a response. Model selection without regard for model uncertainty can fail to bring these patterns to light. We explore multi-model penalized regression (MMPR) to acknowledge model uncertainty in the context of penalized regression. We examine how different penalty settings can promote either shrinkage or sparsity of coefficients in separate models. The method is tuned to explicitly limit model similarity. A choice of penalty form that enforces variable selection is applied to predict stacking fault energy (SFE) from steel alloy composition. The aim is to identify multiple models with different subsets of covariates that explain a single type of response.
Submitted 15 December, 2021; v1 submitted 16 June, 2020;
originally announced June 2020.
-
Partitioning a Large Simulation as It Runs
Authors:
Kary Myers,
Earl Lawrence,
Michael Fugate,
Claire McKay Bowen,
Lawrence Ticknor,
Jon Woodring,
Joanne Wendelberger,
Jim Ahrens
Abstract:
As computer simulations continue to grow in size and complexity, they present a particularly challenging class of big data problems. Many application areas are moving toward exascale computing systems, systems that perform $10^{18}$ FLOPS (FLoating-point Operations Per Second) --- a billion billion calculations per second. Simulations at this scale can generate output that exceeds both the storage capacity and the bandwidth available for transfer to storage, making post-processing and analysis challenging. One approach is to embed some analyses in the simulation while the simulation is running --- a strategy often called in situ analysis --- to reduce the need for transfer to storage. Another strategy is to save only a reduced set of time steps rather than the full simulation. Typically the selected time steps are evenly spaced, where the spacing can be defined by the budget for storage and transfer. This paper combines both of these ideas to introduce an online in situ method for identifying a reduced set of time steps of the simulation to save. Our approach significantly reduces the data transfer and storage requirements, and it provides improved fidelity to the simulation to facilitate post-processing and reconstruction. We illustrate the method using a computer simulation that supported NASA's 2009 Lunar Crater Observation and Sensing Satellite mission.
Submitted 23 September, 2015; v1 submitted 2 September, 2014;
originally announced September 2014.