Search | arXiv e-print repository

General multi-fidelity surrogate models: Framework and active learning strategies for efficient rare event simulation

Authors: Promit Chakroborty, Somayajulu L. N. Dhulipala, Yifeng Che, Wen Jiang, Benjamin W. Spencer, Jason D. Hales, Michael D. Shields

Abstract: Estimating the probability of failure for complex real-world systems using high-fidelity computational models is often prohibitively expensive, especially when the probability is small. Exploiting low-fidelity models can make this process more feasible, but merging information from multiple low-fidelity and high-fidelity models poses several challenges. This paper presents a robust multi-fidelity… ▽ More Estimating the probability of failure for complex real-world systems using high-fidelity computational models is often prohibitively expensive, especially when the probability is small. Exploiting low-fidelity models can make this process more feasible, but merging information from multiple low-fidelity and high-fidelity models poses several challenges. This paper presents a robust multi-fidelity surrogate modeling strategy in which the multi-fidelity surrogate is assembled using an active learning strategy using an on-the-fly model adequacy assessment set within a subset simulation framework for efficient reliability analysis. The multi-fidelity surrogate is assembled by first applying a Gaussian process correction to each low-fidelity model and assigning a model probability based on the model's local predictive accuracy and cost. Three strategies are proposed to fuse these individual surrogates into an overall surrogate model based on model averaging and deterministic/stochastic model selection. The strategies also dictate which model evaluations are necessary. No assumptions are made about the relationships between low-fidelity models, while the high-fidelity model is assumed to be the most accurate and most computationally expensive model. Through two analytical and two numerical case studies, including a case study evaluating the failure probability of Tristructural isotropic-coated (TRISO) nuclear fuels, the algorithm is shown to be highly accurate while drastically reducing the number of high-fidelity model calls (and hence computational cost). △ Less

Submitted 6 December, 2022; originally announced December 2022.

arXiv:2211.11115 [pdf, other]

Multifidelity Active Learning for Failure Estimation of TRISO Nuclear Fuel

Authors: Somayajulu L. N. Dhulipala, Promit Chakroborty, Michael D. Shields, Wen Jiang, Benjamin W. Spencer, Jason D. Hales

Abstract: The Tristructural isotropic (TRISO)-coated particle fuel is a robust nuclear fuel proposed to be used for multiple modern nuclear technologies. Therefore, characterizing its safety is vital for the reliable operation of nuclear technologies. However, the TRISO fuel failure probabilities are small and the computational model is time consuming to evaluate them using traditional Monte Carlo-type appr… ▽ More The Tristructural isotropic (TRISO)-coated particle fuel is a robust nuclear fuel proposed to be used for multiple modern nuclear technologies. Therefore, characterizing its safety is vital for the reliable operation of nuclear technologies. However, the TRISO fuel failure probabilities are small and the computational model is time consuming to evaluate them using traditional Monte Carlo-type approaches. In the paper, we present a multifidelity active learning approach to efficiently estimate small failure probabilities given an expensive computational model. Active learning suggests the next best training set for optimal subsequent predictive performance and multifidelity modeling uses cheaper low-fidelity models to approximate the high-fidelity model output. After presenting the multifidelity active learning approach, we apply it to efficiently predict TRISO failure probability and make comparisons to the reference results. △ Less

Submitted 20 November, 2022; originally announced November 2022.

arXiv:2201.02172 [pdf, other]

Reliability Estimation of an Advanced Nuclear Fuel using Coupled Active Learning, Multifidelity Modeling, and Subset Simulation

Authors: Somayajulu L. N. Dhulipala, Michael D. Shields, Promit Chakroborty, Wen Jiang, Benjamin W. Spencer, Jason D. Hales, Vincent M. Laboure, Zachary M. Prince, Chandrakanth Bolisetti, Yifeng Che

Abstract: Tristructural isotropic (TRISO)-coated particle fuel is a robust nuclear fuel and determining its reliability is critical for the success of advanced nuclear technologies. However, TRISO failure probabilities are small and the associated computational models are expensive. We used coupled active learning, multifidelity modeling, and subset simulation to estimate the failure probabilities of TRISO… ▽ More Tristructural isotropic (TRISO)-coated particle fuel is a robust nuclear fuel and determining its reliability is critical for the success of advanced nuclear technologies. However, TRISO failure probabilities are small and the associated computational models are expensive. We used coupled active learning, multifidelity modeling, and subset simulation to estimate the failure probabilities of TRISO fuels using several 1D and 2D models. With multifidelity modeling, we replaced expensive high-fidelity (HF) model evaluations with information fusion from two low-fidelity (LF) models. For the 1D TRISO models, we considered three multifidelity modeling strategies: only Kriging, Kriging LF prediction plus Kriging correction, and deep neural network (DNN) LF prediction plus Kriging correction. While the results across these multifidelity modeling strategies compared satisfactorily, strategies employing information fusion from two LF models consistently called the HF model least often. Next, for the 2D TRISO model, we considered two multifidelity modeling strategies: DNN LF prediction plus Kriging correction (data-driven) and 1D TRISO LF prediction plus Kriging correction (physics-based). The physics-based strategy, as expected, consistently required the fewest calls to the HF model. However, the data-driven strategy had a lower overall simulation time since the DNN predictions are instantaneous, and the 1D TRISO model requires a non-negligible simulation time. △ Less

Submitted 6 January, 2022; originally announced January 2022.

arXiv:2106.13790 [pdf, other]

doi 10.1016/j.jcp.2022.111506

Active Learning with Multifidelity Modeling for Efficient Rare Event Simulation

Authors: S. L. N. Dhulipala, M. D. Shields, B. W. Spencer, C. Bolisetti, A. E. Slaughter, V. M. Laboure, P. Chakroborty

Abstract: While multifidelity modeling provides a cost-effective way to conduct uncertainty quantification with computationally expensive models, much greater efficiency can be achieved by adaptively deciding the number of required high-fidelity (HF) simulations, depending on the type and complexity of the problem and the desired accuracy in the results. We propose a framework for active learning with multi… ▽ More While multifidelity modeling provides a cost-effective way to conduct uncertainty quantification with computationally expensive models, much greater efficiency can be achieved by adaptively deciding the number of required high-fidelity (HF) simulations, depending on the type and complexity of the problem and the desired accuracy in the results. We propose a framework for active learning with multifidelity modeling emphasizing the efficient estimation of rare events. Our framework works by fusing a low-fidelity (LF) prediction with an HF-inferred correction, filtering the corrected LF prediction to decide whether to call the high-fidelity model, and for enhanced subsequent accuracy, adapting the correction for the LF prediction after every HF model call. The framework does not make any assumptions as to the LF model type or its correlations with the HF model. In addition, for improved robustness when estimating smaller failure probabilities, we propose using dynamic active learning functions that decide when to call the HF model. We demonstrate our framework using several academic case studies and two finite element (FE) model case studies: estimating Navier-Stokes velocities using the Stokes approximation and estimating stresses in a transversely isotropic model subjected to displacements via a coarsely meshed isotropic model. Across these case studies, not only did the proposed framework estimate the failure probabilities accurately, but compared with either Monte Carlo or a standard variance reduction method, it also required only a small fraction of the calls to the HF model. △ Less

Submitted 25 June, 2021; originally announced June 2021.

arXiv:1907.10762 [pdf]

Fitting motion models to contextual player behavior

Authors: Bartholomew Spencer, Karl Jackson, Sam Robertson

Abstract: The objective of this study was to incorporate contextual information into the modelling of player movements. This was achieved by combining the distributions of forthcoming passing contests that players committed to and those they did not. The resultant array measures the probability a player would commit to forthcoming contests in their vicinity. Commitment-based motion models were fit on 46220… ▽ More The objective of this study was to incorporate contextual information into the modelling of player movements. This was achieved by combining the distributions of forthcoming passing contests that players committed to and those they did not. The resultant array measures the probability a player would commit to forthcoming contests in their vicinity. Commitment-based motion models were fit on 46220 samples of player behavior in the Australian Football League. It was found that the shape of commitment-based models differed greatly to displacement-based models for Australian footballers. Player commitment arrays were used to measure the spatial occupancy and dominance of the attacking team. The spatial characteristics of pass receivers were extracted for 2934 passes. Positional trends in passing were identified. Furthermore, passes were clustered into three components using Gaussian mixture models. Passes in the AFL are most commonly to one-on-one contests or unmarked players. Furthermore, passes were rarely greater than 25 m. △ Less

Submitted 24 July, 2019; originally announced July 2019.

Comments: 8 pages, 4 figures, IACSS 2019

arXiv:1811.08793 [pdf]

doi 10.1016/j.ymssp.2018.11.052

LQD-RKHS-based distribution-to-distribution regression methodology for restoring the probability distributions of missing SHM data

Authors: Zhicheng Chen, Yuequan Bao, Hui Li, Billie F. Spencer Jr

Abstract: Data loss is a critical problem in structural health monitoring (SHM). Probability distributions play a highly important role in many applications. Improving the quality of distribution estimations made using incomplete samples is highly important. Missing samples can be compensated for by applying conventional missing data restoration methods; however, ensuring that restored samples roughly follo… ▽ More Data loss is a critical problem in structural health monitoring (SHM). Probability distributions play a highly important role in many applications. Improving the quality of distribution estimations made using incomplete samples is highly important. Missing samples can be compensated for by applying conventional missing data restoration methods; however, ensuring that restored samples roughly follow underlying distributions of true missing data remains a challenge. Another strategy involves directly restoring the probability density function (PDF) for a sensor when samples are missing by leveraging distribution information from another sensor with complete data using distribution regression techniques; existing methods include the conventional distribution-to-distribution regression (DDR) and distribution-to-warping function regression (DWR) methods. Due to constraints on PDFs and warping functions, the regression functions of both methods are estimated from the Nadaraya-Watson kernel estimator with relatively low degrees of precision. This article proposes a new indirect distribution-to-distribution regression method in the context of functional data analysis for restoring distributions of missing SHM data. PDFs are transformed to ordinary functions residing in a Hilbert space via the newly proposed log-quantile-density (LQD) transformation; the regression for distributions is realized in the transformed space via a functional regression model constructed based on the theory of Reproducing Kernel Hilbert Space (RKHS), corresponding result is subsequently mapped back to the density space through the inverse LQD transformation. Test results using field monitoring data indicate that the new method significantly outperforms conventional methods in general cases; however, in extrapolation cases, the new method is inferior to the distribution-to-warping function regression method. △ Less

Submitted 8 December, 2018; v1 submitted 21 November, 2018; originally announced November 2018.

Comments: This is a manuscript. Readers are suggested to read the formal version published in the journal "Mechanical Systems and Signal Processing", https://doi.org/10.1016/j.ymssp.2018.11.052

Journal ref: Mechanical Systems and Signal Processing 2019;121:655-674

arXiv:1806.01757 [pdf, other]

Estimating Shortest Path Length Distributions via Random Walk Sampling

Authors: Minhui Zheng, Bruce D. Spencer

Abstract: In a network, the shortest paths between nodes are of great importance as they allow the fastest and strongest interaction between nodes. However measuring the shortest paths between all nodes in a large network is computationally expensive. In this paper we propose a method to estimate the shortest path length (SPL) distribution of a network by random walk sampling. To deal with the unequal inclu… ▽ More In a network, the shortest paths between nodes are of great importance as they allow the fastest and strongest interaction between nodes. However measuring the shortest paths between all nodes in a large network is computationally expensive. In this paper we propose a method to estimate the shortest path length (SPL) distribution of a network by random walk sampling. To deal with the unequal inclusion probabilities of dyads (pairs of nodes) in the sample, we generalize the usage of Hansen-Hurwitz estimator and Horvitz-Thompson estimator (and their ratio forms) and apply them to the sampled dyads. Based on theory of Markov chains we prove that the selection probability of a dyad is proportional to the product of the degrees of the two nodes. To approximate the actual SPL for a dyad, we use the observed SPL in the induced subgraph for networks with large degree variability, i.e., the standard deviation is at least two times of the mean, and for networks with small degree variability, estimate the SPL using landmarks for networks with small degree variability. By simulation studies and applications to real networks, we find that 1) for large networks, high estimation accuracy can be achieved by using a single random or multiple random walks with total number of steps equal to at least 20% of the nodes in the network; 2) the estimation performance increases as the network size increases but tends to stabilize when the network is large enough; 3) a single random walk performs as well as multiple random walks; 4) the Horvitz-Thompson ratio estimator performs best among the four estimators. △ Less

Submitted 9 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

arXiv:1305.4977 [pdf, ps, other]

doi 10.1214/14-AOAS800

Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks

Authors: Yaonan Zhang, Eric D. Kolaczyk, Bruce D. Spencer

Abstract: Networks are a popular tool for representing elements in a system and their interconnectedness. Many observed networks can be viewed as only samples of some true underlying network. Such is frequently the case, for example, in the monitoring and study of massive, online social networks. We study the problem of how to estimate the degree distribution - an object of fundamental interest - of a true… ▽ More Networks are a popular tool for representing elements in a system and their interconnectedness. Many observed networks can be viewed as only samples of some true underlying network. Such is frequently the case, for example, in the monitoring and study of massive, online social networks. We study the problem of how to estimate the degree distribution - an object of fundamental interest - of a true underlying network from its sampled network. In particular, we show that this problem can be formulated as an inverse problem. Playing a key role in this formulation is a matrix relating the expectation of our sampled degree distribution to the true underlying degree distribution. Under many network sampling designs, this matrix can be defined entirely in terms of the design and is found to be ill-conditioned. As a result, our inverse problem frequently is ill-posed. Accordingly, we offer a constrained, penalized weighted least-squares approach to solving this problem. A Monte Carlo variant of Stein's unbiased risk estimation (SURE) is used to select the penalization parameter. We explore the behavior of our resulting estimator of network degree distribution in simulation, using a variety of combinations of network models and sampling regimes. In addition, we demonstrate the ability of our method to accurately reconstruct the degree distributions of various sub-communities within online social networks corresponding to Friendster, Orkut and LiveJournal. Overall, our results show that the true degree distributions from both homogeneous and inhomogeneous networks can be recovered with substantially greater accuracy than reflected in the empirical degree distribution resulting from the original sampling. △ Less

Submitted 28 May, 2015; v1 submitted 21 May, 2013; originally announced May 2013.

Comments: Published at http://dx.doi.org/10.1214/14-AOAS800 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS800

Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 166-199

Showing 1–8 of 8 results for author: Spencer, B