-
Global Vulnerability Assessment of Mobile Telecommunications Infrastructure to Climate Hazards using Crowdsourced Open Data
Authors:
Edward J. Oughton,
Tom Russell,
Jeongjin Oh,
Sara Ballan,
Jim W. Hall
Abstract:
The ongoing change in Earth's climate is causing an increase in the frequency and severity of climate-related hazards, for example, from coastal flooding, riverine flooding, and tropical cyclones. There is currently an urgent need to quantify the potential impacts of these events on infrastructure and users, especially for hitherto neglected infrastructure sectors, such as telecommunications, particularly given our increasing dependence on digital technologies. In this analysis, a global assessment is undertaken, quantifying the number of mobile cells vulnerable to climate hazards using open crowdsourced data equating to 7.6 million 2G, 3G, 4G and 5G assets. For a 0.01% annual probability event under a high emissions scenario (RCP8.5), the number of affected cells is estimated at 2.26 million for tropical cyclones, equating to USD 1.01 billion in direct damage (an increase against the historical baseline of 14% and 44%, respectively). Equally, for coastal flooding the number of potentially affected cells for an event with a 0.01% annual probability under RCP8.5 is 109.9 thousand, equating to direct damage costs of USD 2.69 billion (an increase against the baseline of 70% and 78%, respectively). The findings demonstrate the need for risk analysts to include mobile communications (and telecommunications more broadly) in future critical national infrastructure assessments. Indeed, this paper contributes a proven assessment methodology to the literature for use in future research for assessing this critical infrastructure sector.
Submitted 7 November, 2023;
originally announced November 2023.
-
Predicting Malaria Incidence Using Artificial Neural Networks and Disaggregation Regression
Authors:
Jack A. Hall,
Tim C. D. Lucas
Abstract:
Disaggregation modelling is a method of predicting disease risk at high resolution using aggregated response data. High resolution disease mapping is an important public health tool to aid the optimisation of resources, and is commonly used in assisting responses to diseases such as malaria. Current disaggregation regression methods are slow, inflexible, and do not easily allow non-linear terms.
Neural networks may offer a solution to the limitations of current disaggregation methods. This project aimed to design a neural network which mimics the behaviour of disaggregation, then benchmark it against current methods for accuracy, flexibility and speed.
Cross-validation and nested cross-validation were used to test neural networks against traditional disaggregation for accuracy, and execution speed was also measured.
Neural networks did not improve on the accuracy of current disaggregation methods, although they did show an improvement in execution time. The neural network models are more flexible and offer potential for further improvements on all metrics. The R package 'Kedis' (Keras-Disaggregation) is introduced as a user-friendly method of implementing neural network disaggregation models.
Submitted 17 April, 2023;
originally announced April 2023.
-
Mutual Information for Explainable Deep Learning of Multiscale Systems
Authors:
Søren Taverniers,
Eric J. Hall,
Markos A. Katsoulakis,
Daniel M. Tartakovsky
Abstract:
Timely completion of design cycles for complex systems ranging from consumer electronics to hypersonic vehicles relies on rapid simulation-based prototyping. The latter typically involves high-dimensional spaces of possibly correlated control variables (CVs) and quantities of interest (QoIs) with non-Gaussian and possibly multimodal distributions. We develop a model-agnostic, moment-independent global sensitivity analysis (GSA) that relies on differential mutual information to rank the effects of CVs on QoIs. The data requirements of this information-theoretic approach to GSA are met by replacing computationally intensive components of the physics-based model with a deep neural network surrogate. Subsequently, the GSA is used to explain the network predictions, and the surrogate is deployed to close design loops. Viewed as an uncertainty quantification method for interrogating the surrogate, this framework is compatible with a wide variety of black-box models. We demonstrate that the surrogate-driven mutual information GSA provides useful and distinguishable rankings on two applications of interest in energy storage. Consequently, our information-theoretic GSA provides an "outer loop" for accelerated product design by identifying the most and least sensitive input directions and performing subsequent optimization over appropriately reduced parameter subspaces.
Submitted 19 May, 2021; v1 submitted 7 September, 2020;
originally announced September 2020.
-
GINNs: Graph-Informed Neural Networks for Multiscale Physics
Authors:
Eric J. Hall,
Søren Taverniers,
Markos A. Katsoulakis,
Daniel M. Tartakovsky
Abstract:
We introduce the concept of a Graph-Informed Neural Network (GINN), a hybrid approach combining deep learning with probabilistic graphical models (PGMs) that acts as a surrogate for physics-based representations of multiscale and multiphysics systems. GINNs address the twin challenges of removing intrinsic computational bottlenecks in physics-based models and generating large data sets for estimating probability distributions of quantities of interest (QoIs) with a high degree of confidence. Both the selection of the complex physics learned by the NN and its supervised learning/prediction are informed by the PGM, which includes the formulation of structured priors for tunable control variables (CVs) to account for their mutual correlations and ensure physically sound CV and QoI distributions. GINNs accelerate the prediction of QoIs essential for simulation-based decision-making where generating sufficient sample data using physics-based models alone is often prohibitively expensive. Using a real-world application grounded in supercapacitor-based energy storage, we describe the construction of GINNs from a Bayesian network-embedded homogenized model for supercapacitor dynamics, and demonstrate their ability to produce kernel density estimates of relevant non-Gaussian, skewed QoIs with tight confidence intervals.
Submitted 26 June, 2020;
originally announced June 2020.
-
Towards a Human-like Open-Domain Chatbot
Authors:
Daniel Adiwardana,
Minh-Thang Luong,
David R. So,
Jamie Hall,
Noah Fiedel,
Romal Thoppilan,
Zi Yang,
Apoorv Kulshreshtha,
Gaurav Nemade,
Yifeng Lu,
Quoc V. Le
Abstract:
We present Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. We also propose a human evaluation metric called Sensibleness and Specificity Average (SSA), which captures key elements of a human-like multi-turn conversation. Our experiments show strong correlation between perplexity and SSA. The fact that the best perplexity end-to-end trained Meena scores high on SSA (72% on multi-turn evaluation) suggests that a human-level SSA of 86% is potentially within reach if we can better optimize perplexity. Additionally, the full version of Meena (with a filtering mechanism and tuned decoding) scores 79% SSA, 23% higher in absolute SSA than the existing chatbots we evaluated.
Submitted 27 February, 2020; v1 submitted 27 January, 2020;
originally announced January 2020.
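Note on the perplexity objective mentioned in the abstract above: perplexity is conventionally the exponential of the average negative log-probability a model assigns to each observed token. The following is a minimal sketch of that standard definition only; the function name `perplexity` and the example probabilities are illustrative assumptions and are not taken from the paper.

```python
import math


def perplexity(token_probs):
    """Perplexity of a sequence, given the probability the model
    assigned to each observed token (hypothetical helper).

    Each probability must be in (0, 1]; a zero probability would
    make the logarithm undefined.
    """
    # Average negative log-likelihood per token.
    mean_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    # Perplexity is the exponential of that average.
    return math.exp(mean_nll)


# A model that spreads probability uniformly over 4 choices at every
# step has perplexity 4 (up to floating-point rounding).
uniform_ppl = perplexity([0.25, 0.25, 0.25, 0.25])
```

Lower perplexity means the model assigns higher probability to the observed tokens, which is the quantity the abstract reports as correlating with SSA.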
-
Causality and Bayesian network PDEs for multiscale representations of porous media
Authors:
Kimoon Um,
Eric Joseph Hall,
Markos A. Katsoulakis,
Daniel M. Tartakovsky
Abstract:
Microscopic (pore-scale) properties of porous media affect and often determine their macroscopic (continuum- or Darcy-scale) counterparts. Understanding the relationship between processes on these two scales is essential to both the derivation of macroscopic models of, e.g., transport phenomena in natural porous media, and the design of novel materials, e.g., for energy storage. Most microscopic properties exhibit complex statistical correlations and geometric constraints, which presents challenges for the estimation of macroscopic quantities of interest (QoIs), e.g., in the context of global sensitivity analysis (GSA) of macroscopic QoIs with respect to microscopic material properties. We present a systematic way of building correlations into stochastic multiscale models through Bayesian networks. This allows us to construct the joint probability density function (PDF) of model parameters through causal relationships that emulate engineering processes, e.g., the design of hierarchical nanoporous materials. Such PDFs also serve as input for the forward propagation of parametric uncertainty; our findings indicate that the inclusion of causal relationships impacts predictions of macroscopic QoIs. To assess the impact of correlations and causal relationships between microscopic parameters on macroscopic material properties, we use a moment-independent GSA based on the differential mutual information. Our GSA accounts for the correlated inputs and complex non-Gaussian QoIs. The global sensitivity indices are used to rank the effect of uncertainty in microscopic parameters on macroscopic QoIs, to quantify the impact of causality on the multiscale model's predictions, and to provide physical interpretations of these results for hierarchical nanoporous materials.
Submitted 6 January, 2019;
originally announced January 2019.
-
Estimation of temperature-dependent growth profiles for the assessment of time of hatching in forensic entomology
Authors:
D. Pigoli,
J. A. D. Aston,
F. Ferraty,
A. Mazumder,
C. Richards,
M. J. R. Hall
Abstract:
Forensic entomology contributes important information to crime scene investigations. In this paper, we propose a method to estimate the hatching time of larvae (or maggots) based on their lengths, the temperature profile at the crime scene and experimental data on larval development. This requires the estimation of a time-dependent growth curve from experiments where larvae have been exposed to a relatively small number of constant temperature profiles. Since the temperature influences the developmental speed, a crucial step is the time alignment of the curves at different temperatures. We propose a model for time varying temperature profiles based on the local growth rate estimated from the experimental data. This allows us to estimate the most likely hatching time for a sample of larvae from the crime scene. Asymptotic properties are provided for the estimators of the growth curves and the hatching time. We explore via simulations the robustness of the method to errors in the estimated temperature profile. We also apply the methodology to data from two criminal cases from the United Kingdom.
Submitted 4 November, 2021; v1 submitted 2 September, 2017;
originally announced September 2017.
-
Bayesian inference for latent factor GARCH models
Authors:
Michael K. Pitt,
Jamie Hall,
Robert Kohn
Abstract:
Latent factor GARCH models are difficult to estimate using Bayesian methods because standard Markov chain Monte Carlo samplers produce slowly mixing and inefficient draws from the posterior distributions of the model parameters. This paper describes how to apply the particle Gibbs algorithm to estimate factor GARCH models efficiently. The method has two advantages over previous approaches. First, it generalises in a straightforward way to models with multiple factors and to various members of the GARCH family. Second, it scales up well as the dimension of the observation vector increases.
Submitted 5 July, 2015;
originally announced July 2015.
-
Bayesian inference for nonlinear structural time series models
Authors:
Jamie Hall,
Michael K. Pitt,
Robert Kohn
Abstract:
This article discusses a partially adapted particle filter for estimating the likelihood of nonlinear structural econometric state space models whose state transition density cannot be expressed in closed form. The filter generates the disturbances in the state transition equation and allows for multiple modes in the conditional disturbance distribution. The particle filter produces an unbiased estimate of the likelihood and so can be used to carry out Bayesian inference in a particle Markov chain Monte Carlo framework. We show empirically that when the signal-to-noise ratio is high, the new filter can be much more efficient than the standard particle filter, in the sense that it requires far fewer particles to give the same accuracy. The new filter is applied to several simulated and real examples and in particular to a dynamic stochastic general equilibrium model.
Submitted 3 September, 2012; v1 submitted 3 September, 2012;
originally announced September 2012.
-
Implementing Risk-Limiting Post-Election Audits in California
Authors:
Joseph Lorenzo Hall,
Luke W. Miratrix,
Philip B. Stark,
Melvin Briones,
Elaine Ginnold,
Freddie Oakley,
Martin Peaden,
Gail Pellerin,
Tom Stanionis,
Tricia Webber
Abstract:
Risk-limiting post-election audits limit the chance of certifying an electoral outcome if the outcome is not what a full hand count would show. Building on previous work, we report on pilot risk-limiting audits in four elections during 2008 in three California counties: one during the February 2008 Primary Election in Marin County and three during the November 2008 General Elections in Marin, Santa Cruz and Yolo Counties. We explain what makes an audit risk-limiting and how existing and proposed laws fall short. We discuss the differences among our four pilot audits. We identify challenges to practical, efficient risk-limiting audits and conclude that current approaches are too complex to be used routinely on a large scale. One important logistical bottleneck is the difficulty of exporting data from commercial election management systems in a format amenable to audit calculations. Finally, we propose a bare-bones risk-limiting audit that is less efficient than these pilot audits, but avoids many practical problems.
Submitted 10 July, 2009; v1 submitted 28 May, 2009;
originally announced May 2009.
-
Sequential tests and estimates after overrunning based on $p$-value combination
Authors:
W. J. Hall,
Keyue Ding
Abstract:
Often in sequential trials additional data become available after a stopping boundary has been reached. A method of incorporating such information from overrunning is developed, based on the ``adding weighted Zs'' method of combining $p$-values. This yields a combined $p$-value for the primary test and a median-unbiased estimate and confidence bounds for the parameter under test. When the amount of overrunning information is proportional to the amount available upon terminating the sequential test, exact inference methods are provided; otherwise, approximate methods are given and evaluated. The context is that of observing a Brownian motion with drift, with either linear stopping boundaries in continuous time or discrete-time group-sequential boundaries. The method is compared with other available methods and is exemplified with data from two sequential clinical trials.
Submitted 20 May, 2008;
originally announced May 2008.
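Note on the "adding weighted Zs" combination named in the abstract above: in its generic textbook form, each one-sided $p$-value is converted to a standard normal score, the scores are combined with fixed weights, and the result is converted back to a $p$-value. The sketch below illustrates only that generic form. The function name `combine_weighted_z` and the example weights are assumptions made for illustration; the weights the paper actually uses for overrunning data, and its estimation and confidence-bound procedures, are not reproduced here.

```python
from math import sqrt
from statistics import NormalDist


def combine_weighted_z(p_values, weights):
    """Combine one-sided p-values by the weighted-Z method
    (hypothetical helper; generic form only).

    Each p-value must lie strictly between 0 and 1, because the
    inverse normal CDF is undefined at the endpoints.
    """
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # Convert each p-value to a Z-score: small p gives large positive Z.
    z_scores = [std_normal.inv_cdf(1.0 - p) for p in p_values]
    # Weighted sum, rescaled so the result is standard normal
    # under the null hypothesis when the inputs are independent.
    combined_z = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    # Convert the combined Z-score back to a one-sided p-value.
    return 1.0 - std_normal.cdf(combined_z)


# Two independent p-values of 0.05 with equal (assumed) weights
# combine to a p-value of roughly 0.01.
combined_p = combine_weighted_z([0.05, 0.05], [1.0, 1.0])
```

The weights express how much each stage's evidence should count; choosing them in proportion to the square root of each stage's information is a common convention, stated here as an assumption rather than as the paper's choice.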