-
Efficient Long-Term Structural Reliability Estimation with Non-Gaussian Stochastic Models: A Design of Experiments Approach
Authors:
Sebastian Winter,
Christian Agrell,
Juan Camilo Guevara Gómez,
Erik Vanem
Abstract:
Extreme response assessment is important in the design and operation of engineering structures, and is a crucial part of structural risk and reliability analyses. Structures should be designed in a way that enables them to withstand the environmental loads they are expected to experience over their lifetime, without designs being unnecessarily conservative and costly. An accurate risk estimate is essential but difficult to obtain because the long-term behaviour of a structure is typically too complex to calculate analytically or with brute force Monte Carlo simulation. Therefore, approximation methods are required to estimate the extreme response using only a limited number of short-term conditional response calculations. Combining surrogate models with Design of Experiments is an approximation approach that has gained popularity due to its ability to account for both long-term environment variability and short-term response variability. In this paper, we propose a method for estimating the extreme response of black-box, stochastic models with heteroscedastic non-Gaussian noise. We present a mathematically founded extreme response estimation process that enables Design of Experiment approaches that are prohibitively expensive with surrogate Monte Carlo. The theory leads us to speculate this method can robustly produce more confident extreme response estimates, and is suitable for a variety of domains. While this needs to be further validated empirically, the method offers a promising tool for reducing the uncertainty decision-makers face, allowing them to make better informed choices and create more optimal structures.
Submitted 3 March, 2025;
originally announced March 2025.
-
Identification and Scaling of Latent Variables in Ordinal Factor Analysis
Authors:
Edgar C. Merkle,
Sonja D. Winter,
Ellen Fitzsimmons
Abstract:
Social science researchers are generally accustomed to treating ordinal variables as though they are continuous. In this paper, we consider how identification constraints in ordinal factor analysis can mimic the treatment of ordinal variables as continuous. We describe model constraints that lead to latent variable predictions equaling the average of ordinal variables. This result leads us to propose minimal identification constraints, which we call "integer constraints," that center the latent variables around the scale of the observed, integer-coded ordinal variables. The integer constraints lead to intuitive model parameterizations because researchers are already accustomed to thinking about ordinal variables as though they are continuous. We provide a proof that our proposed integer constraints are indeed minimal identification constraints, as well as an illustration of how integer constraints work with real data. We also provide simulation results indicating that integer constraints are similar to other identification constraints in terms of estimation convergence and admissibility.
Submitted 10 January, 2025;
originally announced January 2025.
-
Modelling Loss of Complexity in Intermittent Time Series and its Application
Authors:
Jie Li,
Jian Zhang,
Samantha L. Winter,
Mark Burnley
Abstract:
In this paper, we developed a novel nonparametric relative entropy (RlEn) method for modelling loss of complexity in intermittent time series. The method consists of two steps. We first fit a nonlinear autoregressive model to each intermittent time series, where the lag order is determined by the Bayesian Information Criterion (BIC) and the loss of complexity by relative entropy. Then, change-points in the complexity are detected by a cumulative sum (CUSUM) based statistic. The performance of RlEn was compared by simulation with that of approximate entropy (ApEn), a popular method in the literature, in terms of (1) the ability to localize complexity change-points in intermittent time series and (2) the ability to faithfully estimate underlying nonlinear models. The proposal was then examined in a real-data analysis of fatigue-induced changes in the complexity of human motor outputs. The results showed that the proposed method outperformed ApEn in accurately detecting changes of complexity in intermittent time series segments.
Submitted 6 January, 2025; v1 submitted 21 November, 2024;
originally announced November 2024.
-
Sequential Gibbs Posteriors with Applications to Principal Component Analysis
Authors:
Steven Winter,
Omar Melikechi,
David B. Dunson
Abstract:
Gibbs posteriors are proportional to a prior distribution multiplied by an exponentiated loss function, with a key tuning parameter weighting information in the loss relative to the prior and providing a control of posterior uncertainty. Gibbs posteriors provide a principled framework for likelihood-free Bayesian inference, but in many situations, including a single tuning parameter inevitably leads to poor uncertainty quantification. In particular, regardless of the value of the parameter, credible regions have far from the nominal frequentist coverage even in large samples. We propose a sequential extension to Gibbs posteriors to address this problem. We prove the proposed sequential posterior exhibits concentration and a Bernstein-von Mises theorem, which holds under easy-to-verify conditions in Euclidean space and on manifolds. As a byproduct, we obtain the first Bernstein-von Mises theorem for traditional likelihood-based Bayesian posteriors on manifolds. All methods are illustrated with an application to principal component analysis.
Submitted 19 October, 2023;
originally announced October 2023.
-
Machine Learning and the Future of Bayesian Computation
Authors:
Steven Winter,
Trevor Campbell,
Lizhen Lin,
Sanvesh Srivastava,
David B. Dunson
Abstract:
Bayesian models are a powerful tool for studying complex data, allowing the analyst to encode rich hierarchical dependencies and leverage prior information. Most importantly, they facilitate a complete characterization of uncertainty through the posterior distribution. Practical posterior computation is commonly performed via MCMC, which can be computationally infeasible for high dimensional models with many observations. In this article we discuss the potential to improve posterior computation using ideas from machine learning. Concrete future directions are explored in vignettes on normalizing flows, Bayesian coresets, distributed Bayesian inference, and variational inference.
Submitted 21 April, 2023;
originally announced April 2023.
-
Opaque prior distributions in Bayesian latent variable models
Authors:
Edgar C. Merkle,
Oludare Ariyo,
Sonja D. Winter,
Mauricio Garnier-Villarreal
Abstract:
We review common situations in Bayesian latent variable models where the prior distribution that a researcher specifies differs from the prior distribution used during estimation. These situations can arise from the positive definite requirement on correlation matrices, from sign indeterminacy of factor loadings, and from order constraints on threshold parameters. The issue is especially problematic for reproducibility and for model checks that involve prior distributions, including prior predictive assessment and Bayes factors. In these cases, one might be assessing the wrong model, casting doubt on the relevance of the results. The most straightforward solution to the issue sometimes involves use of informative prior distributions. We explore other solutions and make recommendations for practice.
Submitted 24 August, 2023; v1 submitted 20 January, 2023;
originally announced January 2023.
-
Interpretable AI for relating brain structural and functional connectomes
Authors:
Haoming Yang,
Steven Winter,
Zhengwu Zhang,
David Dunson
Abstract:
One of the central problems in neuroscience is understanding how brain structure relates to function. Naively one can relate the direct connections of white matter fiber tracts between brain regions of interest (ROIs) to the increased co-activation in the same pair of ROIs, but the link between structural and functional connectomes (SCs and FCs) has proven to be much more complex. To learn a realistic generative model characterizing population variation in SCs, FCs, and the SC-FC coupling, we develop a graph auto-encoder that we refer to as Staf-GATE. We trained Staf-GATE with data from the Human Connectome Project (HCP) and show state-of-the-art performance in predicting FC and joint generation of SC and FC. In addition, as a crucial component of the proposed approach, we provide a masking-based algorithm to extract interpretable inferences about SC-FC coupling. Our interpretation methods identified SC subnetworks that are important for FC coupling and for relating SC and FC to sex.
Submitted 29 August, 2023; v1 submitted 10 October, 2022;
originally announced October 2022.
-
Multi-scale graph principal component analysis for connectomics
Authors:
Steven Winter,
Zhengwu Zhang,
David Dunson
Abstract:
In brain connectomics, the cortical surface is parcellated into different regions of interest (ROIs) prior to statistical analysis. The brain connectome for each individual can then be represented as a graph, with the nodes corresponding to ROIs and edges to connections between ROIs. Such a graph can be summarized as an adjacency matrix, with each cell containing the strength of connection between a pair of ROIs. These matrices are symmetric with the diagonal elements corresponding to self-connections typically excluded. A major disadvantage of such representations of the connectome is their sensitivity to the chosen ROIs, including critically the number of ROIs and hence the scale of the graph. As the scale becomes finer and more ROIs are used, graphs become increasingly sparse. Clearly, the results of downstream statistical analyses can be highly dependent on the chosen parcellation. To solve this problem, we propose a multi-scale graph factorization, which links together scale-specific factorizations through a common set of individual-specific scores. These scores summarize an individual's brain structure combining information across measurement scales. We obtain a simple and efficient algorithm for implementation, and illustrate substantial advantages over single scale approaches in simulations and analyses of the Human Connectome Project dataset.
Submitted 5 October, 2020;
originally announced October 2020.