-
Fast Directed $q$-Analysis for Brain Graphs
Authors:
Felix Windisch,
Florian Unger
Abstract:
Recent innovations in reconstructing large-scale, full-precision, neuron-synapse-scale connectomes demand subsequent improvements to graph analysis methods to keep up with the growing complexity and size of the data. One such tool is the recently introduced directed $q$-analysis. We present numerous improvements, theoretical and applied, to this technique: on the theoretical side, we introduce modified definitions for key elements of directed $q$-analysis, which remedy a well-hidden and previously undetected bias. This also leads to new, beneficial perspectives on the associated computational challenges. Most importantly, we present a high-speed, publicly available, low-level implementation that provides speed-ups of several orders of magnitude on C. elegans. Furthermore, the speed gains grow with the size of the considered graph. This is made possible by the mathematical and algorithmic improvements as well as a carefully crafted implementation. These speed-ups enable, for the first time, the analysis of full-sized connectomes such as those obtained by recent reconstructive methods. Additionally, the speed-ups allow comparative analysis against corresponding null models, appropriately designed randomly structured artificial graphs that do not correspond to actual brains. This, in turn, allows for assessing the efficacy and usefulness of directed $q$-analysis for studying the brain. We report on the results in this paper.
Submitted 24 April, 2025; v1 submitted 8 January, 2025;
originally announced January 2025.
-
Posterior sampling with Adaptive Gaussian Processes in Bayesian parameter identification
Authors:
Paolo Villani,
Daniel Andrés-Arcones,
Jörg F. Unger,
Martin Weiser
Abstract:
Posterior sampling by Monte Carlo methods provides a more comprehensive solution approach to inverse problems than computing point estimates such as the maximum posterior using optimization methods, at the expense of usually requiring many more evaluations of the forward model. Replacing computationally expensive forward models by fast surrogate models is an attractive option. However, computing the simulated training data for building a sufficiently accurate surrogate model can be computationally expensive in itself, leading to the design-of-computer-experiments problem of finding evaluation points and accuracies such that the highest accuracy is obtained given a fixed computational budget. Here, we consider a fully adaptive greedy approach to this problem. Using Gaussian process regression as surrogate, samples are drawn from the available posterior approximation while designs are incrementally defined by solving a sequence of optimization problems for evaluation accuracy and positions. The selection of training designs is tailored towards representing the posterior to be sampled as well as possible, while the interleaved sampling steps discard old inaccurate samples in favor of new, more accurate ones. Numerical results show a significant reduction of the computational effort compared to just position-adaptive and static designs.
Submitted 26 November, 2024;
originally announced November 2024.
-
Adaptive Gaussian Process Regression for Bayesian inverse problems
Authors:
Paolo Villani,
Jörg Unger,
Martin Weiser
Abstract:
We introduce a novel adaptive Gaussian Process Regression (GPR) methodology for efficient construction of surrogate models for Bayesian inverse problems with expensive forward model evaluations. An adaptive design strategy focuses on optimizing both the positioning and simulation accuracy of training data in order to reduce the computational cost of simulating training data without compromising the fidelity of the posterior distributions of parameters. The method interleaves a goal-oriented active learning algorithm selecting evaluation points and tolerances based on the expected impact on the Kullback-Leibler divergence of surrogated and true posterior with a Markov Chain Monte Carlo sampling of the posterior. The performance benefit of the adaptive approach is demonstrated for two simple test problems.
Submitted 30 April, 2024;
originally announced April 2024.
-
From concrete mixture to structural design -- a holistic optimization procedure in the presence of uncertainties
Authors:
Atul Agrawal,
Erik Tamsen,
Phaedon-Stelios Koutsourelakis,
Joerg F. Unger
Abstract:
Designing civil structures such as bridges, dams or buildings is a complex task requiring many synergies from several experts. Each is responsible for different parts of the process. This is often done in a sequential manner, e.g. the structural engineer makes a design under the assumption of certain material properties (e.g. the strength class of the concrete), and then the material engineer optimizes the material with these restrictions. This paper proposes a holistic optimization procedure, which combines the concrete mixture design and structural simulations in a joint, forward workflow that we ultimately seek to invert. In this manner, new mixtures beyond standard ranges can be considered. Any design effort should account for the presence of uncertainties, which can be aleatoric or epistemic, as when data is used to calibrate physical models or identify models that fill missing links in the workflow. Inverting the causal relations established poses several challenges, especially when these involve physics-based models, which more often than not do not provide derivatives/sensitivities, or when design constraints are present. To this end, we advocate Variational Optimization, with proposed extensions and appropriately chosen heuristics to overcome the aforementioned challenges. The proposed methodology is illustrated using the design of a precast concrete beam with the objective to minimize the global warming potential while satisfying a number of constraints associated with its load-bearing capacity after 28 days according to the Eurocode, the demoulding time as computed by a complex nonlinear Finite Element model, and the maximum temperature during the hydration.
Submitted 6 December, 2023;
originally announced December 2023.
-
MCMC Sampling of Directed Flag Complexes with Fixed Undirected Graphs
Authors:
Florian Unger,
Jonathan Krebs
Abstract:
Constructing null models to test the significance of extracted information is a crucial step in data analysis. In this work, we provide a uniformly sampleable null model of directed graphs with the same (or similar) number of simplices in the flag complex, with the restriction of retaining the underlying undirected graph. We describe an MCMC-based algorithm to sample from this null model and statistically investigate the mixing behaviour. This is paired with a high-performance, Rust-based, publicly available implementation. The motivation comes from topological data analysis of connectomes in neuroscience. In particular, we answer the fundamental question: are the high Betti numbers observed in the investigated graphs evidence of an interesting topology, or are they merely a byproduct of the high numbers of simplices? Indeed, by applying our new tool to the connectome of C. elegans and parts of the statistical reconstructions of the Blue Brain Project, we find that the Betti numbers observed are considerable statistical outliers with respect to this new null model. We thus, for the first time, statistically confirm that topological data analysis in microscale connectome research is extracting statistically meaningful information.
Submitted 6 September, 2023;
originally announced September 2023.
-
Simplex Closing Probabilities in Directed Graphs
Authors:
Florian Unger,
Jonathan Krebs,
Michael G. Müller
Abstract:
Recent work in mathematical neuroscience has calculated the directed graph homology of the directed simplicial complex given by the brain's sparse adjacency graph, the so-called connectome.
These biological connectomes show an abundance of both high-dimensional directed simplices and Betti numbers in all viable dimensions, in contrast to Erdős-Rényi graphs of comparable size and density. An analysis of synthetically trained connectomes reveals similar findings, raising questions about the graphs' comparability and the origin of the simplices.
We present a new method capable of delivering insight into the emergence of simplices and thus simplicial abundance. Our approach makes it easy to distinguish simplex-rich connectomes of different origins. The method relies on the novel concept of an almost-d-simplex, that is, a simplex missing exactly one edge, and consequently the almost-d-simplex closing probability by dimension. We also describe a fast algorithm to identify almost-d-simplices in a given graph. Applying this method to biological and artificial data allows us to identify a mechanism responsible for simplex emergence, and suggests this mechanism is responsible for the simplex signature of the excitatory subnetwork of a statistical reconstruction of the mouse primary visual cortex. Our highly optimised code for this new method is publicly available.
Submitted 13 May, 2022;
originally announced May 2022.
-
Stability of a spatial polling system with greedy myopic service
Authors:
Lasse Leskelä,
Falk Unger
Abstract:
This paper studies a spatial queueing system on a circle, polled at random locations by a myopic server that can only observe customers in a bounded neighborhood. The server operates according to a greedy policy, always serving the nearest customer in its neighborhood, and leaving the system unchanged at polling instants where the neighborhood is empty. This system is modeled as a measure-valued random process, which is shown to be positive recurrent under a natural stability condition that does not depend on the server's scan range. When the interpolling times are light-tailed, the stable system is shown to be geometrically ergodic. The steady-state behavior of the system is briefly discussed using numerical simulations and a heuristic light-traffic approximation.
Submitted 27 April, 2010; v1 submitted 31 August, 2009;
originally announced August 2009.