-
Multilayer networks characterize human-mobility patterns by industry sector for the 2021 Texas winter storm
Authors:
Melissa Butler,
Alisha Khan,
Francis Afrifa,
Yingjie Hu,
Dane Taylor
Abstract:
Understanding human mobility during disastrous events is crucial for emergency planning and disaster management. Here, we develop a methodology involving the construction of time-varying, multilayer networks in which edges encode observed movements between spatial regions (census tracts) and network layers encode different movement categories according to industry sectors (e.g., visitations to sch…
▽ More
Understanding human mobility during disastrous events is crucial for emergency planning and disaster management. Here, we develop a methodology involving the construction of time-varying, multilayer networks in which edges encode observed movements between spatial regions (census tracts) and network layers encode different movement categories according to industry sectors (e.g., visitations to schools, hospitals, and grocery stores). This approach provides a rich characterization of human mobility, thereby complementing studies examining the risk-aversion activities of evacuation and sheltering in place. Focusing on the 2021 Texas winter storm as a case study which led to many casualties, we find that people largely reduced their movements to ambulatory healthcare services, restaurants, and schools, but prioritized movements to grocery stores and gas stations. Additionally, we study the predictability of nodes' in- and out-degrees in the multilayer networks, which encode movements into and out of census tracts. We find that inward movements are harder to predict than outward movements, and even more so during this winter storm. Our findings about the reduction, prioritization, and predictability of sector-specific human movements could inform mobility-related decisions arising from future extreme weather events.
△ Less
Submitted 3 September, 2025;
originally announced September 2025.
-
A novel application of Shapley values for large multidimensional time-series data: Applying explainable AI to a DNA profile classification neural network
Authors:
Lauren Elborough,
Duncan Taylor,
Melissa Humphries
Abstract:
The application of Shapley values to high-dimensional, time-series-like data is computationally challenging - and sometimes impossible. For $N$ inputs the problem is $2^N$ hard. In image processing, clusters of pixels, referred to as superpixels, are used to streamline computations. This research presents an efficient solution for time-seres-like data that adapts the idea of superpixels for Shaple…
▽ More
The application of Shapley values to high-dimensional, time-series-like data is computationally challenging - and sometimes impossible. For $N$ inputs the problem is $2^N$ hard. In image processing, clusters of pixels, referred to as superpixels, are used to streamline computations. This research presents an efficient solution for time-seres-like data that adapts the idea of superpixels for Shapley value computation. Motivated by a forensic DNA classification example, the method is applied to multivariate time-series-like data whose features have been classified by a convolutional neural network (CNN). In DNA processing, it is important to identify alleles from the background noise created by DNA extraction and processing. A single DNA profile has $31,200$ scan points to classify, and the classification decisions must be defensible in a court of law. This means that classification is routinely performed by human readers - a monumental and time consuming process. The application of a CNN with fast computation of meaningful Shapley values provides a potential alternative to the classification. This research demonstrates the realistic, accurate and fast computation of Shapley values for this massive task
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
A diagnosis of the primary difference between EuroForMix and STRmix
Authors:
John Buckleton,
Mateusz Susik,
James M. Curran,
Kevin Cheng,
Duncan Taylor,
Jo-Anne Bright,
Hannah Kelly,
Richard Wivell
Abstract:
There is interest in comparing the output, principally the likelihood ratio, from the two probabilistic genotyping software EuroForMix (EFM) and STRmix. Many of these comparison studies are descriptive and make little or no effort to diagnose the cause of difference. There are fundamental differences between EFM and STRmix that are causative of the largest set of likelihood ratio differences. This…
▽ More
There is interest in comparing the output, principally the likelihood ratio, from the two probabilistic genotyping software EuroForMix (EFM) and STRmix. Many of these comparison studies are descriptive and make little or no effort to diagnose the cause of difference. There are fundamental differences between EFM and STRmix that are causative of the largest set of likelihood ratio differences. This set of differences is for false donors where there are many instances of LRs just above or below 1 for EFM that give much lower LRs in STRmix. This is caused by the separate estimation of parameters such as allele height variance and mixture proportion under Hp and Ha for EFM. It results in a departure from calibration for EFM in the region of LRs just above and below 1.
△ Less
Submitted 22 June, 2023;
originally announced July 2023.
-
Continuous-in-Depth Neural Networks
Authors:
Alejandro F. Queiruga,
N. Benjamin Erichson,
Dane Taylor,
Michael W. Mahoney
Abstract:
Recent work has attempted to interpret residual networks (ResNets) as one step of a forward Euler discretization of an ordinary differential equation, focusing mainly on syntactic algebraic similarities between the two systems. Discrete dynamical integrators of continuous dynamical systems, however, have a much richer structure. We first show that ResNets fail to be meaningful dynamical integrator…
▽ More
Recent work has attempted to interpret residual networks (ResNets) as one step of a forward Euler discretization of an ordinary differential equation, focusing mainly on syntactic algebraic similarities between the two systems. Discrete dynamical integrators of continuous dynamical systems, however, have a much richer structure. We first show that ResNets fail to be meaningful dynamical integrators in this richer sense. We then demonstrate that neural network models can learn to represent continuous dynamical systems, with this richer structure and properties, by embedding them into higher-order numerical integration schemes, such as the Runge Kutta schemes. Based on these insights, we introduce ContinuousNet as a continuous-in-depth generalization of ResNet architectures. ContinuousNets exhibit an invariance to the particular computational graph manifestation. That is, the continuous-in-depth model can be evaluated with different discrete time step sizes, which changes the number of layers, and different numerical integration schemes, which changes the graph connectivity. We show that this can be used to develop an incremental-in-depth training scheme that improves model quality, while significantly decreasing training time. We also show that, once trained, the number of units in the computational graph can even be decreased, for faster inference with little-to-no accuracy drop.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.
-
Noise-Response Analysis of Deep Neural Networks Quantifies Robustness and Fingerprints Structural Malware
Authors:
N. Benjamin Erichson,
Dane Taylor,
Qixuan Wu,
Michael W. Mahoney
Abstract:
The ubiquity of deep neural networks (DNNs), cloud-based training, and transfer learning is giving rise to a new cybersecurity frontier in which unsecure DNNs have `structural malware' (i.e., compromised weights and activation pathways). In particular, DNNs can be designed to have backdoors that allow an adversary to easily and reliably fool an image classifier by adding a pattern of pixels called…
▽ More
The ubiquity of deep neural networks (DNNs), cloud-based training, and transfer learning is giving rise to a new cybersecurity frontier in which unsecure DNNs have `structural malware' (i.e., compromised weights and activation pathways). In particular, DNNs can be designed to have backdoors that allow an adversary to easily and reliably fool an image classifier by adding a pattern of pixels called a trigger. It is generally difficult to detect backdoors, and existing detection methods are computationally expensive and require extensive resources (e.g., access to the training data). Here, we propose a rapid feature-generation technique that quantifies the robustness of a DNN, `fingerprints' its nonlinearity, and allows us to detect backdoors (if present). Our approach involves studying how a DNN responds to noise-infused images with varying noise intensity, which we summarize with titration curves. We find that DNNs with backdoors are more sensitive to input noise and respond in a characteristic way that reveals the backdoor and where it leads (its `target'). Our empirical results demonstrate that we can accurately detect backdoors with high confidence orders-of-magnitude faster than existing approaches (seconds versus hours).
△ Less
Submitted 3 February, 2021; v1 submitted 31 July, 2020;
originally announced August 2020.
-
A Quantized Representation of Intertemporal Choice in the Brain
Authors:
James Tee,
Desmond P. Taylor
Abstract:
Value [4][5] is typically modeled using a continuous representation (i.e., a Real number). A discrete representation of value has recently been postulated [6]. A quantized representation of probability in the brain was also posited and supported by experimental data [7]. Value and probability are inter-related via Prospect Theory [4][5]. In this paper, we hypothesize that intertemporal choices may…
▽ More
Value [4][5] is typically modeled using a continuous representation (i.e., a Real number). A discrete representation of value has recently been postulated [6]. A quantized representation of probability in the brain was also posited and supported by experimental data [7]. Value and probability are inter-related via Prospect Theory [4][5]. In this paper, we hypothesize that intertemporal choices may also be quantized. For example, people may treat (or discount) 16 days indifferently to 17 days. To test this, we analyzed an intertemporal task by using 2 novel models: quantized hyperbolic discounting, and quantized exponential discounting. Our work here is a re-examination of the behavioral data previously collected for an fMRI study [8]. Both quantized hyperbolic and quantized exponential models were compared using AIC and BIC tests. We found that 13/20 participants were best fit to the quantized exponential model, while the remaining 7/20 were best fit to the quantized hyperbolic model. Overall, 15/20 participants were best fit to models with a 5-bit precision (i.e., 2^5 = 32 steps). In conclusion, regardless of hyperbolic or exponential, quantized versions of these models are better fit to the experimental data than their continuous forms. We finally outline some potential applications of our findings.
△ Less
Submitted 15 September, 2020; v1 submitted 23 February, 2020;
originally announced February 2020.
-
Co-Attentive Cross-Modal Deep Learning for Medical Evidence Synthesis and Decision Making
Authors:
Devin Taylor,
Simeon Spasov,
Pietro Liò
Abstract:
Modern medicine requires generalised approaches to the synthesis and integration of multimodal data, often at different biological scales, that can be applied to a variety of evidence structures, such as complex disease analyses and epidemiological models. However, current methods are either slow and expensive, or ineffective due to the inability to model the complex relationships between data mod…
▽ More
Modern medicine requires generalised approaches to the synthesis and integration of multimodal data, often at different biological scales, that can be applied to a variety of evidence structures, such as complex disease analyses and epidemiological models. However, current methods are either slow and expensive, or ineffective due to the inability to model the complex relationships between data modes which differ in scale and format. We address these issues by proposing a cross-modal deep learning architecture and co-attention mechanism to accurately model the relationships between the different data modes, while further reducing patient diagnosis time. Differentiating Parkinson's Disease (PD) patients from healthy patients forms the basis of the evaluation. The model outperforms the previous state-of-the-art unimodal analysis by 2.35%, while also being 53% more parameter efficient than the industry standard cross-modal model. Furthermore, the evaluation of the attention coefficients allows for qualitative insights to be obtained. Through the coupling with bioinformatics, a novel link between the interferon-gamma-mediated pathway, DNA methylation and PD was identified. We believe that our approach is general and could optimise the process of medical evidence synthesis and decision making in an actionable way.
△ Less
Submitted 8 November, 2019; v1 submitted 13 September, 2019;
originally announced September 2019.
-
Network-ensemble comparisons with stochastic rewiring and von Neumann entropy
Authors:
Zichao Li,
Peter J. Mucha,
Dane Taylor
Abstract:
Assessing whether a given network is typical or atypical for a random-network ensemble (i.e., network-ensemble comparison) has widespread applications ranging from null-model selection and hypothesis testing to clustering and classifying networks. We develop a framework for network-ensemble comparison by subjecting the network to stochastic rewiring. We study two rewiring processes, uniform and de…
▽ More
Assessing whether a given network is typical or atypical for a random-network ensemble (i.e., network-ensemble comparison) has widespread applications ranging from null-model selection and hypothesis testing to clustering and classifying networks. We develop a framework for network-ensemble comparison by subjecting the network to stochastic rewiring. We study two rewiring processes, uniform and degree-preserved rewiring, which yield random-network ensembles that converge to the Erdos-Renyi and configuration-model ensembles, respectively. We study convergence through von Neumann entropy (VNE), a network summary statistic measuring information content based on the spectra of a Laplacian matrix, and develop a perturbation analysis for the expected effect of rewiring on VNE. Our analysis yields an estimate for how many rewires are required for a given network to resemble a typical network from an ensemble, offering a computationally efficient quantity for network-ensemble comparison that does not require simulation of the corresponding rewiring process.
△ Less
Submitted 29 November, 2017; v1 submitted 4 April, 2017;
originally announced April 2017.
-
Ensemble-based estimates of eigenvector error for empirical covariance matrices
Authors:
Dane Taylor,
Juan G. Restrepo,
Francois G. Meyer
Abstract:
Covariance matrices are fundamental to the analysis and forecast of economic, physical and biological systems. Although the eigenvalues $\{λ_i\}$ and eigenvectors $\{{\bf u}_i\}$ of a covariance matrix are central to such endeavors, in practice one must inevitably approximate the covariance matrix based on data with finite sample size $n$ to obtain empirical eigenvalues $\{\tildeλ_i\}$ and eigenve…
▽ More
Covariance matrices are fundamental to the analysis and forecast of economic, physical and biological systems. Although the eigenvalues $\{λ_i\}$ and eigenvectors $\{{\bf u}_i\}$ of a covariance matrix are central to such endeavors, in practice one must inevitably approximate the covariance matrix based on data with finite sample size $n$ to obtain empirical eigenvalues $\{\tildeλ_i\}$ and eigenvectors $\{\tilde{\bf u}_i\}$, and therefore understanding the error so introduced is of central importance. We analyze eigenvector error $\|{\bf u}_i - \tilde{\bf u}_i \|^2$ while leveraging the assumption that the true covariance matrix having size $p$ is drawn from a matrix ensemble with known spectral properties---particularly, we assume the distribution of population eigenvalues weakly converges as $p\to\infty$ to a spectral density $ρ(λ)$ and that the spacing between population eigenvalues is similar to that for the Gaussian orthogonal ensemble. Our approach complements previous analyses of eigenvector error that require the full set of eigenvalues to be known, which can be computationally infeasible when $p$ is large. To provide a scalable approach for uncertainty quantification of eigenvector error, we consider a fixed eigenvalue $λ$ and approximate the distribution of the expected square error $r= \mathbb{E}\left[\| {\bf u}_i - \tilde{\bf u}_i \|^2\right]$ across the matrix ensemble for all ${\bf u}_i$ associated with $λ_i=λ$. We find, for example, that for sufficiently large matrix size $p$ and sample size $n>p$, the probability density of $r$ scales as $1/nr^2$. This power-law scaling implies that eigenvector error is extremely heterogeneous---even if $r$ is very small for most eigenvectors, it can be large for others with non-negligible probability. We support this and further results with numerical experiments.
△ Less
Submitted 28 February, 2018; v1 submitted 28 December, 2016;
originally announced December 2016.
-
Clustering Network Layers With the Strata Multilayer Stochastic Block Model
Authors:
Natalie Stanley,
Saray Shai,
Dane Taylor,
Peter J. Mucha
Abstract:
Multilayer networks are a useful data structure for simultaneously capturing multiple types of relationships between a set of nodes. In such networks, each relational definition gives rise to a layer. While each layer provides its own set of information, community structure across layers can be collectively utilized to discover and quantify underlying relational patterns between nodes. To concisel…
▽ More
Multilayer networks are a useful data structure for simultaneously capturing multiple types of relationships between a set of nodes. In such networks, each relational definition gives rise to a layer. While each layer provides its own set of information, community structure across layers can be collectively utilized to discover and quantify underlying relational patterns between nodes. To concisely extract information from a multilayer network, we propose to identify and combine sets of layers with meaningful similarities in community structure. In this paper, we describe the "strata multilayer stochastic block model'' (sMLSBM), a probabilistic model for multilayer community structure. The central extension of the model is that there exist groups of layers, called "strata'', which are defined such that all layers in a given stratum have community structure described by a common stochastic block model (SBM). That is, layers in a stratum exhibit similar node-to-community assignments and SBM probability parameters. Fitting the sMLSBM to a multilayer network provides a joint clustering that yields node-to-community and layer-to-stratum assignments, which cooperatively aid one another during inference. We describe an algorithm for separating layers into their appropriate strata and an inference technique for estimating the SBM parameters for each stratum. We demonstrate our method using synthetic networks and a multilayer network inferred from data collected in the Human Microbiome Project.
△ Less
Submitted 9 October, 2015; v1 submitted 7 July, 2015;
originally announced July 2015.