Search | arXiv e-print repository

Multilayer networks characterize human-mobility patterns by industry sector for the 2021 Texas winter storm

Authors: Melissa Butler, Alisha Khan, Francis Afrifa, Yingjie Hu, Dane Taylor

Abstract: Understanding human mobility during disastrous events is crucial for emergency planning and disaster management. Here, we develop a methodology involving the construction of time-varying, multilayer networks in which edges encode observed movements between spatial regions (census tracts) and network layers encode different movement categories according to industry sectors (e.g., visitations to sch… ▽ More Understanding human mobility during disastrous events is crucial for emergency planning and disaster management. Here, we develop a methodology involving the construction of time-varying, multilayer networks in which edges encode observed movements between spatial regions (census tracts) and network layers encode different movement categories according to industry sectors (e.g., visitations to schools, hospitals, and grocery stores). This approach provides a rich characterization of human mobility, thereby complementing studies examining the risk-aversion activities of evacuation and sheltering in place. Focusing on the 2021 Texas winter storm as a case study which led to many casualties, we find that people largely reduced their movements to ambulatory healthcare services, restaurants, and schools, but prioritized movements to grocery stores and gas stations. Additionally, we study the predictability of nodes' in- and out-degrees in the multilayer networks, which encode movements into and out of census tracts. We find that inward movements are harder to predict than outward movements, and even more so during this winter storm. Our findings about the reduction, prioritization, and predictability of sector-specific human movements could inform mobility-related decisions arising from future extreme weather events. △ Less

Submitted 3 September, 2025; originally announced September 2025.

arXiv:2409.18156 [pdf, other]

A novel application of Shapley values for large multidimensional time-series data: Applying explainable AI to a DNA profile classification neural network

Authors: Lauren Elborough, Duncan Taylor, Melissa Humphries

Abstract: The application of Shapley values to high-dimensional, time-series-like data is computationally challenging - and sometimes impossible. For $N$ inputs the problem is $2^N$ hard. In image processing, clusters of pixels, referred to as superpixels, are used to streamline computations. This research presents an efficient solution for time-seres-like data that adapts the idea of superpixels for Shaple… ▽ More The application of Shapley values to high-dimensional, time-series-like data is computationally challenging - and sometimes impossible. For $N$ inputs the problem is $2^N$ hard. In image processing, clusters of pixels, referred to as superpixels, are used to streamline computations. This research presents an efficient solution for time-seres-like data that adapts the idea of superpixels for Shapley value computation. Motivated by a forensic DNA classification example, the method is applied to multivariate time-series-like data whose features have been classified by a convolutional neural network (CNN). In DNA processing, it is important to identify alleles from the background noise created by DNA extraction and processing. A single DNA profile has $31,200$ scan points to classify, and the classification decisions must be defensible in a court of law. This means that classification is routinely performed by human readers - a monumental and time consuming process. The application of a CNN with fast computation of meaningful Shapley values provides a potential alternative to the classification. This research demonstrates the realistic, accurate and fast computation of Shapley values for this massive task △ Less

Submitted 26 September, 2024; originally announced September 2024.

Comments: 16 pages, 5 figures

arXiv:2307.00015 [pdf]

A diagnosis of the primary difference between EuroForMix and STRmix

Authors: John Buckleton, Mateusz Susik, James M. Curran, Kevin Cheng, Duncan Taylor, Jo-Anne Bright, Hannah Kelly, Richard Wivell

Abstract: There is interest in comparing the output, principally the likelihood ratio, from the two probabilistic genotyping software EuroForMix (EFM) and STRmix. Many of these comparison studies are descriptive and make little or no effort to diagnose the cause of difference. There are fundamental differences between EFM and STRmix that are causative of the largest set of likelihood ratio differences. This… ▽ More There is interest in comparing the output, principally the likelihood ratio, from the two probabilistic genotyping software EuroForMix (EFM) and STRmix. Many of these comparison studies are descriptive and make little or no effort to diagnose the cause of difference. There are fundamental differences between EFM and STRmix that are causative of the largest set of likelihood ratio differences. This set of differences is for false donors where there are many instances of LRs just above or below 1 for EFM that give much lower LRs in STRmix. This is caused by the separate estimation of parameters such as allele height variance and mixture proportion under Hp and Ha for EFM. It results in a departure from calibration for EFM in the region of LRs just above and below 1. △ Less

Submitted 22 June, 2023; originally announced July 2023.

Comments: 21 pages, 9 figures, 5 tables

arXiv:2008.02389 [pdf, other]

Continuous-in-Depth Neural Networks

Authors: Alejandro F. Queiruga, N. Benjamin Erichson, Dane Taylor, Michael W. Mahoney

Abstract: Recent work has attempted to interpret residual networks (ResNets) as one step of a forward Euler discretization of an ordinary differential equation, focusing mainly on syntactic algebraic similarities between the two systems. Discrete dynamical integrators of continuous dynamical systems, however, have a much richer structure. We first show that ResNets fail to be meaningful dynamical integrator… ▽ More Recent work has attempted to interpret residual networks (ResNets) as one step of a forward Euler discretization of an ordinary differential equation, focusing mainly on syntactic algebraic similarities between the two systems. Discrete dynamical integrators of continuous dynamical systems, however, have a much richer structure. We first show that ResNets fail to be meaningful dynamical integrators in this richer sense. We then demonstrate that neural network models can learn to represent continuous dynamical systems, with this richer structure and properties, by embedding them into higher-order numerical integration schemes, such as the Runge Kutta schemes. Based on these insights, we introduce ContinuousNet as a continuous-in-depth generalization of ResNet architectures. ContinuousNets exhibit an invariance to the particular computational graph manifestation. That is, the continuous-in-depth model can be evaluated with different discrete time step sizes, which changes the number of layers, and different numerical integration schemes, which changes the graph connectivity. We show that this can be used to develop an incremental-in-depth training scheme that improves model quality, while significantly decreasing training time. We also show that, once trained, the number of units in the computational graph can even be decreased, for faster inference with little-to-no accuracy drop. △ Less

Submitted 5 August, 2020; originally announced August 2020.

arXiv:2008.00123 [pdf, other]

Noise-Response Analysis of Deep Neural Networks Quantifies Robustness and Fingerprints Structural Malware

Authors: N. Benjamin Erichson, Dane Taylor, Qixuan Wu, Michael W. Mahoney

Abstract: The ubiquity of deep neural networks (DNNs), cloud-based training, and transfer learning is giving rise to a new cybersecurity frontier in which unsecure DNNs have `structural malware' (i.e., compromised weights and activation pathways). In particular, DNNs can be designed to have backdoors that allow an adversary to easily and reliably fool an image classifier by adding a pattern of pixels called… ▽ More The ubiquity of deep neural networks (DNNs), cloud-based training, and transfer learning is giving rise to a new cybersecurity frontier in which unsecure DNNs have `structural malware' (i.e., compromised weights and activation pathways). In particular, DNNs can be designed to have backdoors that allow an adversary to easily and reliably fool an image classifier by adding a pattern of pixels called a trigger. It is generally difficult to detect backdoors, and existing detection methods are computationally expensive and require extensive resources (e.g., access to the training data). Here, we propose a rapid feature-generation technique that quantifies the robustness of a DNN, `fingerprints' its nonlinearity, and allows us to detect backdoors (if present). Our approach involves studying how a DNN responds to noise-infused images with varying noise intensity, which we summarize with titration curves. We find that DNNs with backdoors are more sensitive to input noise and respond in a characteristic way that reveals the backdoor and where it leads (its `target'). Our empirical results demonstrate that we can accurately detect backdoors with high confidence orders-of-magnitude faster than existing approaches (seconds versus hours). △ Less

Submitted 3 February, 2021; v1 submitted 31 July, 2020; originally announced August 2020.

Comments: 9 pages, 7 figures, accepted to the SIAM International Conference on Data Mining (SDM 21)

arXiv:2002.10882 [pdf]

A Quantized Representation of Intertemporal Choice in the Brain

Authors: James Tee, Desmond P. Taylor

Abstract: Value [4][5] is typically modeled using a continuous representation (i.e., a Real number). A discrete representation of value has recently been postulated [6]. A quantized representation of probability in the brain was also posited and supported by experimental data [7]. Value and probability are inter-related via Prospect Theory [4][5]. In this paper, we hypothesize that intertemporal choices may… ▽ More Value [4][5] is typically modeled using a continuous representation (i.e., a Real number). A discrete representation of value has recently been postulated [6]. A quantized representation of probability in the brain was also posited and supported by experimental data [7]. Value and probability are inter-related via Prospect Theory [4][5]. In this paper, we hypothesize that intertemporal choices may also be quantized. For example, people may treat (or discount) 16 days indifferently to 17 days. To test this, we analyzed an intertemporal task by using 2 novel models: quantized hyperbolic discounting, and quantized exponential discounting. Our work here is a re-examination of the behavioral data previously collected for an fMRI study [8]. Both quantized hyperbolic and quantized exponential models were compared using AIC and BIC tests. We found that 13/20 participants were best fit to the quantized exponential model, while the remaining 7/20 were best fit to the quantized hyperbolic model. Overall, 15/20 participants were best fit to models with a 5-bit precision (i.e., 2^5 = 32 steps). In conclusion, regardless of hyperbolic or exponential, quantized versions of these models are better fit to the experimental data than their continuous forms. We finally outline some potential applications of our findings. △ Less

Submitted 15 September, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

Comments: 9 pages, 19 figures. arXiv admin note: substantial text overlap with arXiv:1805.01631

arXiv:1909.06442 [pdf, other]

Co-Attentive Cross-Modal Deep Learning for Medical Evidence Synthesis and Decision Making

Authors: Devin Taylor, Simeon Spasov, Pietro Liò

Abstract: Modern medicine requires generalised approaches to the synthesis and integration of multimodal data, often at different biological scales, that can be applied to a variety of evidence structures, such as complex disease analyses and epidemiological models. However, current methods are either slow and expensive, or ineffective due to the inability to model the complex relationships between data mod… ▽ More Modern medicine requires generalised approaches to the synthesis and integration of multimodal data, often at different biological scales, that can be applied to a variety of evidence structures, such as complex disease analyses and epidemiological models. However, current methods are either slow and expensive, or ineffective due to the inability to model the complex relationships between data modes which differ in scale and format. We address these issues by proposing a cross-modal deep learning architecture and co-attention mechanism to accurately model the relationships between the different data modes, while further reducing patient diagnosis time. Differentiating Parkinson's Disease (PD) patients from healthy patients forms the basis of the evaluation. The model outperforms the previous state-of-the-art unimodal analysis by 2.35%, while also being 53% more parameter efficient than the industry standard cross-modal model. Furthermore, the evaluation of the attention coefficients allows for qualitative insights to be obtained. Through the coupling with bioinformatics, a novel link between the interferon-gamma-mediated pathway, DNA methylation and PD was identified. We believe that our approach is general and could optimise the process of medical evidence synthesis and decision making in an actionable way. △ Less

Submitted 8 November, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

Comments: 7 pages, 2 figures, Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract, clarified graph and math notation, typos corrected

arXiv:1704.01053 [pdf, other]

Network-ensemble comparisons with stochastic rewiring and von Neumann entropy

Authors: Zichao Li, Peter J. Mucha, Dane Taylor

Abstract: Assessing whether a given network is typical or atypical for a random-network ensemble (i.e., network-ensemble comparison) has widespread applications ranging from null-model selection and hypothesis testing to clustering and classifying networks. We develop a framework for network-ensemble comparison by subjecting the network to stochastic rewiring. We study two rewiring processes, uniform and de… ▽ More Assessing whether a given network is typical or atypical for a random-network ensemble (i.e., network-ensemble comparison) has widespread applications ranging from null-model selection and hypothesis testing to clustering and classifying networks. We develop a framework for network-ensemble comparison by subjecting the network to stochastic rewiring. We study two rewiring processes, uniform and degree-preserved rewiring, which yield random-network ensembles that converge to the Erdos-Renyi and configuration-model ensembles, respectively. We study convergence through von Neumann entropy (VNE), a network summary statistic measuring information content based on the spectra of a Laplacian matrix, and develop a perturbation analysis for the expected effect of rewiring on VNE. Our analysis yields an estimate for how many rewires are required for a given network to resemble a typical network from an ensemble, offering a computationally efficient quantity for network-ensemble comparison that does not require simulation of the corresponding rewiring process. △ Less

Submitted 29 November, 2017; v1 submitted 4 April, 2017; originally announced April 2017.

Comments: 24 pages, 6 figures

arXiv:1612.08804 [pdf, other]

Ensemble-based estimates of eigenvector error for empirical covariance matrices

Authors: Dane Taylor, Juan G. Restrepo, Francois G. Meyer

Abstract: Covariance matrices are fundamental to the analysis and forecast of economic, physical and biological systems. Although the eigenvalues $\{λ_i\}$ and eigenvectors $\{{\bf u}_i\}$ of a covariance matrix are central to such endeavors, in practice one must inevitably approximate the covariance matrix based on data with finite sample size $n$ to obtain empirical eigenvalues $\{\tildeλ_i\}$ and eigenve… ▽ More Covariance matrices are fundamental to the analysis and forecast of economic, physical and biological systems. Although the eigenvalues $\{λ_i\}$ and eigenvectors $\{{\bf u}_i\}$ of a covariance matrix are central to such endeavors, in practice one must inevitably approximate the covariance matrix based on data with finite sample size $n$ to obtain empirical eigenvalues $\{\tildeλ_i\}$ and eigenvectors $\{\tilde{\bf u}_i\}$, and therefore understanding the error so introduced is of central importance. We analyze eigenvector error $\|{\bf u}_i - \tilde{\bf u}_i \|^2$ while leveraging the assumption that the true covariance matrix having size $p$ is drawn from a matrix ensemble with known spectral properties---particularly, we assume the distribution of population eigenvalues weakly converges as $p\to\infty$ to a spectral density $ρ(λ)$ and that the spacing between population eigenvalues is similar to that for the Gaussian orthogonal ensemble. Our approach complements previous analyses of eigenvector error that require the full set of eigenvalues to be known, which can be computationally infeasible when $p$ is large. To provide a scalable approach for uncertainty quantification of eigenvector error, we consider a fixed eigenvalue $λ$ and approximate the distribution of the expected square error $r= \mathbb{E}\left[\| {\bf u}_i - \tilde{\bf u}_i \|^2\right]$ across the matrix ensemble for all ${\bf u}_i$ associated with $λ_i=λ$. We find, for example, that for sufficiently large matrix size $p$ and sample size $n>p$, the probability density of $r$ scales as $1/nr^2$. This power-law scaling implies that eigenvector error is extremely heterogeneous---even if $r$ is very small for most eigenvectors, it can be large for others with non-negligible probability. We support this and further results with numerical experiments. △ Less

Submitted 28 February, 2018; v1 submitted 28 December, 2016; originally announced December 2016.

Comments: 24 pages, 8 figures

arXiv:1507.01826 [pdf, other]

doi 10.1109/TNSE.2016.2537545

Clustering Network Layers With the Strata Multilayer Stochastic Block Model

Authors: Natalie Stanley, Saray Shai, Dane Taylor, Peter J. Mucha

Abstract: Multilayer networks are a useful data structure for simultaneously capturing multiple types of relationships between a set of nodes. In such networks, each relational definition gives rise to a layer. While each layer provides its own set of information, community structure across layers can be collectively utilized to discover and quantify underlying relational patterns between nodes. To concisel… ▽ More Multilayer networks are a useful data structure for simultaneously capturing multiple types of relationships between a set of nodes. In such networks, each relational definition gives rise to a layer. While each layer provides its own set of information, community structure across layers can be collectively utilized to discover and quantify underlying relational patterns between nodes. To concisely extract information from a multilayer network, we propose to identify and combine sets of layers with meaningful similarities in community structure. In this paper, we describe the "strata multilayer stochastic block model'' (sMLSBM), a probabilistic model for multilayer community structure. The central extension of the model is that there exist groups of layers, called "strata'', which are defined such that all layers in a given stratum have community structure described by a common stochastic block model (SBM). That is, layers in a stratum exhibit similar node-to-community assignments and SBM probability parameters. Fitting the sMLSBM to a multilayer network provides a joint clustering that yields node-to-community and layer-to-stratum assignments, which cooperatively aid one another during inference. We describe an algorithm for separating layers into their appropriate strata and an inference technique for estimating the SBM parameters for each stratum. We demonstrate our method using synthetic networks and a multilayer network inferred from data collected in the Human Microbiome Project. △ Less

Submitted 9 October, 2015; v1 submitted 7 July, 2015; originally announced July 2015.

Showing 1–10 of 10 results for author: Taylor, D