-
Inference of Hierarchical Core-Periphery Structure in Temporal Networks
Authors:
Theodore Y. Faust,
Mason A. Porter
Abstract:
Networks can have various types of mesoscale structures. One type of mesoscale structure in networks is core-periphery structure, which consists of densely-connected core nodes and sparsely-connected peripheral nodes. The core nodes are connected densely to each other and can be connected to the peripheral nodes, which are connected sparsely to other nodes. There has been much research on core-per…
▽ More
Networks can have various types of mesoscale structures. One type of mesoscale structure in networks is core-periphery structure, which consists of densely-connected core nodes and sparsely-connected peripheral nodes. The core nodes are connected densely to each other and can be connected to the peripheral nodes, which are connected sparsely to other nodes. There has been much research on core-periphery structure in time-independent networks, but few core-periphery detection methods have been developed for time-dependent (i.e., ``temporal") networks. Using a multilayer-network representation of temporal networks and an inference approach that employs stochastic block models, we generalize a recent method for detecting hierarchical core-periphery structure \cite{Polanco23} from time-independent networks to temporal networks. In contrast to ``onion-like'' nested core-periphery structures (where each node is assigned to a group according to how deeply it is nested in a network's core), hierarchical core-periphery structures encompass networks with nested structures, tree-like structures (where any two groups must either be disjoint or have one as a strict subset of the other), and general non-nested mesoscale structures (where the group assignments of nodes do not have to be nested in any way). To perform statistical inference and thereby identify core-periphery structure, we use a Markov-chain Monte Carlo (MCMC) approach. We illustrate our method for detecting hierarchical core-periphery structure in two real-world temporal networks, and we briefly discuss the structures that we identify in these networks.
△ Less
Submitted 15 June, 2025; v1 submitted 11 June, 2025;
originally announced June 2025.
-
An AI-driven framework for the prediction of personalised health response to air pollution
Authors:
Nazanin Zounemat Kermani,
Sadjad Naderi,
Claire H. Dilliway,
Claire E. Heaney,
Shrreya Behll,
Boyang Chen,
Hisham Abubakar-Waziri,
Alexandra E. Porter,
Marc Chadeau-Hyam,
Fangxin Fang,
Ian M. Adcock,
Kian Fan Chung,
Christopher C. Pain
Abstract:
Air pollution poses a significant threat to public health, causing or exacerbating many respiratory and cardiovascular diseases. In addition, climate change is bringing about more extreme weather events such as wildfires and heatwaves, which can increase levels of pollution and worsen the effects of pollution exposure. Recent advances in personal sensing have transformed the collection of behaviou…
▽ More
Air pollution poses a significant threat to public health, causing or exacerbating many respiratory and cardiovascular diseases. In addition, climate change is bringing about more extreme weather events such as wildfires and heatwaves, which can increase levels of pollution and worsen the effects of pollution exposure. Recent advances in personal sensing have transformed the collection of behavioural and physiological data, leading to the potential for new improvements in healthcare. We wish to capitalise on this data, alongside new capabilities in AI for making time series predictions, in order to monitor and predict health outcomes for an individual. Thus, we present a novel workflow for predicting personalised health responses to pollution by integrating physiological data from wearable fitness devices with real-time environmental exposures. The data is collected from various sources in a secure and ethical manner, and is used to train an AI model to predict individual health responses to pollution exposure within a cloud-based, modular framework. We demonstrate that the AI model -- an Adversarial Autoencoder neural network in this case -- accurately reconstructs time-dependent health signals and captures nonlinear responses to pollution. Transfer learning is applied using data from a personal smartwatch, which increases the generalisation abilities of the AI model and illustrates the adaptability of the approach to real-world, user-generated data.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
A Social Dynamical System for Twitter Analysis
Authors:
Zhiping Xiao,
Xinyu Wang,
Yifang Qin,
Zijie Huang,
Mason A. Porter,
Yizhou Sun
Abstract:
Understanding the evolution of public opinion is crucial for informed decision-making in various domains, particularly public affairs. The rapid growth of social networks, such as Twitter (now rebranded as X), provides an unprecedented opportunity to analyze public opinion at scale without relying on traditional surveys. With the rise of deep learning, Graph Neural Networks (GNNs) have shown great…
▽ More
Understanding the evolution of public opinion is crucial for informed decision-making in various domains, particularly public affairs. The rapid growth of social networks, such as Twitter (now rebranded as X), provides an unprecedented opportunity to analyze public opinion at scale without relying on traditional surveys. With the rise of deep learning, Graph Neural Networks (GNNs) have shown great promise in modeling online opinion dynamics. Notably, classical opinion dynamics models, such as DeGroot, can be reformulated within a GNN framework.
We introduce Latent Social Dynamical System (LSDS), a novel framework for modeling the latent dynamics of social media users' opinions based on textual content. Since expressed opinions may not fully reflect underlying beliefs, LSDS first encodes post content into latent representations. It then leverages a GraphODE framework, using a GNN-based ODE function to predict future opinions. A decoder subsequently utilizes these predicted latent opinions to perform downstream tasks, such as interaction prediction, which serve as benchmarks for model evaluation. Our framework is highly flexible, supporting various opinion dynamic models as ODE functions, provided they can be adapted into a GNN-based form. It also accommodates different encoder architectures and is compatible with diverse downstream tasks.
To validate our approach, we constructed dynamic datasets from Twitter data. Experimental results demonstrate the effectiveness of LSDS, highlighting its potential for future applications. We plan to publicly release our dataset and code upon the publication of this paper.
△ Less
Submitted 27 March, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
An Approximate-Master-Equation Formulation of the Watts Threshold Model on Hypergraphs
Authors:
Leah A. Keating,
Kwang-Il Goh,
Mason A. Porter
Abstract:
In traditional models of behavioral or opinion dynamics on social networks, researchers suppose that all interactions occur between pairs of individuals. However, in reality, social interactions also occur in groups of three or more individuals. A common way to incorporate such polyadic interactions is to study dynamical processes on hypergraphs. In a hypergraph, interactions can occur between any…
▽ More
In traditional models of behavioral or opinion dynamics on social networks, researchers suppose that all interactions occur between pairs of individuals. However, in reality, social interactions also occur in groups of three or more individuals. A common way to incorporate such polyadic interactions is to study dynamical processes on hypergraphs. In a hypergraph, interactions can occur between any number of the individuals in a network. The Watts threshold model (WTM) is a well-known model of a simplistic social spreading process. Very recently, Chen et al. extended the WTM from dyadic networks (i.e., graphs) to polyadic networks (i.e., hypergraphs). In the present paper, we extend their discrete-time model to continuous time using approximate master equations (AMEs). By using AMEs, we are able to model the system with very high accuracy. We then reduce the high-dimensional AME system to a system of three coupled differential equations without any detectable loss of accuracy. This much lower-dimensional system is more computationally efficient to solve numerically and is also easier to interpret. We linearize the reduced AME system and calculate a cascade condition, which allows us to determine when a large spreading event occurs. We then apply our model to a social contact network of a French primary school and to a hypergraph of computer-science coauthorships. We find that the AME system is accurate in modelling the polyadic WTM on these empirical networks; however, we expect that future work that incorporates structural correlations between nearby nodes and groups into the model for the dynamics will lead to more accurate theory for real-world networks.
△ Less
Submitted 6 May, 2025; v1 submitted 5 March, 2025;
originally announced March 2025.
-
Clustering-induced localization of quantum walks on networks
Authors:
Lucas Böttcher,
Mason A. Porter
Abstract:
Quantum walks on networks are a paradigmatic model in quantum information theory. Quantum-walk algorithms have been developed for various applications, including spatial-search problems, element-distinctness problems, and node centrality analysis. Unlike their classical counterparts, the evolution of quantum walks is unitary, so they do not converge to a stationary distribution. However, for many…
▽ More
Quantum walks on networks are a paradigmatic model in quantum information theory. Quantum-walk algorithms have been developed for various applications, including spatial-search problems, element-distinctness problems, and node centrality analysis. Unlike their classical counterparts, the evolution of quantum walks is unitary, so they do not converge to a stationary distribution. However, for many applications, it is important to understand the long-time behavior of quantum walks and the impact of network structure on their evolution. In the present paper, we study the localization of quantum walks on networks. We demonstrate how localization emerges in highly clustered networks that we construct by recursively attaching triangles, and we derive an analytical expression for the long-time inverse participation ratio that depends on products of eigenvectors of the quantum-walk Hamiltonian. Building on the insights from this example, we then show that localization also occurs in Kleinberg navigable small-world networks and Holme--Kim power-law cluster networks. Our results illustrate that local clustering, which is a key structural feature of networks, can induce localization of quantum walks.
△ Less
Submitted 7 June, 2025; v1 submitted 5 December, 2024;
originally announced December 2024.
-
Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization
Authors:
Rohan Reddy Mekala,
Frederik Pahde,
Simon Baur,
Sneha Chandrashekar,
Madeline Diep,
Markus Wenzel,
Eric L. Wisotzky,
Galip Ümit Yolcu,
Sebastian Lapuschkin,
Jackie Ma,
Peter Eisert,
Mikael Lindvall,
Adam Porter,
Wojciech Samek
Abstract:
In the realm of dermatological diagnoses, where the analysis of dermatoscopic and microscopic skin lesion images is pivotal for the accurate and early detection of various medical conditions, the costs associated with creating diverse and high-quality annotated datasets have hampered the accuracy and generalizability of machine learning models. We propose an innovative unsupervised augmentation so…
▽ More
In the realm of dermatological diagnoses, where the analysis of dermatoscopic and microscopic skin lesion images is pivotal for the accurate and early detection of various medical conditions, the costs associated with creating diverse and high-quality annotated datasets have hampered the accuracy and generalizability of machine learning models. We propose an innovative unsupervised augmentation solution that harnesses Generative Adversarial Network (GAN) based models and associated techniques over their latent space to generate controlled semiautomatically-discovered semantic variations in dermatoscopic images. We created synthetic images to incorporate the semantic variations and augmented the training data with these images. With this approach, we were able to increase the performance of machine learning models and set a new benchmark amongst non-ensemble based models in skin lesion classification on the HAM10000 dataset; and used the observed analytics and generated models for detailed studies on model explainability, affirming the effectiveness of our solution.
△ Less
Submitted 7 October, 2024;
originally announced October 2024.
-
Bounded-confidence opinion models with random-time interactions
Authors:
Weiqi Chu,
Mason A Porter
Abstract:
In models of opinion dynamics, the opinions of individual agents evolve with time. One type of opinion model is a bounded-confidence model (BCM), in which opinions take continuous values and interacting agents compromise their opinions with each other if those opinions are sufficiently similar. In studies of BCMs, it is typically assumed that interactions between agents occur at deterministic time…
▽ More
In models of opinion dynamics, the opinions of individual agents evolve with time. One type of opinion model is a bounded-confidence model (BCM), in which opinions take continuous values and interacting agents compromise their opinions with each other if those opinions are sufficiently similar. In studies of BCMs, it is typically assumed that interactions between agents occur at deterministic times. This assumption neglects an inherent element of randomness in social systems. In this paper, we study BCMs on networks and allow agents to interact at random times. To incorporate random-time interactions, we use renewal processes to determine social interactions, which can follow arbitrary waiting-time distributions (WTDs). We establish connections between these random-time-interaction BCMs and deterministic-time-interaction BCMs. We find that BCMs with Markovian WTDs have consistent statistical properties on different networks but that the statistical properties of BCMs with non-Markovian WTDs depend on network structure.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
Structural Robustness and Vulnerability of Networks
Authors:
Alice C. Schwarze,
Jessica Jiang,
Jonny Wray,
Mason A. Porter
Abstract:
Networks are useful descriptions of the structure of many complex systems. Unsurprisingly, it is thus important to analyze the robustness of networks in many scientific disciplines. In applications in communication, logistics, finance, ecology, biomedicine, and many other fields, researchers have studied the robustness of networks to the removal of nodes, edges, or other subnetworks to identify an…
▽ More
Networks are useful descriptions of the structure of many complex systems. Unsurprisingly, it is thus important to analyze the robustness of networks in many scientific disciplines. In applications in communication, logistics, finance, ecology, biomedicine, and many other fields, researchers have studied the robustness of networks to the removal of nodes, edges, or other subnetworks to identify and characterize robust network structures. A major challenge in the study of network robustness is that researchers have reported that different and seemingly contradictory network properties are correlated with a network's robustness. Using a framework by Alderson and Doyle~\cite{Alderson2010}, we categorize several notions of network robustness and we examine these ostensible contradictions. We survey studies of network robustness with a focus on (1)~identifying robustness specifications in common use, (2)~understanding when these specifications are appropriate, and (3)~understanding the conditions under which one can expect different notions of robustness to yield similar results. With this review, we aim to give researchers an overview of the large, interdisciplinary body of work on network robustness and develop practical guidance for the design of computational experiments to study a network's robustness.
△ Less
Submitted 10 September, 2024;
originally announced September 2024.
-
Oscillatory and Excitable Dynamics in an Opinion Model with Group Opinions
Authors:
Corbit R. Sampson,
Juan G. Restrepo,
Mason A. Porter
Abstract:
In traditional models of opinion dynamics, each agent in a network has an opinion and changes in opinions arise from pairwise (i.e., dyadic) interactions between agents. However, in many situations, groups of individuals possess a collective opinion that can differ from the opinions of its constituent individuals. In this paper, we study the effects of group opinions on opinion dynamics. We formul…
▽ More
In traditional models of opinion dynamics, each agent in a network has an opinion and changes in opinions arise from pairwise (i.e., dyadic) interactions between agents. However, in many situations, groups of individuals possess a collective opinion that can differ from the opinions of its constituent individuals. In this paper, we study the effects of group opinions on opinion dynamics. We formulate a hypergraph model in which both individual agents and groups of 3 agents have opinions, and we examine how opinions evolve through both dyadic interactions and group memberships. In some parameter regimes, we find that the presence of group opinions can lead to oscillatory and excitable opinion dynamics. In the oscillatory regime, the mean opinion of the agents in a network has self-sustained oscillations. In the excitable regime, finite-size effects create large but short-lived opinion swings (as in social fads). We develop a mean-field approximation of our model and obtain good agreement with direct numerical simulations. We also show -- both numerically and via our mean-field description -- that oscillatory dynamics occur only when the number of dyadic and polyadic interactions per agent are not completely correlated. Our results illustrate how polyadic structures, such as groups of agents, can have important effects on collective opinion dynamics.
△ Less
Submitted 11 April, 2025; v1 submitted 23 August, 2024;
originally announced August 2024.
-
Recomposition: A New Technique for Efficient Compositional Verification
Authors:
Ian Dardik,
April Porter,
Eunsuk Kang
Abstract:
Compositional verification algorithms are well-studied in the context of model checking. Properly selecting components for verification is important for efficiency, yet has received comparatively less attention. In this paper, we address this gap with a novel compositional verification framework that focuses on component selection as an explicit, first-class concept. The framework decomposes a sys…
▽ More
Compositional verification algorithms are well-studied in the context of model checking. Properly selecting components for verification is important for efficiency, yet has received comparatively less attention. In this paper, we address this gap with a novel compositional verification framework that focuses on component selection as an explicit, first-class concept. The framework decomposes a system into components, which we then recompose into new components for efficient verification. At the heart of our technique is the recomposition map that determines how recomposition is performed; the component selection problem thus reduces to finding a good recomposition map. However, the space of possible recomposition maps can be large. We therefore propose heuristics to find a small portfolio of recomposition maps, which we then run in parallel. We implemented our techniques in a model checker for the TLA+ language. In our experiments, we show that our tool achieves competitive performance with TLC-a well-known model checker for TLA+-on a benchmark suite of distributed protocols.
△ Less
Submitted 15 August, 2024; v1 submitted 6 August, 2024;
originally announced August 2024.
-
Ginzburg--Landau Functionals in the Large-Graph Limit
Authors:
Edith Zhang,
James Scott,
Qiang Du,
Mason A. Porter
Abstract:
Ginzburg--Landau (GL) functionals on graphs, which are relaxations of graph-cut functionals on graphs, have yielded a variety of insights in image segmentation and graph clustering. In this paper, we study large-graph limits of GL functionals by taking a functional-analytic view of graphs as nonlocal kernels. For a graph $W_n$ with $n$ nodes, the corresponding graph GL functional $\GL^{W_n}_\ep$ i…
▽ More
Ginzburg--Landau (GL) functionals on graphs, which are relaxations of graph-cut functionals on graphs, have yielded a variety of insights in image segmentation and graph clustering. In this paper, we study large-graph limits of GL functionals by taking a functional-analytic view of graphs as nonlocal kernels. For a graph $W_n$ with $n$ nodes, the corresponding graph GL functional $\GL^{W_n}_\ep$ is an energy for functions on $W_n$. We minimize GL functionals on sequences of growing graphs that converge to functions called graphons. For such sequences of graphs, we show that the graph GL functional $Γ$-converges to a continuous and nonlocal functional that we call the \emph{graphon GL functional}. We also investigate the sharp-interface limits of the graph GL and graphon GL functionals, and we relate these limits to a nonlocal total variation. We express the limiting GL functional in terms of Young measures and thereby obtain a probabilistic interpretation of the variational problem in the large-graph limit. Finally, to develop intuition about the graphon GL functional, we compute the GL minimizer for several example families of graphons.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Competition between group interactions and nonlinearity in voter dynamics on hypergraphs
Authors:
Jihye Kim,
Deok-Sun Lee,
Byungjoon Min,
Mason A. Porter,
Maxi San Miguel,
K. -I. Goh
Abstract:
Social dynamics are often driven by both pairwise (i.e., dyadic) relationships and higher-order (i.e., polyadic) group relationships, which one can describe using hypergraphs. To gain insight into the impact of polyadic relationships on dynamical processes on networks, we formulate and study a polyadic voter process, which we call the group-driven voter model (GVM), that incorporates the effect of…
▽ More
Social dynamics are often driven by both pairwise (i.e., dyadic) relationships and higher-order (i.e., polyadic) group relationships, which one can describe using hypergraphs. To gain insight into the impact of polyadic relationships on dynamical processes on networks, we formulate and study a polyadic voter process, which we call the group-driven voter model (GVM), that incorporates the effect of group interactions by nonlinear interactions that are subject to a group (i.e., hyperedge) constraint. By examining the competition between nonlinearity and group sizes, we show that the GVM achieves consensus faster than standard voter-model dynamics, with an optimal minimizing exit time. We substantiate this finding by using mean-field theory on annealed uniform hypergraphs with $N$ nodes, for which the exit time scales as ${\cal A}\ln N$, where the prefactor ${\cal A}$ depends both on the nonlinearity and on group-constraint factors. Our results reveal how competition between group interactions and nonlinearity shapes GVM dynamics. We thereby highlight the importance of such competing effects in complex systems with polyadic interactions.
△ Less
Submitted 29 November, 2024; v1 submitted 15 July, 2024;
originally announced July 2024.
-
A Network-Based Measure of Cosponsorship Influence on Bill Passing in the United States House of Representatives
Authors:
Sarah Sotoudeh,
Mason A. Porter,
Sanjukta Krishnagopal
Abstract:
Each year, the United States Congress considers {thousands of legislative proposals to select bills} to present to the US President to sign into law. Naturally, the decision processes of members of Congress are subject to peer influence. In this paper, we examine the effect on bill passage of accrued influence between US Congress members in the US House of Representatives. We explore how the influ…
▽ More
Each year, the United States Congress considers {thousands of legislative proposals to select bills} to present to the US President to sign into law. Naturally, the decision processes of members of Congress are subject to peer influence. In this paper, we examine the effect on bill passage of accrued influence between US Congress members in the US House of Representatives. We explore how the influence of a bill's cosponsors affects the bill's outcome (specifically, whether or not it passes in the House). We define a notion of influence by analyzing the structure of a network that we construct {using} cosponsorship dynamics. We award `influence' between a pair of Congress members when they cosponsor a bill that achieves some amount of legislative success. We find that properties of the bill cosponsorship network can be a useful signal to examine influence in Congress; they help explain why some bills pass and others fail. We compare our measure of influence to off-the-shelf centrality measures and conclude that our influence measure is more indicative of bill passage.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
A Weighted-Median Model of Opinion Dynamics on Networks
Authors:
Lasse Mohr,
Poul G. Hjorth,
Mason A. Porter
Abstract:
Social interactions influence people's opinions. In some situations, these interactions result in a consensus opinion; in others, they result in opinion fragmentation and the formation of different opinion groups in the form of "echo chambers". Consider a social network of individuals, who hold continuous-valued scalar opinions and change their opinions when they interact with each other. In such…
▽ More
Social interactions influence people's opinions. In some situations, these interactions result in a consensus opinion; in others, they result in opinion fragmentation and the formation of different opinion groups in the form of "echo chambers". Consider a social network of individuals, who hold continuous-valued scalar opinions and change their opinions when they interact with each other. In such an opinion model, it is common for an opinion-update rule to depend on the mean opinion of interacting individuals. However, we consider an alternative update rule - which may be more realistic in some situations - that instead depends on a weighted median opinion of interacting individuals. Through numerical simulations of our opinion model, we investigate how the limit opinion distribution depends on network structure. For configuration-model networks, we also derive a mean-field approximation for the asymptotic dynamics of the opinion distribution when there are infinitely many individuals in a network.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
AI-Guided Feature Segmentation Techniques to Model Features from Single Crystal Diamond Growth
Authors:
Rohan Reddy Mekala,
Elias Garratt,
Matthias Muehle,
Arjun Srinivasan,
Adam Porter,
Mikael Lindvall
Abstract:
Process refinement to consistently produce high-quality material over a large area of the grown crystal, enabling various applications from optics crystals to quantum detectors, has long been a goal for diamond growth. Machine learning offers a promising path toward this goal, but faces challenges such as the complexity of features within datasets, their time-dependency, and the volume of data pro…
▽ More
Process refinement to consistently produce high-quality material over a large area of the grown crystal, enabling various applications from optics crystals to quantum detectors, has long been a goal for diamond growth. Machine learning offers a promising path toward this goal, but faces challenges such as the complexity of features within datasets, their time-dependency, and the volume of data produced per growth run. Accurate spatial feature extraction from image to image for real-time monitoring of diamond growth is crucial yet complicated due to the low-volume and high feature complexity nature of the datasets. This paper compares various traditional and machine learning-driven approaches for feature extraction in the diamond growth domain, proposing a novel deep learning-driven semantic segmentation approach to isolate and classify accurate pixel masks of geometric features like diamond, pocket holder, and background, along with their derivative features based on shape and size. Using an annotation-focused human-in-the-loop software architecture for training datasets, with modules for selective data labeling using active learning, data augmentations, and model-assisted labeling, our approach achieves effective annotation accuracy and drastically reduces labeling time and cost. Deep learning algorithms prove highly efficient in accurately learning complex representations from datasets with many features. Our top-performing model, based on the DeeplabV3plus architecture, achieves outstanding accuracy in classifying features of interest, with accuracies of 96.31% for pocket holder, 98.60% for diamond top, and 91.64% for diamond side features.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
AI-Guided Defect Detection Techniques to Model Single Crystal Diamond Growth
Authors:
Rohan Reddy Mekala,
Elias Garratt,
Matthias Muehle,
Arjun Srinivasan,
Adam Porter,
Mikael Lindvall
Abstract:
From a process development perspective, diamond growth via chemical vapor deposition has made significant strides. However, challenges persist in achieving high quality and large-area material production. These difficulties include controlling conditions to maintain uniform growth rates for the entire growth surface. As growth progresses, various factors or defect states emerge, altering the unifo…
▽ More
From a process development perspective, diamond growth via chemical vapor deposition has made significant strides. However, challenges persist in achieving high quality and large-area material production. These difficulties include controlling conditions to maintain uniform growth rates for the entire growth surface. As growth progresses, various factors or defect states emerge, altering the uniform conditions. These changes affect the growth rate and result in the formation of crystalline defects at the microscale. However, there is a distinct lack of methods to identify these defect states and their geometry using images taken during the growth process. This paper details seminal work on defect segmentation pipeline using in-situ optical images to identify features that indicate defective states that are visible at the macroscale. Using a semantic segmentation approach as applied in our previous work, these defect states and corresponding derivative features are isolated and classified by their pixel masks. Using an annotation focused human-in-the-loop software architecture to produce training datasets, with modules for selective data labeling using active learning, data augmentations, and model-assisted labeling, our approach achieves effective annotation accuracy and drastically reduces the time and cost of labeling by orders of magnitude. On the model development front, we found that deep learning-based algorithms are the most efficient. They can accurately learn complex representations from feature-rich datasets. Our best-performing model, based on the YOLOV3 and DeeplabV3plus architectures, achieved excellent accuracy for specific features of interest. Specifically, it reached 93.35% accuracy for center defects, 92.83% for polycrystalline defects, and 91.98% for edge defects.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Dynamical importance and network perturbations
Authors:
Ethan Young,
Mason A. Porter
Abstract:
The leading eigenvalue $λ$ of the adjacency matrix of a graph exerts much influence on the behavior of dynamical processes on that graph. It is thus relevant to relate notions of the importance (specifically, centrality measures) of network structures to $λ$ and its associated eigenvector. We study a previously derived measure of edge importance known as ``dynamical importance'', which estimates h…
▽ More
The leading eigenvalue $λ$ of the adjacency matrix of a graph exerts much influence on the behavior of dynamical processes on that graph. It is thus relevant to relate notions of the importance (specifically, centrality measures) of network structures to $λ$ and its associated eigenvector. We study a previously derived measure of edge importance known as ``dynamical importance'', which estimates how much $λ$ changes when one removes an edge from a graph or adds an edge to it. We examine the accuracy of this estimate for different network structures and compare it to the true change in $λ$ after an edge removal or edge addition. We then derive a first-order approximation of the change in the leading eigenvector. We also consider the effects of edge additions on Kuramoto dynamics on networks, and we express the Kuramoto order parameter in terms of dynamical importance. Through our analysis and computational experiments, we find that studying dynamical importance can improve understanding of the relationship between network perturbations and dynamical processes on networks.
△ Less
Submitted 21 August, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
An "Opinion Reproduction Number" for Infodemics in a Bounded-Confidence Content-Spreading Process on Networks
Authors:
Heather Z. Brooks,
Mason A. Porter
Abstract:
We study the spreading dynamics of content on networks. To do this, we use a model in which content spreads through a bounded-confidence mechanism. In a bounded-confidence model (BCM) of opinion dynamics, the agents of a network have continuous-valued opinions, which they adjust when they interact with agents whose opinions are sufficiently close to theirs. The employed content-spreading model int…
▽ More
We study the spreading dynamics of content on networks. To do this, we use a model in which content spreads through a bounded-confidence mechanism. In a bounded-confidence model (BCM) of opinion dynamics, the agents of a network have continuous-valued opinions, which they adjust when they interact with agents whose opinions are sufficiently close to theirs. The employed content-spreading model introduces a twist into BCMs by using bounded confidence for the content spread itself. We define an analogue of the basic reproduction number from disease dynamics that we call an \emph{opinion reproduction number}. A critical value of the opinion reproduction number indicates whether or not there is an ``infodemic'' (i.e., a large content-spreading cascade) of content that reflects a particular opinion. By determining this critical value, one can determine whether or not an opinion dies off or propagates widely as a cascade in a population of agents. Using configuration-model networks, we quantify the size and shape of content dissemination by calculating a variety of summary statistics, and we illustrate how network structure and spreading-model parameters affect these statistics. We find that content spreads most widely when the agents have a large expected mean degree or a large receptiveness to content. When the spreading process slightly exceeds the infodemic threshold, there can be longer dissemination trees than when the expected mean degree or receptiveness are larger, even though the total number of content shares is smaller.
△ Less
Submitted 14 November, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Bounded-Confidence Models of Opinion Dynamics with Neighborhood Effects
Authors:
Sanjukta Krishnagopal,
Mason A. Porter
Abstract:
We generalize bounded-confidence models (BCMs) of opinion dynamics by incorporating neighborhood effects. In a BCM, interacting agents influence each other through dyadic influence if their opinions are sufficiently similar to each other. In our "neighborhood BCMs" (NBCMs), interacting agents are influenced both by each other's opinions and by the opinions of the agents in each other's neighborhoo…
▽ More
We generalize bounded-confidence models (BCMs) of opinion dynamics by incorporating neighborhood effects. In a BCM, interacting agents influence each other through dyadic influence if their opinions are sufficiently similar to each other. In our "neighborhood BCMs" (NBCMs), interacting agents are influenced both by each other's opinions and by the opinions of the agents in each other's neighborhoods. Our NBCMs thus include both the usual dyadic influence between agents and a "transitive influence", which encodes the influence of an agent's neighbors, when determining whether or not an interaction changes the opinions of agents. In this transitive influence, an individual's opinion is influenced by a neighbor when, on average, the opinions of the neighbor's neighbors are sufficiently similar to its own opinion. We formulate both neighborhood Deffuant--Weisbuch (NDW) and neighborhood Hegselmann--Krause (NHK) BCMs.
We build further on our NBCMs by introducing a neighborhood-based network adaptation in which a network coevolves with agent opinions by changing its structure through "transitive homophily". In this network evolution, an agent breaks a tie to one of its neighbors and then rewires that tie to a new agent, with a preference for agents with a mean neighbor opinion that is closer to its own opinion. Using numerical simulations on a variety of types of networks, we explore how the qualitative opinion dynamics and network properties of our adaptive NDW model change as we adjust the relative proportions of dyadic and transitive influence. In our numerical experiments, we find that incorporating neighborhood effects into the opinion dynamics and the network-adaptation rewiring strategy tends to reduce the spectral gap and degree assortativity of networks.
(This is a shortened version of the paper's abstract.)
△ Less
Submitted 22 May, 2025; v1 submitted 7 February, 2024;
originally announced February 2024.
-
The damage number of the Cartesian product of graphs
Authors:
Melissa A. Huggan,
Margaret-Ellen Messinger,
Amanda Porter
Abstract:
We consider a variation of Cops and Robber, introduced in [D. Cox and A. Sanaei, The damage number of a graph, [Aust. J. of Comb. 75(1) (2019) 1-16] where vertices visited by a robber are considered damaged and a single cop aims to minimize the number of distinct vertices damaged by a robber. Motivated by the interesting relationships that often emerge between input graphs and their Cartesian prod…
▽ More
We consider a variation of Cops and Robber, introduced in [D. Cox and A. Sanaei, The damage number of a graph, [Aust. J. of Comb. 75(1) (2019) 1-16] where vertices visited by a robber are considered damaged and a single cop aims to minimize the number of distinct vertices damaged by a robber. Motivated by the interesting relationships that often emerge between input graphs and their Cartesian product, we study the damage number of the Cartesian product of graphs. We provide a general upper bound and consider the damage number of the product of two trees or cycles. We also consider graphs with small damage number.
△ Less
Submitted 18 December, 2024; v1 submitted 18 August, 2023;
originally announced August 2023.
-
Using mathematics to study how people influence each other's opinions
Authors:
Grace J. Li,
Jiajie Luo,
Kaiyan Peng,
Mason A. Porter
Abstract:
People sometimes change their opinions when they discuss things with other people. Researchers can use mathematics to study opinion changes in simplifications of real-life situations. These simplified settings, which are examples of mathematical models, help researchers explore how people influence each other through their social interactions. In today's digital world, these models can help us lea…
▽ More
People sometimes change their opinions when they discuss things with other people. Researchers can use mathematics to study opinion changes in simplifications of real-life situations. These simplified settings, which are examples of mathematical models, help researchers explore how people influence each other through their social interactions. In today's digital world, these models can help us learn how to promote the spread of accurate information and reduce the spread of inaccurate information. In this article, we discuss a simple mathematical model of opinion changes that arise from social interactions. We briefly describe what such opinion models can tell us and how researchers try to make them more realistic.
△ Less
Submitted 5 August, 2024; v1 submitted 4 July, 2023;
originally announced July 2023.
-
MiraBest: A Dataset of Morphologically Classified Radio Galaxies for Machine Learning
Authors:
Fiona A. M. Porter,
Anna M. M. Scaife
Abstract:
The volume of data from current and future observatories has motivated the increased development and application of automated machine learning methodologies for astronomy. However, less attention has been given to the production of standardised datasets for assessing the performance of different machine learning algorithms within astronomy and astrophysics. Here we describe in detail the MiraBest…
▽ More
The volume of data from current and future observatories has motivated the increased development and application of automated machine learning methodologies for astronomy. However, less attention has been given to the production of standardised datasets for assessing the performance of different machine learning algorithms within astronomy and astrophysics. Here we describe in detail the MiraBest dataset, a publicly available batched dataset of 1256 radio-loud AGN from NVSS and FIRST, filtered to $0.03 < z < 0.1$, manually labelled by Miraghaei and Best (2017) according to the Fanaroff-Riley morphological classification, created for machine learning applications and compatible for use with standard deep learning libraries. We outline the principles underlying the construction of the dataset, the sample selection and pre-processing methodology, dataset structure and composition, as well as a comparison of MiraBest to other datasets used in the literature. Existing applications that utilise the MiraBest dataset are reviewed, and an extended dataset of 2100 sources is created by cross-matching MiraBest with other catalogues of radio-loud AGN that have been used more widely in the literature for machine learning applications.
△ Less
Submitted 18 May, 2023;
originally announced May 2023.
-
Perceived community alignment increases information sharing
Authors:
Elisa C. Baek,
Ryan Hyon,
Karina López,
Mason A. Porter,
Carolyn Parkinson
Abstract:
It has been proposed that information sharing, which is a ubiquitous and consequential behavior, plays a critical role in cultivating and maintaining a sense of shared reality. Across three studies, we tested this theory by investigating whether or not people are especially likely to share information that they believe will be interpreted similarly by others in their social circles. Using neuroima…
▽ More
It has been proposed that information sharing, which is a ubiquitous and consequential behavior, plays a critical role in cultivating and maintaining a sense of shared reality. Across three studies, we tested this theory by investigating whether or not people are especially likely to share information that they believe will be interpreted similarly by others in their social circles. Using neuroimaging while members of the same community viewed brief film clips, we found that more similar neural responding of participants was associated with a greater likelihood to share content. We then tested this relationship using two behavioral studies and found (1) that people were particularly likely to share content that they believed others in their social circles would interpret similarly and (2) that perceived similarity with others leads to increased sharing likelihood. In concert, our findings support the idea that people are driven to share information to create and reinforce shared understanding, which is critical to social connection.
△ Less
Submitted 23 December, 2024; v1 submitted 26 April, 2023;
originally announced April 2023.
-
Bounded-Confidence Models of Opinion Dynamics with Adaptive Confidence Bounds
Authors:
Grace J. Li,
Jiajie Luo,
Mason A. Porter
Abstract:
People's opinions change with time as they interact with each other. In a bounded-confidence model (BCM) of opinion dynamics, individuals (which are represented by the nodes of a network) have continuous-valued opinions and are influenced by neighboring nodes whose opinions are sufficiently similar to theirs (i.e., are within a confidence bound). In this paper, we formulate and analyze discrete-ti…
▽ More
People's opinions change with time as they interact with each other. In a bounded-confidence model (BCM) of opinion dynamics, individuals (which are represented by the nodes of a network) have continuous-valued opinions and are influenced by neighboring nodes whose opinions are sufficiently similar to theirs (i.e., are within a confidence bound). In this paper, we formulate and analyze discrete-time BCMs with heterogeneous and adaptive confidence bounds. We introduce two new models: (1) a BCM with synchronous opinion updates that generalizes the Hegselmann--Krause (HK) model and (2) a BCM with asynchronous opinion updates that generalizes the Deffuant--Weisbuch (DW) model. We analytically and numerically explore our adaptive BCMs' limiting behaviors, including the confidence-bound dynamics, the formation of clusters of nodes with similar opinions, and the time evolution of an "effective graph", which is a time-dependent subgraph of a network with edges between nodes that {are currently receptive to each other.} For a variety of networks and a wide range of values of the parameters that control the increase and decrease of confidence bounds, we demonstrate numerically that our adaptive BCMs result in fewer major opinion clusters and longer convergence times than the baseline (i.e., nonadaptive) BCMs. We also show that our adaptive BCMs can have adjacent nodes that converge to the same opinion but are not {receptive to each other.} This qualitative behavior does not occur in the associated baseline BCMs.
△ Less
Submitted 27 July, 2024; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Inference of interaction kernels in mean-field models of opinion dynamics
Authors:
Weiqi Chu,
Qin Li,
Mason A. Porter
Abstract:
In models of opinion dynamics, many parameters -- either in the form of constants or in the form of functions -- play a critical role in describing, calibrating, and forecasting how opinions change with time. When examining a model of opinion dynamics, it is beneficial to infer its parameters using empirical data. In this paper, we study an example of such an inference problem. We consider a mean-…
▽ More
In models of opinion dynamics, many parameters -- either in the form of constants or in the form of functions -- play a critical role in describing, calibrating, and forecasting how opinions change with time. When examining a model of opinion dynamics, it is beneficial to infer its parameters using empirical data. In this paper, we study an example of such an inference problem. We consider a mean-field bounded-confidence model with an unknown interaction kernel between individuals. This interaction kernel encodes how individuals with different opinions interact and affect each other's opinions. Because it is often difficult to quantitatively measure opinions as empirical data from observations or experiments, we assume that the available data takes the form of partial observations of a cumulative distribution function of opinions. We prove that certain measurements guarantee a precise and unique inference of the interaction kernel and propose a numerical method to reconstruct an interaction kernel from a limited number of data points. Our numerical results suggest that the error of the inferred interaction kernel decays exponentially as we strategically enlarge the data set.
△ Less
Submitted 26 October, 2023; v1 submitted 29 December, 2022;
originally announced December 2022.
-
Complex networks with complex weights
Authors:
Lucas Böttcher,
Mason A. Porter
Abstract:
In many studies, it is common to use binary (i.e., unweighted) edges to examine networks of entities that are either adjacent or not adjacent. Researchers have generalized such binary networks to incorporate edge weights, which allow one to encode node--node interactions with heterogeneous intensities or frequencies (e.g., in transportation networks, supply chains, and social networks). Most such…
▽ More
In many studies, it is common to use binary (i.e., unweighted) edges to examine networks of entities that are either adjacent or not adjacent. Researchers have generalized such binary networks to incorporate edge weights, which allow one to encode node--node interactions with heterogeneous intensities or frequencies (e.g., in transportation networks, supply chains, and social networks). Most such studies have considered real-valued weights, despite the fact that networks with complex weights arise in fields as diverse as quantum information, quantum chemistry, electrodynamics, rheology, and machine learning. Many of the standard network-science approaches in the study of classical systems rely on the real-valued nature of edge weights, so it is necessary to generalize them if one seeks to use them to analyze networks with complex edge weights. In this paper, we examine how standard network-analysis methods fail to capture structural features of networks with complex edge weights. We then generalize several network measures to the complex domain and show that random-walk centralities provide a useful approach to examine node importances in networks with complex weights.
△ Less
Submitted 25 July, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier
Authors:
Alan F. Karr,
Zac Bowen,
Adam A. Porter
Abstract:
Whether based on models, training data or a combination, classifiers place (possibly complex) input data into one of a relatively small number of output categories. In this paper, we study the structure of the boundary--those points for which a neighbor is classified differently--in the context of an input space that is a graph, so that there is a concept of neighboring inputs, The scientific sett…
▽ More
Whether based on models, training data or a combination, classifiers place (possibly complex) input data into one of a relatively small number of output categories. In this paper, we study the structure of the boundary--those points for which a neighbor is classified differently--in the context of an input space that is a graph, so that there is a concept of neighboring inputs, The scientific setting is a model-based naive Bayes classifier for DNA reads produced by Next Generation Sequencers. We show that the boundary is both large and complicated in structure. We create a new measure of uncertainty, called Neighbor Similarity, that compares the result for a point to the distribution of results for its neighbors. This measure not only tracks two inherent uncertainty measures for the Bayes classifier, but also can be implemented, at a computational cost, for classifiers without inherent measures of uncertainty.
△ Less
Submitted 9 February, 2024; v1 submitted 8 December, 2022;
originally announced December 2022.
-
Inference of Media Bias and Content Quality Using Natural-Language Processing
Authors:
Zehan Chao,
Denali Molitor,
Deanna Needell,
Mason A. Porter
Abstract:
Media bias can significantly impact the formation and development of opinions and sentiments in a population. It is thus important to study the emergence and development of partisan media and political polarization. However, it is challenging to quantitatively infer the ideological positions of media outlets. In this paper, we present a quantitative framework to infer both political bias and conte…
▽ More
Media bias can significantly impact the formation and development of opinions and sentiments in a population. It is thus important to study the emergence and development of partisan media and political polarization. However, it is challenging to quantitatively infer the ideological positions of media outlets. In this paper, we present a quantitative framework to infer both political bias and content quality of media outlets from text, and we illustrate this framework with empirical experiments with real-world data. We apply a bidirectional long short-term memory (LSTM) neural network to a data set of more than 1 million tweets to generate a two-dimensional ideological-bias and content-quality measurement for each tweet. We then infer a ``media-bias chart'' of (bias, quality) coordinates for the media outlets by integrating the (bias, quality) measurements of the tweets of the media outlets. We also apply a variety of baseline machine-learning methods, such as a naive-Bayes method and a support-vector machine (SVM), to infer the bias and quality values for each tweet. All of these baseline approaches are based on a bag-of-words approach. We find that the LSTM-network approach has the best performance of the examined methods. Our results illustrate the importance of leveraging word order into machine-learning methods in text analysis.
△ Less
Submitted 30 November, 2022;
originally announced December 2022.
-
Detecting Political Biases of Named Entities and Hashtags on Twitter
Authors:
Zhiping Xiao,
Jeffrey Zhu,
Yining Wang,
Pei Zhou,
Wen Hong Lam,
Mason A. Porter,
Yizhou Sun
Abstract:
Ideological divisions in the United States have become increasingly prominent in daily communication. Accordingly, there has been much research on political polarization, including many recent efforts that take a computational perspective. By detecting political biases in a corpus of text, one can attempt to describe and discern the polarity of that text. Intuitively, the named entities (i.e., the…
▽ More
Ideological divisions in the United States have become increasingly prominent in daily communication. Accordingly, there has been much research on political polarization, including many recent efforts that take a computational perspective. By detecting political biases in a corpus of text, one can attempt to describe and discern the polarity of that text. Intuitively, the named entities (i.e., the nouns and the phrases that act as nouns) and hashtags in text often carry information about political views. For example, people who use the term "pro-choice" are likely to be liberal, whereas people who use the term "pro-life" are likely to be conservative. In this paper, we seek to reveal political polarities in social-media text data and to quantify these polarities by explicitly assigning a polarity score to entities and hashtags. Although this idea is straightforward, it is difficult to perform such inference in a trustworthy quantitative way. Key challenges include the small number of known labels, the continuous spectrum of political views, and the preservation of both a polarity score and a polarity-neutral semantic meaning in an embedding vector of words. To attempt to overcome these challenges, we propose the Polarity-aware Embedding Multi-task learning (PEM) model. This model consists of (1) a self-supervised context-preservation task, (2) an attention-based tweet-level polarity-inference task, and (3) an adversarial learning task that promotes independence between an embedding's polarity dimension and its semantic dimensions. Our experimental results demonstrate that our PEM model can successfully learn polarity-aware embeddings that perform well classification tasks. We examine a variety of applications and we thereby demonstrate the effectiveness of our PEM model. We also discuss important limitations of our work and encourage caution when applying the it to real-world scenarios.
△ Less
Submitted 17 March, 2023; v1 submitted 16 September, 2022;
originally announced September 2022.
-
Emergence of polarization in a sigmoidal bounded-confidence model of opinion dynamics
Authors:
Heather Z. Brooks,
Philip S. Chodrow,
Mason A. Porter
Abstract:
We study a nonlinear bounded-confidence model (BCM) of continuous-time opinion dynamics on networks with both persuadable individuals and zealots. The model is parameterized by a scalar $γ$, which controls the steepness of a smooth influence function. This influence function encodes the relative weights that nodes place on the opinions of other nodes. When $γ= 0$, this influence function recovers…
▽ More
We study a nonlinear bounded-confidence model (BCM) of continuous-time opinion dynamics on networks with both persuadable individuals and zealots. The model is parameterized by a scalar $γ$, which controls the steepness of a smooth influence function. This influence function encodes the relative weights that nodes place on the opinions of other nodes. When $γ= 0$, this influence function recovers Taylor's averaging model; when $γ\rightarrow \infty$, the influence function converges to that of a modified Hegselmann--Krause (HK) BCM. Unlike the classical HK model, however, our sigmoidal bounded-confidence model (SBCM) is smooth for any finite $γ$. We show that the set of steady states of our SBCM is qualitatively similar to that of the Taylor model when $γ$ is small and that the set of steady states approaches a subset of the set of steady states of a modified HK model as $γ\rightarrow \infty$. For several special graph topologies, we give analytical descriptions of important features of the space of steady states. A notable result is a closed-form relationship between the stability of a polarized state and the graph topology in a simple model of echo chambers in social networks. Because the influence function of our BCM is smooth, we are able to study it with linear stability analysis, which is difficult to employ with the usual discontinuous influence functions in BCMs.
△ Less
Submitted 29 July, 2023; v1 submitted 14 September, 2022;
originally announced September 2022.
-
Non-Markovian models of opinion dynamics on temporal networks
Authors:
Weiqi Chu,
Mason A. Porter
Abstract:
Traditional models of opinion dynamics, in which the nodes of a network change their opinions based on their interactions with neighboring nodes, consider how opinions evolve either on time-independent networks or on temporal networks with edges that follow Poisson statistics. Most such models are Markovian. However, in many real-life networks, interactions between individuals (and hence the edges…
▽ More
Traditional models of opinion dynamics, in which the nodes of a network change their opinions based on their interactions with neighboring nodes, consider how opinions evolve either on time-independent networks or on temporal networks with edges that follow Poisson statistics. Most such models are Markovian. However, in many real-life networks, interactions between individuals (and hence the edges of a network) follow non-Poisson processes and thus yield dynamics with memory-dependent effects. In this paper, we model opinion dynamics in which the entities of a temporal network interact and change their opinions via random social interactions. When the edges have non-Poisson interevent statistics, the corresponding opinion models are have non-Markovian dynamics. We derive an opinion model that is governed by an arbitrary waiting-time distribution (WTD) and illustrate a variety of induced opinion models from common WTDs (including Dirac delta distributions, exponential distributions, and heavy-tailed distributions). We analyze the convergence to consensus of these models and prove that homogeneous memory-dependent models of opinion dynamics in our framework always converge to the same steady state regardless of the WTD. We also conduct a numerical investigation of the effects of waiting-time distributions on both transient dynamics and steady states. We observe that models that are induced by heavy-tailed WTDs converge to a steady state more slowly than those with light tails (or with compact support) and that entities with larger waiting times exert a larger influence on the mean opinion at steady state.
△ Less
Submitted 10 March, 2023; v1 submitted 26 August, 2022;
originally announced August 2022.
-
A Majority-Vote Model On Multiplex Networks with Community Structure
Authors:
Kaiyan Peng,
Mason A. Porter
Abstract:
We investigate a majority-vote model on two-layer multiplex networks with community structure. In our majority-vote model, the edges on each layer encode one type of social relationship and an individual changes their opinion based on the majority opinions of their neighbors in each layer. To capture the fact that different relationships often have different levels of importance, we introduce a la…
▽ More
We investigate a majority-vote model on two-layer multiplex networks with community structure. In our majority-vote model, the edges on each layer encode one type of social relationship and an individual changes their opinion based on the majority opinions of their neighbors in each layer. To capture the fact that different relationships often have different levels of importance, we introduce a layer-preference parameter, which determines the probability of a node to adopt an opinion when the node's neighborhoods on the two layers have different majority opinions. We construct our networks so that each node is a member of one community on each layer, and we consider situations in which nodes tend to have more connections with nodes from the same community than with nodes from different communities. We study the influence of the layer-preference parameter, the intralayer communities, and interlayer membership correlation on the steady-state behavior of our model using both direct numerical simulations and a mean-field approximation. We find three different types of steady-state behavior: a fully-mixed state, consensus states, and polarized states. We demonstrate that a stronger interlayer community correlation makes polarized steady states reachable for wider ranges of the other model parameters. We also show that different values of the layer-preference parameter result in qualitatively different phase diagrams for the mean opinions at steady states.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
A Bounded-Confidence Model of Opinion Dynamics with Heterogeneous Node-Activity Levels
Authors:
Grace J. Li,
Mason A. Porter
Abstract:
Agent-based models of opinion dynamics allow one to examine the spread of opinions between entities and to study phenomena such as consensus, polarization, and fragmentation. By studying a model of opinion dynamics on a social network, one can explore the effects of network structure on these phenomena. In social networks, some individuals share their ideas and opinions more frequently than others…
▽ More
Agent-based models of opinion dynamics allow one to examine the spread of opinions between entities and to study phenomena such as consensus, polarization, and fragmentation. By studying a model of opinion dynamics on a social network, one can explore the effects of network structure on these phenomena. In social networks, some individuals share their ideas and opinions more frequently than others. These disparities can arise from heterogeneous sociabilities, heterogeneous activity levels, different prevalences to share opinions when engaging in a social-media platform, or something else. To examine the impact of such heterogeneities on opinion dynamics, we generalize the Deffuant--Weisbuch (DW) bounded-confidence model (BCM) of opinion dynamics by incorporating node weights. The node weights allow us to model agents with different probabilities of interacting. Using numerical simulations, we systematically investigate (using a variety of network structures and node-weight distributions) the effects of node weights, which we assign uniformly at random to the nodes. We demonstrate that introducing heterogeneous node weights results in longer convergence times and more opinion fragmentation than in a baseline DW model. The node weights in our BCM allow one to consider a variety of sociological scenarios in which agents have heterogeneous probabilities of interacting with other agents.
△ Less
Submitted 20 March, 2023; v1 submitted 19 June, 2022;
originally announced June 2022.
-
Persistent Homology for Resource Coverage: A Case Study of Access to Polling Sites
Authors:
Abigail Hickok,
Benjamin Jarman,
Michael Johnson,
Jiajie Luo,
Mason A. Porter
Abstract:
It is important to choose the geographical distributions of public resources in a fair and equitable manner. However, it is complicated to quantify the equity of such a distribution; important factors include distances to resource sites, availability of transportation, and ease of travel. We use persistent homology, which is a tool from topological data analysis, to study the effective availabilit…
▽ More
It is important to choose the geographical distributions of public resources in a fair and equitable manner. However, it is complicated to quantify the equity of such a distribution; important factors include distances to resource sites, availability of transportation, and ease of travel. We use persistent homology, which is a tool from topological data analysis, to study the effective availability and coverage of polling sites. The information from persistent homology allows us to infer holes in the distribution of polling sites. We analyze and compare the coverage of polling sites in Los Angeles County and five cities (Atlanta, Chicago, Jacksonville, New York City, and Salt Lake City), and we conclude that computation of persistent homology appears to be a reasonable approach to analyzing resource coverage.
△ Less
Submitted 11 August, 2023; v1 submitted 9 June, 2022;
originally announced June 2022.
-
A density description of a bounded-confidence model of opinion dynamics on hypergraphs
Authors:
Weiqi Chu,
Mason A. Porter
Abstract:
Social interactions often occur between three or more agents simultaneously. Examining opinion dynamics on hypergraphs allows one to study the effect of such polyadic interactions on the opinions of agents. In this paper, we consider a bounded-confidence model (BCM), in which opinions take continuous values and interacting agents comprise their opinions if they are close enough to each other. We s…
▽ More
Social interactions often occur between three or more agents simultaneously. Examining opinion dynamics on hypergraphs allows one to study the effect of such polyadic interactions on the opinions of agents. In this paper, we consider a bounded-confidence model (BCM), in which opinions take continuous values and interacting agents comprise their opinions if they are close enough to each other. We study a density description of a Deffuant--Weisbuch BCM on hypergraphs. We derive a rate equation for the mean-field opinion density as the number of agents becomes infinite, and we prove that this rate equation yields a probability density that converges to noninteracting opinion clusters. Using numerical simulations, we examine bifurcations of the density-based BCM's steady-state opinion clusters and demonstrate that the agent-based BCM converges to the density description of the BCM as the number of agents becomes infinite.
△ Less
Submitted 27 April, 2023; v1 submitted 23 March, 2022;
originally announced March 2022.
-
A Non-Expert's Introduction to Data Ethics for Mathematicians
Authors:
Mason A. Porter
Abstract:
I give a short introduction to data ethics. I begin with some background information and societal context for data ethics. I then discuss data ethics in mathematical-science education and indicate some available course material. I briefly highlight a few efforts -- at my home institution and elsewhere -- on data ethics, society, and social good. I then discuss open data in research, research repli…
▽ More
I give a short introduction to data ethics. I begin with some background information and societal context for data ethics. I then discuss data ethics in mathematical-science education and indicate some available course material. I briefly highlight a few efforts -- at my home institution and elsewhere -- on data ethics, society, and social good. I then discuss open data in research, research replicability and some other ethical issues in research, and the tension between privacy and open data and code, and a few controversial studies and reactions to studies. I then discuss ethical principles, institutional review boards, and a few other considerations in the scientific use of human data. I then briefly survey a variety of research and lay articles that are relevant to data ethics and data privacy. I conclude with a brief summary and some closing remarks.
My focal audience is mathematicians, but I hope that this chapter will also be useful to others. I am not an expert about data ethics, and this chapter provides only a starting point on this wide-ranging topic. I encourage you to examine the resources that I discuss and to reflect carefully on data ethics, its role in mathematics education, and the societal implications of data and data analysis. As data and technology continue to evolve, I hope that such careful reflection will continue throughout your life.
△ Less
Submitted 25 July, 2024; v1 submitted 18 January, 2022;
originally announced January 2022.
-
Analytical Models for Motifs in Temporal Networks: Discovering Trends and Anomalies
Authors:
Alexandra Porter,
Baharan Mirzasoleiman,
Jure Leskovec
Abstract:
Dynamic evolving networks capture temporal relations in domains such as social networks, communication networks, and financial transaction networks. In such networks, temporal motifs, which are repeated sequences of time-stamped edges/transactions, offer valuable information about the networks' evolution and function. However, currently no analytical models for temporal graphs exist and there are…
▽ More
Dynamic evolving networks capture temporal relations in domains such as social networks, communication networks, and financial transaction networks. In such networks, temporal motifs, which are repeated sequences of time-stamped edges/transactions, offer valuable information about the networks' evolution and function. However, currently no analytical models for temporal graphs exist and there are no models that would allow for scalable modeling of temporal motif frequencies over time. Here, we develop the Temporal Activity State Block Model (TASBM), to model temporal motifs in temporal graphs. We develop efficient model fitting methods and derive closed-form expressions for the expected motif frequencies and their variances in a given temporal network, thus enabling the discovery of statistically significant temporal motifs. Our TASMB framework can accurately track the changes in the expected motif frequencies over time, and also scales well to networks with tens of millions of edges/transactions as it does not require time-consuming generation of many random temporal networks and then computing motif counts for each one of them. We show that TASBM is able to model changes in temporal activity over time in a network of financial transactions, a phone call, and an email network. Additionally, we show that deviations from the expected motif counts calculated by our analytical framework correspond to anomalies in the financial transactions and phone call networks.
△ Less
Submitted 29 December, 2021;
originally announced December 2021.
-
Application of Markov Structure of Genomes to Outlier Identification and Read Classification
Authors:
Alan F. Karr,
Jason Hauzel,
Adam A. Porter,
Marcel Schaefer
Abstract:
In this paper we apply the structure of genomes as second-order Markov processes specified by the distributions of successive triplets of bases to two bioinformatics problems: identification of outliers in genome databases and read classification in metagenomics, using real coronavirus and adenovirus data.
In this paper we apply the structure of genomes as second-order Markov processes specified by the distributions of successive triplets of bases to two bioinformatics problems: identification of outliers in genome databases and read classification in metagenomics, using real coronavirus and adenovirus data.
△ Less
Submitted 24 December, 2021;
originally announced December 2021.
-
Measuring Quality of DNA Sequence Data via Degradation
Authors:
Alan F. Karr,
Jason Hauzel,
Adam A. Porter,
Marcel Schaefer
Abstract:
We propose and apply a novel paradigm for characterization of genome data quality, which quantifies the effects of intentional degradation of quality. The rationale is that the higher the initial quality, the more fragile the genome and the greater the effects of degradation. We demonstrate that this phenomenon is ubiquitous, and that quantified measures of degradation can be used for multiple pur…
▽ More
We propose and apply a novel paradigm for characterization of genome data quality, which quantifies the effects of intentional degradation of quality. The rationale is that the higher the initial quality, the more fragile the genome and the greater the effects of degradation. We demonstrate that this phenomenon is ubiquitous, and that quantified measures of degradation can be used for multiple purposes. We focus on identifying outliers that may be problematic with respect to data quality, but might also be true anomalies or even attempts to subvert the database.
△ Less
Submitted 24 December, 2021;
originally announced December 2021.
-
An adaptation of InfoMap to absorbing random walks using absorption-scaled graphs
Authors:
Esteban Vargas Bernal,
Mason A. Porter,
Joseph H. Tien
Abstract:
InfoMap is a popular approach to detect densely connected "communities" of nodes in networks. To detect such communities, InfoMap uses random walks and ideas from information theory. Motivated by the dynamics of disease spread on networks, whose nodes can have heterogeneous disease-removal rates, we adapt InfoMap to absorbing random walks. To do this, we use absorption-scaled graphs (in which edge…
▽ More
InfoMap is a popular approach to detect densely connected "communities" of nodes in networks. To detect such communities, InfoMap uses random walks and ideas from information theory. Motivated by the dynamics of disease spread on networks, whose nodes can have heterogeneous disease-removal rates, we adapt InfoMap to absorbing random walks. To do this, we use absorption-scaled graphs (in which edge weights are scaled according to absorption rates) and Markov time sweeping. One of our adaptations of InfoMap converges to the standard version of InfoMap in the limit in which the node-absorption rates approach $0$. We demonstrate that the community structure that one obtains using our adaptations of InfoMap can differ markedly from the community structure that one detects using methods that do not account for node-absorption rates. We also illustrate that the community structure that is induced by heterogeneous absorption rates can have important implications for susceptible-infected-recovered (SIR) dynamics on ring-lattice networks. For example, in some situations, the outbreak duration is maximized when a moderate number of nodes have large node-absorption rates.
△ Less
Submitted 23 April, 2024; v1 submitted 20 December, 2021;
originally announced December 2021.
-
An Adaptive Bounded-Confidence Model of Opinion Dynamics on Networks
Authors:
Unchitta Kan,
Michelle Feng,
Mason A. Porter
Abstract:
Individuals who interact with each other in social networks often exchange ideas and influence each other's opinions. A popular approach to study the spread of opinions on networks is by examining bounded-confidence models (BCMs), in which the nodes of a network have continuous-valued states that encode their opinions and are receptive to other nodes' opinions when they lie within some confidence…
▽ More
Individuals who interact with each other in social networks often exchange ideas and influence each other's opinions. A popular approach to study the spread of opinions on networks is by examining bounded-confidence models (BCMs), in which the nodes of a network have continuous-valued states that encode their opinions and are receptive to other nodes' opinions when they lie within some confidence bound of their own opinion. In this paper, we extend the Deffuant--Weisbuch (DW) model, which is a well-known BCM, by examining the spread of opinions that coevolve with network structure. We propose an adaptive variant of the DW model in which the nodes of a network can (1) alter their opinions when they interact with neighboring nodes and (2) break connections with neighbors based on an opinion tolerance threshold and then form new connections following the principle of homophily. This opinion tolerance threshold determines whether or not the opinions of adjacent nodes are sufficiently different to be viewed as `discordant'. Using numerical simulations, we find that our adaptive DW model requires a larger confidence bound than a baseline DW model for the nodes of a network to achieve a consensus opinion. In one region of parameter space, we observe `pseudo-consensus' steady states, in which there exist multiple subclusters of an opinion cluster with opinions that differ from each other by a small amount. In our simulations, we also examine the importance of early-time dynamics and nodes with initially moderate opinions for achieving consensus. Additionally, we explore the effects of coevolution on the convergence time of our BCM.
△ Less
Submitted 29 November, 2022; v1 submitted 10 December, 2021;
originally announced December 2021.
-
Connected Components for Infinite Graph Streams: Theory and Practice
Authors:
Jonathan W. Berry,
Cynthia A Phillips,
Alexandra M. Porter
Abstract:
Motivated by the properties of unending real-world cybersecurity streams, we present a new graph streaming model: XStream. We maintain a streaming graph and its connected components at single-edge granularity. In cybersecurity graph applications, input streams typically consist of edge insertions; individual deletions are not explicit. Analysts maintain as much history as possible and will trigger…
▽ More
Motivated by the properties of unending real-world cybersecurity streams, we present a new graph streaming model: XStream. We maintain a streaming graph and its connected components at single-edge granularity. In cybersecurity graph applications, input streams typically consist of edge insertions; individual deletions are not explicit. Analysts maintain as much history as possible and will trigger customized bulk deletions when necessary Despite a variety of dynamic graph processing systems and some canonical literature on theoretical sliding-window graph streaming, XStream is the first model explicitly designed to accommodate this usage model. Users can provide Boolean predicates to define bulk deletions. Edge arrivals are expected to occur continuously and must always be handled. XStream is implemented via a ring of finite-memory processors. We give algorithms to maintain connected components on the input stream, answer queries about connectivity, and to perform bulk deletion. The system requires bandwidth for internal messages that is some constant factor greater than the stream arrival rate. We prove a relationship among four quantities: the proportion of query downtime allowed, the proportion of edges that survive an aging event, the proportion of duplicated edges, and the bandwidth expansion factor. In addition to presenting the theory behind XStream, we present computational results for a single-threaded prototype implementation. Stream ingestion rates are bounded by computer architecture. We determine this bound for XStream inter-process message-passing rates in Intel TBB applications on Intel Sky Lake processors: between one and five million graph edges per second. Our single-threaded prototype runs our full protocols through multiple aging events at between one half and one a million edges per second, and we give ideas for speeding this up by orders of magnitude.
△ Less
Submitted 30 November, 2021;
originally announced December 2021.
-
Specified Certainty Classification, with Application to Read Classification for Reference-Guided Metagenomic Assembly
Authors:
Alan F. Karr,
Jason Hauzel,
Prahlad Menon,
Adam A. Porter,
Marcel Schaefer
Abstract:
Specified Certainty Classification (SCC) is a new paradigm for employing classifiers whose outputs carry uncertainties, typically in the form of Bayesian posterior probabilities. By allowing the classifier output to be less precise than one of a set of atomic decisions, SCC allows all decisions to achieve a specified level of certainty, as well as provides insights into classifier behavior by exam…
▽ More
Specified Certainty Classification (SCC) is a new paradigm for employing classifiers whose outputs carry uncertainties, typically in the form of Bayesian posterior probabilities. By allowing the classifier output to be less precise than one of a set of atomic decisions, SCC allows all decisions to achieve a specified level of certainty, as well as provides insights into classifier behavior by examining all decisions that are possible. Our primary illustration is read classification for reference-guided genome assembly, but we demonstrate the breadth of SCC by also analyzing COVID-19 vaccination data.
△ Less
Submitted 28 September, 2021; v1 submitted 13 September, 2021;
originally announced September 2021.
-
Analysis of Spatial and Spatiotemporal Anomalies Using Persistent Homology: Case Studies with COVID-19 Data
Authors:
Abigail Hickok,
Deanna Needell,
Mason A. Porter
Abstract:
We develop a method for analyzing spatial and spatiotemporal anomalies in geospatial data using topological data analysis (TDA). To do this, we use persistent homology (PH), which allows one to algorithmically detect geometric voids in a data set and quantify the persistence of such voids. We construct an efficient filtered simplicial complex (FSC) such that the voids in our FSC are in one-to-one…
▽ More
We develop a method for analyzing spatial and spatiotemporal anomalies in geospatial data using topological data analysis (TDA). To do this, we use persistent homology (PH), which allows one to algorithmically detect geometric voids in a data set and quantify the persistence of such voids. We construct an efficient filtered simplicial complex (FSC) such that the voids in our FSC are in one-to-one correspondence with the anomalies. Our approach goes beyond simply identifying anomalies; it also encodes information about the relationships between anomalies. We use vineyards, which one can interpret as time-varying persistence diagrams (which are an approach for visualizing PH), to track how the locations of the anomalies change with time. We conduct two case studies using spatially heterogeneous COVID-19 data. First, we examine vaccination rates in New York City by zip code at a single point in time. Second, we study a year-long data set of COVID-19 case rates in neighborhoods of the city of Los Angeles.
△ Less
Submitted 24 February, 2022; v1 submitted 19 July, 2021;
originally announced July 2021.
-
A note on hyperopic cops and robber
Authors:
Nancy E. Clarke,
Stephen Finbow,
Margaret-Ellen Messinger,
Amanda Porter
Abstract:
We explore a variant of the game of Cops and Robber introduced by Bonato et al.~where the robber is invisible unless outside the common neighbourhood of the cops. The hyperopic cop number is analogous to the cop number and we investigate bounds on this quantity. We define a small common neighbourhood set and relate the minimum cardinality of this graph parameter to the hyperopic cop number. We con…
▽ More
We explore a variant of the game of Cops and Robber introduced by Bonato et al.~where the robber is invisible unless outside the common neighbourhood of the cops. The hyperopic cop number is analogous to the cop number and we investigate bounds on this quantity. We define a small common neighbourhood set and relate the minimum cardinality of this graph parameter to the hyperopic cop number. We consider diameter 2 graphs, particularly the join of two graphs, as well as Cartesian products.
△ Less
Submitted 15 July, 2021;
originally announced July 2021.
-
A Multilayer Network Model of the Coevolution of the Spread of a Disease and Competing Opinions
Authors:
Kaiyan Peng,
Zheng Lu,
Vanessa Lin,
Michael R. Lindstrom,
Christian Parkinson,
Chuntian Wang,
Andrea L. Bertozzi,
Mason A. Porter
Abstract:
During the COVID-19 pandemic, conflicting opinions on physical distancing swept across social media, affecting both human behavior and the spread of COVID-19. Inspired by such phenomena, we construct a two-layer multiplex network for the coupled spread of a disease and conflicting opinions. We model each process as a contagion. On one layer, we consider the concurrent evolution of two opinions --…
▽ More
During the COVID-19 pandemic, conflicting opinions on physical distancing swept across social media, affecting both human behavior and the spread of COVID-19. Inspired by such phenomena, we construct a two-layer multiplex network for the coupled spread of a disease and conflicting opinions. We model each process as a contagion. On one layer, we consider the concurrent evolution of two opinions -- pro-physical-distancing and anti-physical-distancing -- that compete with each other and have mutual immunity to each other. The disease evolves on the other layer, and individuals are less likely (respectively, more likely) to become infected when they adopt the pro-physical-distancing (respectively, anti-physical-distancing) opinion. We develop approximations of mean-field type by generalizing monolayer pair approximations to multilayer networks; these approximations agree well with Monte Carlo simulations for a broad range of parameters and several network structures. Through numerical simulations, we illustrate the influence of opinion dynamics on the spread of the disease from complex interactions both between the two conflicting opinions and between the opinions and the disease. We find that lengthening the duration that individuals hold an opinion may help suppress disease transmission, and we demonstrate that increasing the cross-layer correlations or intra-layer correlations of node degrees may lead to fewer individuals becoming infected with the disease.
△ Less
Submitted 4 July, 2021;
originally announced July 2021.
-
Lonely individuals process the world in idiosyncratic ways
Authors:
Elisa C. Baek,
Ryan Hyon,
Karina López,
Meng Du,
Mason A. Porter,
Carolyn Parkinson
Abstract:
Loneliness is detrimental to well-being and is often accompanied by self-reported feelings of not being understood by others. What contributes to such feelings in lonely people? We used functional magnetic resonance imaging (fMRI) of 66 participants to unobtrusively measure the relative alignment of people's mental processing of naturalistic stimuli and tested whether or not lonely people actually…
▽ More
Loneliness is detrimental to well-being and is often accompanied by self-reported feelings of not being understood by others. What contributes to such feelings in lonely people? We used functional magnetic resonance imaging (fMRI) of 66 participants to unobtrusively measure the relative alignment of people's mental processing of naturalistic stimuli and tested whether or not lonely people actually process the world in idiosyncratic ways. We found evidence for such idiosyncrasy: lonely individuals' neural responses were dissimilar to their peers, particularly in regions of the default-mode network in which similar responses have been associated with shared perspectives and subjective understanding. These relationships persisted when controlling for demographic similarities, objective social isolation, and participants' friendships with each other. Our findings suggest the possibility that being surrounded by people who see the world differently from oneself, even if one is friends with them, may be a risk factor for loneliness.
△ Less
Submitted 16 August, 2022; v1 submitted 2 July, 2021;
originally announced July 2021.
-
Popular individuals process the world in particularly normative ways
Authors:
Elisa C. Baek,
Ryan Hyon,
Karina López,
Emily S. Finn,
Mason A. Porter,
Carolyn Parkinson
Abstract:
People differ in how they attend to, interpret, and respond to their surroundings. Convergent processing of the world may be one factor that contributes to social connections between individuals. We used neuroimaging and network analysis to investigate whether the most central individuals in their communities (as measured by in-degree centrality, a notion of popularity) process the world in a part…
▽ More
People differ in how they attend to, interpret, and respond to their surroundings. Convergent processing of the world may be one factor that contributes to social connections between individuals. We used neuroimaging and network analysis to investigate whether the most central individuals in their communities (as measured by in-degree centrality, a notion of popularity) process the world in a particularly normative way. We found that more central individuals had exceptionally similar neural responses to their peers and especially to each other in brain regions that are associated with high-level interpretations and social cognition (e.g., in the default-mode network), whereas less-central individuals exhibited more idiosyncratic responses. Self-reported enjoyment of and interest in stimuli followed a similar pattern, but accounting for these data did not change our main results. These findings suggest that highly-central individuals process the world in exceptionally similar ways, whereas less-central individuals process the world in idiosyncratic ways.
△ Less
Submitted 23 September, 2021; v1 submitted 4 June, 2021;
originally announced June 2021.
-
Topological Data Analysis of Spatial Systems
Authors:
Michelle Feng,
Abigail Hickok,
Mason A. Porter
Abstract:
In this chapter, we discuss applications of topological data analysis (TDA) to spatial systems. We briefly review the recently proposed level-set construction of filtered simplicial complexes, and we then examine persistent homology in two cases studies: street networks in Shanghai and hotspots of COVID-19 infections. We then summarize our results and provide an outlook on TDA in spatial systems.
In this chapter, we discuss applications of topological data analysis (TDA) to spatial systems. We briefly review the recently proposed level-set construction of filtered simplicial complexes, and we then examine persistent homology in two cases studies: street networks in Shanghai and hotspots of COVID-19 infections. We then summarize our results and provide an outlook on TDA in spatial systems.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
Detection of Functional Communities in Networks of Randomly Coupled Oscillators Using the Dynamic-Mode Decomposition
Authors:
Christopher W. Curtis,
Mason A. Porter
Abstract:
Dynamic-mode decomposition (DMD) is a versatile framework for model-free analysis of time series that are generated by dynamical systems. We develop a DMD-based algorithm to investigate the formation of "functional communities" in networks of coupled, heterogeneous Kuramoto oscillators. In these functional communities, the oscillators in the network have similar dynamics. We consider two common ra…
▽ More
Dynamic-mode decomposition (DMD) is a versatile framework for model-free analysis of time series that are generated by dynamical systems. We develop a DMD-based algorithm to investigate the formation of "functional communities" in networks of coupled, heterogeneous Kuramoto oscillators. In these functional communities, the oscillators in the network have similar dynamics. We consider two common random-graph models (Watts--Strogatz networks and Barabási--Albert networks) with different amounts of heterogeneities among the oscillators. In our computations, we find that membership in a community reflects the extent to which there is establishment and sustainment of locking between oscillators. We construct forest graphs that illustrate the complex ways in which the heterogeneous oscillators associate and disassociate with each other.
△ Less
Submitted 5 August, 2021; v1 submitted 26 March, 2021;
originally announced March 2021.