Search | arXiv e-print repository

Spectral clustering for dependent community Hawkes process models of temporal networks

Authors: Lingfei Zhao, Hadeel Soliman, Kevin S. Xu, Subhadeep Paul

Abstract: Temporal networks observed continuously over time through timestamped relational events data are commonly encountered in application settings including online social media communications, financial transactions, and international relations. Temporal networks often exhibit community structure and strong dependence patterns among node pairs. This dependence can be modeled through mutual excitations,… ▽ More Temporal networks observed continuously over time through timestamped relational events data are commonly encountered in application settings including online social media communications, financial transactions, and international relations. Temporal networks often exhibit community structure and strong dependence patterns among node pairs. This dependence can be modeled through mutual excitations, where an interaction event from a sender to a receiver node increases the possibility of future events among other node pairs. We provide statistical results for a class of models that we call dependent community Hawkes (DCH) models, which combine the stochastic block model with mutually exciting Hawkes processes for modeling both community structure and dependence among node pairs, respectively. We derive a non-asymptotic upper bound on the misclustering error of spectral clustering on the event count matrix as a function of the number of nodes and communities, time duration, and the amount of dependence in the model. Our result leverages recent results on bounding an appropriate distance between a multivariate Hawkes process count vector and a Gaussian vector, along with results from random matrix theory. We also propose a DCH model that incorporates only self and reciprocal excitation along with highly scalable parameter estimation using a Generalized Method of Moments (GMM) estimator that we demonstrate to be consistent for growing network size and time duration. △ Less

Submitted 27 May, 2025; originally announced May 2025.

arXiv:2211.02234 [pdf, other]

A Latent Space Model for HLA Compatibility Networks in Kidney Transplantation

Authors: Zhipeng Huang, Kevin S. Xu

Abstract: Kidney transplantation is the preferred treatment for people suffering from end-stage renal disease. Successful kidney transplants still fail over time, known as graft failure; however, the time to graft failure, or graft survival time, can vary significantly between different recipients. A significant biological factor affecting graft survival times is the compatibility between the human leukocyt… ▽ More Kidney transplantation is the preferred treatment for people suffering from end-stage renal disease. Successful kidney transplants still fail over time, known as graft failure; however, the time to graft failure, or graft survival time, can vary significantly between different recipients. A significant biological factor affecting graft survival times is the compatibility between the human leukocyte antigens (HLAs) of the donor and recipient. We propose to model HLA compatibility using a network, where the nodes denote different HLAs of the donor and recipient, and edge weights denote compatibilities of the HLAs, which can be positive or negative. The network is indirectly observed, as the edge weights are estimated from transplant outcomes rather than directly observed. We propose a latent space model for such indirectly-observed weighted and signed networks. We demonstrate that our latent space model can not only result in more accurate estimates of HLA compatibilities, but can also be incorporated into survival analysis models to improve accuracy for the downstream task of predicting graft survival times. △ Less

Submitted 3 November, 2022; originally announced November 2022.

Comments: This work has been accepted to BIBM 2022

arXiv:2205.09263 [pdf, other]

A Mutually Exciting Latent Space Hawkes Process Model for Continuous-time Networks

Authors: Zhipeng Huang, Hadeel Soliman, Subhadeep Paul, Kevin S. Xu

Abstract: Networks and temporal point processes serve as fundamental building blocks for modeling complex dynamic relational data in various domains. We propose the latent space Hawkes (LSH) model, a novel generative model for continuous-time networks of relational events, using a latent space representation for nodes. We model relational events between nodes using mutually exciting Hawkes processes with ba… ▽ More Networks and temporal point processes serve as fundamental building blocks for modeling complex dynamic relational data in various domains. We propose the latent space Hawkes (LSH) model, a novel generative model for continuous-time networks of relational events, using a latent space representation for nodes. We model relational events between nodes using mutually exciting Hawkes processes with baseline intensities dependent upon the distances between the nodes in the latent space and sender and receiver specific effects. We demonstrate that our proposed LSH model can replicate many features observed in real temporal networks including reciprocity and transitivity, while also achieving superior prediction accuracy and providing more interpretable fits than existing models. △ Less

Submitted 6 July, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

Comments: To appear in UAI 2022. Code available at https://github.com/IdeasLabUT/Latent-Space-Hawkes

arXiv:2205.00639 [pdf, other]

The Multivariate Community Hawkes Model for Dependent Relational Events in Continuous-time Networks

Authors: Hadeel Soliman, Lingfei Zhao, Zhipeng Huang, Subhadeep Paul, Kevin S. Xu

Abstract: The stochastic block model (SBM) is one of the most widely used generative models for network data. Many continuous-time dynamic network models are built upon the same assumption as the SBM: edges or events between all pairs of nodes are conditionally independent given the block or community memberships, which prevents them from reproducing higher-order motifs such as triangles that are commonly o… ▽ More The stochastic block model (SBM) is one of the most widely used generative models for network data. Many continuous-time dynamic network models are built upon the same assumption as the SBM: edges or events between all pairs of nodes are conditionally independent given the block or community memberships, which prevents them from reproducing higher-order motifs such as triangles that are commonly observed in real networks. We propose the multivariate community Hawkes (MULCH) model, an extremely flexible community-based model for continuous-time networks that introduces dependence between node pairs using structured multivariate Hawkes processes. We fit the model using a spectral clustering and likelihood-based local refinement procedure. We find that our proposed MULCH model is far more accurate than existing models both for predictive and generative tasks. △ Less

Submitted 6 July, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

Comments: To appear at ICML 2022. Code available at https://github.com/IdeasLabUT/Multivariate-Community-Hawkes

arXiv:2103.03305 [pdf, other]

Predicting Kidney Transplant Survival using Multiple Feature Representations for HLAs

Authors: Mohammadreza Nemati, Haonan Zhang, Michael Sloma, Dulat Bekbolsynov, Hong Wang, Stanislaw Stepkowski, Kevin S. Xu

Abstract: Kidney transplantation can significantly enhance living standards for people suffering from end-stage renal disease. A significant factor that affects graft survival time (the time until the transplant fails and the patient requires another transplant) for kidney transplantation is the compatibility of the Human Leukocyte Antigens (HLAs) between the donor and recipient. In this paper, we propose 4… ▽ More Kidney transplantation can significantly enhance living standards for people suffering from end-stage renal disease. A significant factor that affects graft survival time (the time until the transplant fails and the patient requires another transplant) for kidney transplantation is the compatibility of the Human Leukocyte Antigens (HLAs) between the donor and recipient. In this paper, we propose 4 new biologically-relevant feature representations for incorporating HLA information into machine learning-based survival analysis algorithms. We evaluate our proposed HLA feature representations on a database of over 100,000 transplants and find that they improve prediction accuracy by about 1%, modest at the patient level but potentially significant at a societal level. Accurate prediction of survival times can improve transplant survival outcomes, enabling better allocation of donors to recipients and reducing the number of re-transplants due to graft failure with poorly matched donors. △ Less

Submitted 5 July, 2022; v1 submitted 4 March, 2021; originally announced March 2021.

Comments: Extended version of AIME 2021 conference paper

Journal ref: Proceedings of the 19th International Conference on Artificial Intelligence in Medicine (2021) 51-60

arXiv:1908.06940 [pdf, other]

CHIP: A Hawkes Process Model for Continuous-time Networks with Scalable and Consistent Estimation

Authors: Makan Arastuie, Subhadeep Paul, Kevin S. Xu

Abstract: In many application settings involving networks, such as messages between users of an on-line social network or transactions between traders in financial markets, the observed data consist of timestamped relational events, which form a continuous-time network. We propose the Community Hawkes Independent Pairs (CHIP) generative model for such networks. We show that applying spectral clustering to a… ▽ More In many application settings involving networks, such as messages between users of an on-line social network or transactions between traders in financial markets, the observed data consist of timestamped relational events, which form a continuous-time network. We propose the Community Hawkes Independent Pairs (CHIP) generative model for such networks. We show that applying spectral clustering to an aggregated adjacency matrix constructed from the CHIP model provides consistent community detection for a growing number of nodes and time duration. We also develop consistent and computationally efficient estimators for the model parameters. We demonstrate that our proposed CHIP model and estimation procedure scales to large networks with tens of thousands of nodes and provides superior fits than existing continuous-time network models on several real networks. △ Less

Submitted 10 November, 2020; v1 submitted 19 August, 2019; originally announced August 2019.

Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada. Source code is available at https://github.com/IdeasLabUT/CHIP-Network-Model

arXiv:1711.10967 [pdf, other]

The Block Point Process Model for Continuous-Time Event-Based Dynamic Networks

Authors: Ruthwik R. Junuthula, Maysam Haghdan, Kevin S. Xu, Vijay K. Devabhaktuni

Abstract: We consider the problem of analyzing timestamped relational events between a set of entities, such as messages between users of an on-line social network. Such data are often analyzed using static or discrete-time network models, which discard a significant amount of information by aggregating events over time to form network snapshots. In this paper, we introduce a block point process model (BPPM… ▽ More We consider the problem of analyzing timestamped relational events between a set of entities, such as messages between users of an on-line social network. Such data are often analyzed using static or discrete-time network models, which discard a significant amount of information by aggregating events over time to form network snapshots. In this paper, we introduce a block point process model (BPPM) for continuous-time event-based dynamic networks. The BPPM is inspired by the well-known stochastic block model (SBM) for static networks. We show that networks generated by the BPPM follow an SBM in the limit of a growing number of nodes. We use this property to develop principled and efficient local search and variational inference procedures initialized by regularized spectral clustering. We fit BPPMs with exponential Hawkes processes to analyze several real network data sets, including a Facebook wall post network with over 3,500 nodes and 130,000 events. △ Less

Submitted 21 February, 2019; v1 submitted 29 November, 2017; originally announced November 2017.

Comments: To appear at The Web Conference 2019

arXiv:1703.06065 [pdf, other]

Block CUR: Decomposing Matrices using Groups of Columns

Authors: Urvashi Oswal, Swayambhoo Jain, Kevin S. Xu, Brian Eriksson

Abstract: A common problem in large-scale data analysis is to approximate a matrix using a combination of specifically sampled rows and columns, known as CUR decomposition. Unfortunately, in many real-world environments, the ability to sample specific individual rows or columns of the matrix is limited by either system constraints or cost. In this paper, we consider matrix approximation by sampling predefin… ▽ More A common problem in large-scale data analysis is to approximate a matrix using a combination of specifically sampled rows and columns, known as CUR decomposition. Unfortunately, in many real-world environments, the ability to sample specific individual rows or columns of the matrix is limited by either system constraints or cost. In this paper, we consider matrix approximation by sampling predefined \emph{blocks} of columns (or rows) from the matrix. We present an algorithm for sampling useful column blocks and provide novel guarantees for the quality of the approximation. This algorithm has application in problems as diverse as biometric data analysis to distributed computing. We demonstrate the effectiveness of the proposed algorithms for computing the Block CUR decomposition of large matrices in a distributed setting with multiple nodes in a compute cluster, where such blocks correspond to columns (or rows) of the matrix stored on the same node, which can be retrieved with much less overhead than retrieving individual columns stored across different nodes. In the biometric setting, the rows correspond to different users and columns correspond to users' biometric reaction to external stimuli, {\em e.g.,}~watching video content, at a particular time instant. There is significant cost in acquiring each user's reaction to lengthy content so we sample a few important scenes to approximate the biometric response. An individual time sample in this use case cannot be queried in isolation due to the lack of context that caused that biometric reaction. Instead, collections of time segments ({\em i.e.,} blocks) must be presented to the user. The practical application of these algorithms is shown via experimental results using real-world user biometric data from a content testing environment. △ Less

Submitted 9 July, 2018; v1 submitted 17 March, 2017; originally announced March 2017.

Comments: shorter version to appear in ECML-PKDD 2018

arXiv:1607.07330 [pdf, ps, other]

Evaluating Link Prediction Accuracy on Dynamic Networks with Added and Removed Edges

Authors: Ruthwik R. Junuthula, Kevin S. Xu, Vijay K. Devabhaktuni

Abstract: The task of predicting future relationships in a social network, known as link prediction, has been studied extensively in the literature. Many link prediction methods have been proposed, ranging from common neighbors to probabilistic models. Recent work by Yang et al. has highlighted several challenges in evaluating link prediction accuracy. In dynamic networks where edges are both added and remo… ▽ More The task of predicting future relationships in a social network, known as link prediction, has been studied extensively in the literature. Many link prediction methods have been proposed, ranging from common neighbors to probabilistic models. Recent work by Yang et al. has highlighted several challenges in evaluating link prediction accuracy. In dynamic networks where edges are both added and removed over time, the link prediction problem is more complex and involves predicting both newly added and newly removed edges. This results in new challenges in the evaluation of dynamic link prediction methods, and the recommendations provided by Yang et al. are no longer applicable, because they do not address edge removal. In this paper, we investigate several metrics currently used for evaluating accuracies of dynamic link prediction methods and demonstrate why they can be misleading in many cases. We provide several recommendations on evaluating dynamic link prediction accuracy, including separation into two categories of evaluation. Finally we propose a unified metric to characterize link prediction accuracy effectively using a single number. △ Less

Submitted 25 July, 2016; originally announced July 2016.

Comments: To appear in Proceedings of SocialCom 2016

arXiv:1602.07754 [pdf, other]

doi 10.1109/TBME.2016.2632523

A Compressed Sensing Based Decomposition of Electrodermal Activity Signals

Authors: Swayambhoo Jain, Urvashi Oswal, Kevin S. Xu, Brian Eriksson, Jarvis Haupt

Abstract: The measurement and analysis of Electrodermal Activity (EDA) offers applications in diverse areas ranging from market research, to seizure detection, to human stress analysis. Unfortunately, the analysis of EDA signals is made difficult by the superposition of numerous components which can obscure the signal information related to a user's response to a stimulus. We show how simple pre-processing… ▽ More The measurement and analysis of Electrodermal Activity (EDA) offers applications in diverse areas ranging from market research, to seizure detection, to human stress analysis. Unfortunately, the analysis of EDA signals is made difficult by the superposition of numerous components which can obscure the signal information related to a user's response to a stimulus. We show how simple pre-processing followed by a novel compressed sensing based decomposition can mitigate the effects of the undesired noise components and help reveal the underlying physiological signal. The proposed framework allows for decomposition of EDA signals with provable bounds on the recovery of user responses. We test our procedure on both synthetic and real-world EDA signals from wearable sensors and demonstrate that our approach allows for more accurate recovery of user responses as compared to the existing techniques. △ Less

Submitted 26 January, 2017; v1 submitted 24 February, 2016; originally announced February 2016.

Comments: To appear in IEEE Transactions on Biomedical Engineering

arXiv:1508.04887 [pdf, other]

doi 10.1109/TNNLS.2015.2466686

Multi-criteria Similarity-based Anomaly Detection using Pareto Depth Analysis

Authors: Ko-Jen Hsiao, Kevin S. Xu, Jeff Calder, Alfred O. Hero III

Abstract: We consider the problem of identifying patterns in a data set that exhibit anomalous behavior, often referred to as anomaly detection. Similarity-based anomaly detection algorithms detect abnormally large amounts of similarity or dissimilarity, e.g.~as measured by nearest neighbor Euclidean distances between a test sample and the training samples. In many application domains there may not exist a… ▽ More We consider the problem of identifying patterns in a data set that exhibit anomalous behavior, often referred to as anomaly detection. Similarity-based anomaly detection algorithms detect abnormally large amounts of similarity or dissimilarity, e.g.~as measured by nearest neighbor Euclidean distances between a test sample and the training samples. In many application domains there may not exist a single dissimilarity measure that captures all possible anomalous patterns. In such cases, multiple dissimilarity measures can be defined, including non-metric measures, and one can test for anomalies by scalarizing using a non-negative linear combination of them. If the relative importance of the different dissimilarity measures are not known in advance, as in many anomaly detection applications, the anomaly detection algorithm may need to be executed multiple times with different choices of weights in the linear combination. In this paper, we propose a method for similarity-based anomaly detection using a novel multi-criteria dissimilarity measure, the Pareto depth. The proposed Pareto depth analysis (PDA) anomaly detection algorithm uses the concept of Pareto optimality to detect anomalies under multiple criteria without having to run an algorithm multiple times with different choices of weights. The proposed PDA approach is provably better than using linear combinations of the criteria and shows superior performance on experiments with synthetic and real data sets. △ Less

Submitted 20 August, 2015; originally announced August 2015.

Comments: The work is submitted to IEEE TNNLS Special Issue on Learning in Non-(geo)metric Spaces for review on October 28, 2013, revised on July 26, 2015 and accepted on July 30, 2015. A preliminary version of this work is reported in the conference Advances in Neural Information Processing Systems (NIPS) 2012

Journal ref: IEEE Transactions on Neural Networks and Learning Systems 27 (2016) 1307-1321

arXiv:1411.5404 [pdf, other]

Stochastic Block Transition Models for Dynamic Networks

Authors: Kevin S. Xu

Abstract: There has been great interest in recent years on statistical models for dynamic networks. In this paper, I propose a stochastic block transition model (SBTM) for dynamic networks that is inspired by the well-known stochastic block model (SBM) for static networks and previous dynamic extensions of the SBM. Unlike most existing dynamic network models, it does not make a hidden Markov assumption on t… ▽ More There has been great interest in recent years on statistical models for dynamic networks. In this paper, I propose a stochastic block transition model (SBTM) for dynamic networks that is inspired by the well-known stochastic block model (SBM) for static networks and previous dynamic extensions of the SBM. Unlike most existing dynamic network models, it does not make a hidden Markov assumption on the edge-level dynamics, allowing the presence or absence of edges to directly influence future edge probabilities while retaining the interpretability of the SBM. I derive an approximate inference procedure for the SBTM and demonstrate that it is significantly better at reproducing durations of edges in real social network data. △ Less

Submitted 28 January, 2015; v1 submitted 19 November, 2014; originally announced November 2014.

Comments: To appear in proceedings of AISTATS 2015

Journal ref: Proceedings of the 18th International Conference on Artificial Intelligence and Statistics (2015) 1079-1087

arXiv:1410.8597 [pdf, other]

Consistent estimation of dynamic and multi-layer block models

Authors: Qiuyi Han, Kevin S. Xu, Edoardo M. Airoldi

Abstract: Significant progress has been made recently on theoretical analysis of estimators for the stochastic block model (SBM). In this paper, we consider the multi-graph SBM, which serves as a foundation for many application settings including dynamic and multi-layer networks. We explore the asymptotic properties of two estimators for the multi-graph SBM, namely spectral clustering and the maximum-likeli… ▽ More Significant progress has been made recently on theoretical analysis of estimators for the stochastic block model (SBM). In this paper, we consider the multi-graph SBM, which serves as a foundation for many application settings including dynamic and multi-layer networks. We explore the asymptotic properties of two estimators for the multi-graph SBM, namely spectral clustering and the maximum-likelihood estimate (MLE), as the number of layers of the multi-graph increases. We derive sufficient conditions for consistency of both estimators and propose a variational approximation to the MLE that is computationally feasible for large networks. We verify the sufficient conditions via simulation and demonstrate that they are practical. In addition, we apply the model to two real data sets: a dynamic social network and a multi-layer social network with several types of relations. △ Less

Submitted 19 May, 2015; v1 submitted 30 October, 2014; originally announced October 2014.

Comments: To appear at ICML 2015

Journal ref: Proceedings of the 32nd International Conference on Machine Learning (2015) 1511-1520

arXiv:1403.0921 [pdf, other]

doi 10.1109/JSTSP.2014.2310294

Dynamic stochastic blockmodels for time-evolving social networks

Authors: Kevin S. Xu, Alfred O. Hero III

Abstract: Significant efforts have gone into the development of statistical models for analyzing data in the form of networks, such as social networks. Most existing work has focused on modeling static networks, which represent either a single time snapshot or an aggregate view over time. There has been recent interest in statistical modeling of dynamic networks, which are observed at multiple points in tim… ▽ More Significant efforts have gone into the development of statistical models for analyzing data in the form of networks, such as social networks. Most existing work has focused on modeling static networks, which represent either a single time snapshot or an aggregate view over time. There has been recent interest in statistical modeling of dynamic networks, which are observed at multiple points in time and offer a richer representation of many complex phenomena. In this paper, we present a state-space model for dynamic networks that extends the well-known stochastic blockmodel for static networks to the dynamic setting. We fit the model in a near-optimal manner using an extended Kalman filter (EKF) augmented with a local search. We demonstrate that the EKF-based algorithm performs competitively with a state-of-the-art algorithm based on Markov chain Monte Carlo sampling but is significantly less computationally demanding. △ Less

Submitted 4 March, 2014; originally announced March 2014.

Comments: To appear in Journal of Selected Topics in Signal Processing special issue: Signal Processing for Social Networks

ACM Class: G.3; G.2.2

Journal ref: IEEE Journal of Selected Topics in Signal Processing 8 (2014) 552-562

arXiv:1306.1271 [pdf, ps, other]

Predictability of social interactions

Authors: Kevin S. Xu

Abstract: The ability to predict social interactions between people has profound applications including targeted marketing and prediction of information diffusion and disease propagation. Previous work has shown that the location of an individual at any given time is highly predictable. This study examines the predictability of social interactions between people to determine whether interaction patterns are… ▽ More The ability to predict social interactions between people has profound applications including targeted marketing and prediction of information diffusion and disease propagation. Previous work has shown that the location of an individual at any given time is highly predictable. This study examines the predictability of social interactions between people to determine whether interaction patterns are similarly predictable. I find that the locations and times of interactions for an individual are highly predictable; however, the other person the individual interacts with is less predictable. Furthermore, I show that knowledge of the locations and times of interactions has almost no effect on the predictability of the other person. Finally I demonstrate that a simple Markov chain model is able to achieve close to the upper bound in terms of predicting the next person with whom a given individual will interact. △ Less

Submitted 5 June, 2013; originally announced June 2013.

Comments: Extended abstract selected as the winner of the 2013 International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction (SBP) Challenge

arXiv:1305.0051 [pdf, ps, other]

doi 10.1109/ICC.2009.5199418

Revealing social networks of spammers through spectral clustering

Authors: Kevin S. Xu, Mark Kliger, Yilun Chen, Peter J. Woolf, Alfred O. Hero III

Abstract: To date, most studies on spam have focused only on the spamming phase of the spam cycle and have ignored the harvesting phase, which consists of the mass acquisition of email addresses. It has been observed that spammers conceal their identity to a lesser degree in the harvesting phase, so it may be possible to gain new insights into spammers' behavior by studying the behavior of harvesters, which… ▽ More To date, most studies on spam have focused only on the spamming phase of the spam cycle and have ignored the harvesting phase, which consists of the mass acquisition of email addresses. It has been observed that spammers conceal their identity to a lesser degree in the harvesting phase, so it may be possible to gain new insights into spammers' behavior by studying the behavior of harvesters, which are individuals or bots that collect email addresses. In this paper, we reveal social networks of spammers by identifying communities of harvesters with high behavioral similarity using spectral clustering. The data analyzed was collected through Project Honey Pot, a distributed system for monitoring harvesting and spamming. Our main findings are (1) that most spammers either send only phishing emails or no phishing emails at all, (2) that most communities of spammers also send only phishing emails or no phishing emails at all, and (3) that several groups of spammers within communities exhibit coherent temporal behavior and have similar IP addresses. Our findings reveal some previously unknown behavior of spammers and suggest that there is indeed social structure between spammers to be discovered. △ Less

Submitted 30 April, 2013; originally announced May 2013.

Comments: Source code and data available at http://tbayes.eecs.umich.edu/xukevin/spam_icc09 Proceedings of the IEEE International Conference on Communications (2009)

arXiv:1304.5974 [pdf, other]

doi 10.1007/978-3-642-37210-0_22

Dynamic stochastic blockmodels: Statistical models for time-evolving networks

Authors: Kevin S. Xu, Alfred O. Hero III

Abstract: Significant efforts have gone into the development of statistical models for analyzing data in the form of networks, such as social networks. Most existing work has focused on modeling static networks, which represent either a single time snapshot or an aggregate view over time. There has been recent interest in statistical modeling of dynamic networks, which are observed at multiple points in tim… ▽ More Significant efforts have gone into the development of statistical models for analyzing data in the form of networks, such as social networks. Most existing work has focused on modeling static networks, which represent either a single time snapshot or an aggregate view over time. There has been recent interest in statistical modeling of dynamic networks, which are observed at multiple points in time and offer a richer representation of many complex phenomena. In this paper, we propose a state-space model for dynamic networks that extends the well-known stochastic blockmodel for static networks to the dynamic setting. We then propose a procedure to fit the model using a modification of the extended Kalman filter augmented with a local search. We apply the procedure to analyze a dynamic social network of email communication. △ Less

Submitted 22 April, 2013; originally announced April 2013.

ACM Class: G.3; G.2.2

Journal ref: Proceedings of the 6th International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction (2013) 201-210

arXiv:1202.6042 [pdf, other]

doi 10.1007/s10618-012-0286-6

A Regularized Graph Layout Framework for Dynamic Network Visualization

Authors: Kevin S. Xu, Mark Kliger, Alfred O. Hero III

Abstract: Many real-world networks, including social and information networks, are dynamic structures that evolve over time. Such dynamic networks are typically visualized using a sequence of static graph layouts. In addition to providing a visual representation of the network structure at each time step, the sequence should preserve the mental map between layouts of consecutive time steps to allow a human… ▽ More Many real-world networks, including social and information networks, are dynamic structures that evolve over time. Such dynamic networks are typically visualized using a sequence of static graph layouts. In addition to providing a visual representation of the network structure at each time step, the sequence should preserve the mental map between layouts of consecutive time steps to allow a human to interpret the temporal evolution of the network. In this paper, we propose a framework for dynamic network visualization in the on-line setting where only present and past graph snapshots are available to create the present layout. The proposed framework creates regularized graph layouts by augmenting the cost function of a static graph layout algorithm with a grouping penalty, which discourages nodes from deviating too far from other nodes belonging to the same group, and a temporal penalty, which discourages large node movements between consecutive time steps. The penalties increase the stability of the layout sequence, thus preserving the mental map. We introduce two dynamic layout algorithms within the proposed framework, namely dynamic multidimensional scaling (DMDS) and dynamic graph Laplacian layout (DGLL). We apply these algorithms on several data sets to illustrate the importance of both grouping and temporal regularization for producing interpretable visualizations of dynamic networks. △ Less

Submitted 19 February, 2013; v1 submitted 27 February, 2012; originally announced February 2012.

Comments: To appear in Data Mining and Knowledge Discovery, supporting material (animations and MATLAB toolbox) available at http://tbayes.eecs.umich.edu/xukevin/visualization_dmkd_2012

ACM Class: G.2.2; H.3.4; H.5

Journal ref: Data Mining and Knowledge Discovery 27 (2013) 84-116

arXiv:1110.3741 [pdf, ps, other]

Multi-criteria Anomaly Detection using Pareto Depth Analysis

Authors: Ko-Jen Hsiao, Kevin S. Xu, Jeff Calder, Alfred O. Hero III

Abstract: We consider the problem of identifying patterns in a data set that exhibit anomalous behavior, often referred to as anomaly detection. In most anomaly detection algorithms, the dissimilarity between data samples is calculated by a single criterion, such as Euclidean distance. However, in many cases there may not exist a single dissimilarity measure that captures all possible anomalous patterns. In… ▽ More We consider the problem of identifying patterns in a data set that exhibit anomalous behavior, often referred to as anomaly detection. In most anomaly detection algorithms, the dissimilarity between data samples is calculated by a single criterion, such as Euclidean distance. However, in many cases there may not exist a single dissimilarity measure that captures all possible anomalous patterns. In such a case, multiple criteria can be defined, and one can test for anomalies by scalarizing the multiple criteria using a linear combination of them. If the importance of the different criteria are not known in advance, the algorithm may need to be executed multiple times with different choices of weights in the linear combination. In this paper, we introduce a novel non-parametric multi-criteria anomaly detection method using Pareto depth analysis (PDA). PDA uses the concept of Pareto optimality to detect anomalies under multiple criteria without having to run an algorithm multiple times with different choices of weights. The proposed PDA approach scales linearly in the number of criteria and is provably better than linear combinations of the criteria. △ Less

Submitted 7 January, 2013; v1 submitted 17 October, 2011; originally announced October 2011.

Comments: Removed an unnecessary line from Algorithm 1

ACM Class: I.5; G.3; H.2.8

Journal ref: Advances in Neural Information Processing Systems 25 (2012) 854-862

arXiv:1104.1990 [pdf, other]

doi 10.1007/s10618-012-0302-x

Adaptive Evolutionary Clustering

Authors: Kevin S. Xu, Mark Kliger, Alfred O. Hero III

Abstract: In many practical applications of clustering, the objects to be clustered evolve over time, and a clustering result is desired at each time step. In such applications, evolutionary clustering typically outperforms traditional static clustering by producing clustering results that reflect long-term trends while being robust to short-term variations. Several evolutionary clustering algorithms have r… ▽ More In many practical applications of clustering, the objects to be clustered evolve over time, and a clustering result is desired at each time step. In such applications, evolutionary clustering typically outperforms traditional static clustering by producing clustering results that reflect long-term trends while being robust to short-term variations. Several evolutionary clustering algorithms have recently been proposed, often by adding a temporal smoothness penalty to the cost function of a static clustering method. In this paper, we introduce a different approach to evolutionary clustering by accurately tracking the time-varying proximities between objects followed by static clustering. We present an evolutionary clustering framework that adaptively estimates the optimal smoothing parameter using shrinkage estimation, a statistical approach that improves a naive estimate using additional information. The proposed framework can be used to extend a variety of static clustering algorithms, including hierarchical, k-means, and spectral clustering, into evolutionary clustering algorithms. Experiments on synthetic and real data sets indicate that the proposed framework outperforms static clustering and existing evolutionary clustering algorithms in many scenarios. △ Less

Submitted 19 February, 2013; v1 submitted 11 April, 2011; originally announced April 2011.

Comments: To appear in Data Mining and Knowledge Discovery, MATLAB toolbox available at http://tbayes.eecs.umich.edu/xukevin/affect

ACM Class: I.5.3; H.3.3; G.3

Journal ref: Data Mining and Knowledge Discovery 28 (2014) 304-336

Showing 1–20 of 20 results for author: Xu, K S