-
A Survey of Link Prediction Algorithms
Authors:
Vivian Feng
Abstract:
The problem of link prediction, predicting if two nodes in a network have a connection between them, is a theoretical problem with numerous field-agnostic real-world applications. This paper investigates the efficacy of three classes of link prediction algorithms: local node similarity heuristics, the global index Random Walk with Restart, and Node2Vec embeddings. Furthermore, this paper provides insight into the performance of canonical link prediction algorithms on small graphs. The graphs included in this study are sampled from various domains, including infrastructure and ecological networks.
Submitted 13 June, 2023;
originally announced June 2023.
-
FRANCIS: Fast Reaction Algorithms for Network Coordination In Switches
Authors:
Wenchen Han,
Vic Feng,
Gregory Schwartzman,
Yuliang Li,
Michael Mitzenmacher,
Minlan Yu,
Ran Ben-Basat
Abstract:
Optimizing the reaction to network events, which is critical in tasks such as clock synchronization, multicast, and routing, becomes increasingly challenging as networks grow larger. To improve the reaction time compared to centralized solutions, the theory community has made significant progress in the design of message-passing algorithms that leverage all nodes for distributed computation, and the advent of programmable switches makes it now possible to materialize them.
We propose FRANCIS, a framework and associated libraries for running message-passing algorithms on programmable switches. It features primitives that allow easy integration of such algorithms for quickly reacting to network events while optimizing resource consumption. We use FRANCIS to implement event reaction solutions that improve clock synchronization, source-routed multicast, and routing and demonstrate up to 18x reduction in reaction time.
Submitted 1 November, 2024; v1 submitted 29 April, 2022;
originally announced April 2022.
-
A Tour of Visualization Techniques for Computer Vision Datasets
Authors:
Bilal Alsallakh,
Pamela Bhattacharya,
Vanessa Feng,
Narine Kokhlikyan,
Orion Reblitz-Richardson,
Rahul Rajan,
David Yan
Abstract:
We survey a number of data visualization techniques for analyzing Computer Vision (CV) datasets. These techniques help us understand properties and latent patterns in such data, by applying dataset-level analysis. We present various examples of how such analysis helps predict the potential impact of the dataset properties on CV models and informs appropriate mitigation of their shortcomings. Finally, we explore avenues for further visualization techniques of different modalities of CV datasets as well as ones that are tailored to support specific CV tasks and analysis needs.
Submitted 18 April, 2022;
originally announced April 2022.
-
Two-pass Discourse Segmentation with Pairing and Global Features
Authors:
Vanessa Wei Feng,
Graeme Hirst
Abstract:
Previous attempts at RST-style discourse segmentation typically adopt features centered on a single token to predict whether to insert a boundary before that token. In contrast, we develop a discourse segmenter utilizing a set of pairing features, which are centered on a pair of adjacent tokens in the sentence and take the information from both tokens into account equally. Moreover, we propose a novel set of global features, which encode characteristics of the segmentation as a whole, once we have an initial segmentation. We show that both the pairing and global features are useful on their own, and that their combination achieves an $F_1$ of 92.6% in identifying in-sentence discourse boundaries, which is a 17.8% error-rate reduction over the state-of-the-art performance, approaching 95% of human performance. In addition, similar improvement is observed across different classification frameworks.
Submitted 30 July, 2014;
originally announced July 2014.