-
Networked Sensing for Radiation Detection, Localization, and Tracking
Authors:
R. J. Cooper,
N. Abgrall,
G. Aversano,
M. S. Bandstra,
D. Hellfeld,
T. H. Joshi,
V. Negut,
B. J. Quiter,
E. Rofors,
M. Salathe,
K. Vetter,
P. Beckman,
C. Catlett,
N. Ferrier,
Y. Kim,
R. Sankaran,
S. Shahkarami,
S. Amitkumar,
E. Ayton,
J. Kim,
S. Volkova
Abstract:
The detection, identification, and localization of illicit radiological and nuclear material continue to be key components of nuclear non-proliferation and nuclear security efforts around the world. Networks of radiation detectors deployed at strategic locations in urban environments have the potential to provide continuous radiological/nuclear (R/N) surveillance and provide high probabilities of…
▽ More
The detection, identification, and localization of illicit radiological and nuclear material continue to be key components of nuclear non-proliferation and nuclear security efforts around the world. Networks of radiation detectors deployed at strategic locations in urban environments have the potential to provide continuous radiological/nuclear (R/N) surveillance and provide high probabilities of intercepting threat sources. The integration of contextual information from sensors such as video, Lidar, and meteorological sensors can provide significantly enhanced situational awareness, and improved detection and localization performance through the fusion of the radiological and contextual data. In this work, we present details of our work to establish a city-scale multi-sensor network testbed for intelligent, adaptive R/N detection in urban environments, and develop new techniques that enable city-scale source detection, localization, and tracking.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Anticipating Technical Expertise and Capability Evolution in Research Communities using Dynamic Graph Transformers
Authors:
Sameera Horawalavithana,
Ellyn Ayton,
Anastasiya Usenko,
Robin Cosbey,
Svitlana Volkova
Abstract:
The ability to anticipate technical expertise and capability evolution trends globally is essential for national and global security, especially in safety-critical domains like nuclear nonproliferation (NN) and rapidly emerging fields like artificial intelligence (AI). In this work, we extend traditional statistical relational learning approaches (e.g., link prediction in collaboration networks) a…
▽ More
The ability to anticipate technical expertise and capability evolution trends globally is essential for national and global security, especially in safety-critical domains like nuclear nonproliferation (NN) and rapidly emerging fields like artificial intelligence (AI). In this work, we extend traditional statistical relational learning approaches (e.g., link prediction in collaboration networks) and formulate a problem of anticipating technical expertise and capability evolution using dynamic heterogeneous graph representations. We develop novel capabilities to forecast collaboration patterns, authorship behavior, and technical capability evolution at different granularities (e.g., scientist and institution levels) in two distinct research fields. We implement a dynamic graph transformer (DGT) neural architecture, which pushes the state-of-the-art graph neural network models by (a) forecasting heterogeneous (rather than homogeneous) nodes and edges, and (b) relying on both discrete -- and continuous -- time inputs. We demonstrate that our DGT models predict collaboration, partnership, and expertise patterns with 0.26, 0.73, and 0.53 mean reciprocal rank values for AI and 0.48, 0.93, and 0.22 for NN domains. DGT model performance exceeds the best-performing static graph baseline models by 30-80% across AI and NN domains. Our findings demonstrate that DGT models boost inductive task performance, when previously unseen nodes appear in the test data, for the domains with emerging collaboration patterns (e.g., AI). Specifically, models accurately predict which established scientists will collaborate with early career scientists and vice-versa in the AI domain.
△ Less
Submitted 18 July, 2023;
originally announced July 2023.
-
EXPERT: Public Benchmarks for Dynamic Heterogeneous Academic Graphs
Authors:
Sameera Horawalavithana,
Ellyn Ayton,
Anastasiya Usenko,
Shivam Sharma,
Jasmine Eshun,
Robin Cosbey,
Maria Glenski,
Svitlana Volkova
Abstract:
Machine learning models that learn from dynamic graphs face nontrivial challenges in learning and inference as both nodes and edges change over time. The existing large-scale graph benchmark datasets that are widely used by the community primarily focus on homogeneous node and edge attributes and are static. In this work, we present a variety of large scale, dynamic heterogeneous academic graphs t…
▽ More
Machine learning models that learn from dynamic graphs face nontrivial challenges in learning and inference as both nodes and edges change over time. The existing large-scale graph benchmark datasets that are widely used by the community primarily focus on homogeneous node and edge attributes and are static. In this work, we present a variety of large scale, dynamic heterogeneous academic graphs to test the effectiveness of models developed for multi-step graph forecasting tasks. Our novel datasets cover both context and content information extracted from scientific publications across two communities: Artificial Intelligence (AI) and Nuclear Nonproliferation (NN). In addition, we propose a systematic approach to improve the existing evaluation procedures used in the graph forecasting models.
△ Less
Submitted 14 April, 2022;
originally announced April 2022.
-
Leveraging Community and Author Context to Explain the Performance and Bias of Text-Based Deception Detection Models
Authors:
Galen Weld,
Ellyn Ayton,
Tim Althoff,
Maria Glenski
Abstract:
Deceptive news posts shared in online communities can be detected with NLP models, and much recent research has focused on the development of such models. In this work, we use characteristics of online communities and authors -- the context of how and where content is posted -- to explain the performance of a neural network deception detection model and identify sub-populations who are disproporti…
▽ More
Deceptive news posts shared in online communities can be detected with NLP models, and much recent research has focused on the development of such models. In this work, we use characteristics of online communities and authors -- the context of how and where content is posted -- to explain the performance of a neural network deception detection model and identify sub-populations who are disproportionately affected by model accuracy or failure. We examine who is posting the content, and where the content is posted to. We find that while author characteristics are better predictors of deceptive content than community characteristics, both characteristics are strongly correlated with model performance. Traditional performance metrics such as F1 score may fail to capture poor model performance on isolated sub-populations such as specific authors, and as such, more nuanced evaluation of deception detection models is critical.
△ Less
Submitted 27 April, 2021;
originally announced April 2021.
-
Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages
Authors:
Maria Glenski,
Ellyn Ayton,
Robin Cosbey,
Dustin Arendt,
Svitlana Volkova
Abstract:
Evaluating model robustness is critical when developing trustworthy models not only to gain deeper understanding of model behavior, strengths, and weaknesses, but also to develop future models that are generalizable and robust across expected environments a model may encounter in deployment. In this paper we present a framework for measuring model robustness for an important but difficult text cla…
▽ More
Evaluating model robustness is critical when developing trustworthy models not only to gain deeper understanding of model behavior, strengths, and weaknesses, but also to develop future models that are generalizable and robust across expected environments a model may encounter in deployment. In this paper we present a framework for measuring model robustness for an important but difficult text classification task - deceptive news detection. We evaluate model robustness to out-of-domain data, modality-specific features, and languages other than English.
Our investigation focuses on three type of models: LSTM models trained on multiple datasets(Cross-Domain), several fusion LSTM models trained with images and text and evaluated with three state-of-the-art embeddings, BERT ELMo, and GloVe (Cross-Modality), and character-level CNN models trained on multiple languages (Cross-Language). Our analyses reveal a significant drop in performance when testing neural models on out-of-domain data and non-English languages that may be mitigated using diverse training data. We find that with additional image content as input, ELMo embeddings yield significantly fewer errors compared to BERT orGLoVe. Most importantly, this work not only carefully analyzes deception model robustness but also provides a framework of these analyses that can be applied to new models or extended datasets in the future.
△ Less
Submitted 23 April, 2021;
originally announced April 2021.
-
Evaluating Deception Detection Model Robustness To Linguistic Variation
Authors:
Maria Glenski,
Ellyn Ayton,
Robin Cosbey,
Dustin Arendt,
Svitlana Volkova
Abstract:
With the increasing use of machine-learning driven algorithmic judgements, it is critical to develop models that are robust to evolving or manipulated inputs. We propose an extensive analysis of model robustness against linguistic variation in the setting of deceptive news detection, an important task in the context of misinformation spread online. We consider two prediction tasks and compare thre…
▽ More
With the increasing use of machine-learning driven algorithmic judgements, it is critical to develop models that are robust to evolving or manipulated inputs. We propose an extensive analysis of model robustness against linguistic variation in the setting of deceptive news detection, an important task in the context of misinformation spread online. We consider two prediction tasks and compare three state-of-the-art embeddings to highlight consistent trends in model performance, high confidence misclassifications, and high impact failures. By measuring the effectiveness of adversarial defense strategies and evaluating model susceptibility to adversarial attacks using character- and word-perturbed text, we find that character or mixed ensemble models are the most effective defenses and that character perturbation-based attack tactics are more successful.
△ Less
Submitted 23 April, 2021;
originally announced April 2021.
-
CrossCheck: Rapid, Reproducible, and Interpretable Model Evaluation
Authors:
Dustin Arendt,
Zhuanyi Huang,
Prasha Shrestha,
Ellyn Ayton,
Maria Glenski,
Svitlana Volkova
Abstract:
Evaluation beyond aggregate performance metrics, e.g. F1-score, is crucial to both establish an appropriate level of trust in machine learning models and identify future model improvements. In this paper we demonstrate CrossCheck, an interactive visualization tool for rapid crossmodel comparison and reproducible error analysis. We describe the tool and discuss design and implementation details. We…
▽ More
Evaluation beyond aggregate performance metrics, e.g. F1-score, is crucial to both establish an appropriate level of trust in machine learning models and identify future model improvements. In this paper we demonstrate CrossCheck, an interactive visualization tool for rapid crossmodel comparison and reproducible error analysis. We describe the tool and discuss design and implementation details. We then present three use cases (named entity recognition, reading comprehension, and clickbait detection) that show the benefits of using the tool for model evaluation. CrossCheck allows data scientists to make informed decisions to choose between multiple models, identify when the models are correct and for which examples, investigate whether the models are making the same mistakes as humans, evaluate models' generalizability and highlight models' limitations, strengths and weaknesses. Furthermore, CrossCheck is implemented as a Jupyter widget, which allows rapid and convenient integration into data scientists' model development workflows.
△ Less
Submitted 16 April, 2020;
originally announced April 2020.
-
Multilingual Multimodal Digital Deception Detection and Disinformation Spread across Social Platforms
Authors:
Maria Glenski,
Ellyn Ayton,
Josh Mendoza,
Svitlana Volkova
Abstract:
Our main contribution in this work is novel results of multilingual models that go beyond typical applications of rumor or misinformation detection in English social news content to identify fine-grained classes of digital deception across multiple languages (e.g. Russian, Spanish, etc.). In addition, we present models for multimodal deception detection from images and text and discuss the limitat…
▽ More
Our main contribution in this work is novel results of multilingual models that go beyond typical applications of rumor or misinformation detection in English social news content to identify fine-grained classes of digital deception across multiple languages (e.g. Russian, Spanish, etc.). In addition, we present models for multimodal deception detection from images and text and discuss the limitations of image only and text only models. Finally, we elaborate on the ongoing work on measuring deceptive content (in particular disinformation) spread across social platforms.
△ Less
Submitted 12 September, 2019;
originally announced September 2019.
-
Fishing for Clickbaits in Social Images and Texts with Linguistically-Infused Neural Network Models
Authors:
Maria Glenski,
Ellyn Ayton,
Dustin Arendt,
Svitlana Volkova
Abstract:
This paper presents the results and conclusions of our participation in the Clickbait Challenge 2017 on automatic clickbait detection in social media. We first describe linguistically-infused neural network models and identify informative representations to predict the level of clickbaiting present in Twitter posts. Our models allow to answer the question not only whether a post is a clickbait or…
▽ More
This paper presents the results and conclusions of our participation in the Clickbait Challenge 2017 on automatic clickbait detection in social media. We first describe linguistically-infused neural network models and identify informative representations to predict the level of clickbaiting present in Twitter posts. Our models allow to answer the question not only whether a post is a clickbait or not, but to what extent it is a clickbait post e.g., not at all, slightly, considerably, or heavily clickbaity using a score ranging from 0 to 1. We evaluate the predictive power of models trained on varied text and image representations extracted from tweets. Our best performing model that relies on the tweet text and linguistic markers of biased language extracted from the tweet and the corresponding page yields mean squared error (MSE) of 0.04, mean absolute error (MAE) of 0.16 and R2 of 0.43 on the held-out test data. For the binary classification setup (clickbait vs. non-clickbait), our model achieved F1 score of 0.69. We have not found that image representations combined with text yield significant performance improvement yet. Nevertheless, this work is the first to present preliminary analysis of objects extracted using Google Tensorflow object detection API from images in clickbait vs. non-clickbait Twitter posts. Finally, we outline several steps to improve model performance as a part of the future work.
△ Less
Submitted 17 October, 2017;
originally announced October 2017.