-
Inductive detection of Influence Operations via Graph Learning
Authors:
Nicholas A. Gabriel,
David A. Broniatowski,
Neil F. Johnson
Abstract:
Influence operations are large-scale efforts to manipulate public opinion. The rapid detection and disruption of these operations is critical for healthy public discourse. Emergent AI technologies may enable novel operations which evade current detection methods and influence public discourse on social media with greater scale, reach, and specificity. New methods with inductive learning capacity w…
▽ More
Influence operations are large-scale efforts to manipulate public opinion. The rapid detection and disruption of these operations is critical for healthy public discourse. Emergent AI technologies may enable novel operations which evade current detection methods and influence public discourse on social media with greater scale, reach, and specificity. New methods with inductive learning capacity will be needed to identify these novel operations before they indelibly alter public opinion and events. We develop an inductive learning framework which: 1) determines content- and graph-based indicators that are not specific to any operation; 2) uses graph learning to encode abstract signatures of coordinated manipulation; and 3) evaluates generalization capacity by training and testing models across operations originating from Russia, China, and Iran. We find that this framework enables strong cross-operation generalization while also revealing salient indicators$\unicode{x2013}$illustrating a generic approach which directly complements transductive methodologies, thereby enhancing detection coverage.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
GisPy: A Tool for Measuring Gist Inference Score in Text
Authors:
Pedram Hosseini,
Christopher R. Wolfe,
Mona Diab,
David A. Broniatowski
Abstract:
Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that individuals tend to rely on gist, or bottom-line meaning, in the text when making decisions. In this work, we delineate the process of developing GisPy, an open-source tool in Python for measuring the Gist Inference Score (GIS) in text. Evaluation of GisPy on documents in three benchmarks from the news and scientific text domai…
▽ More
Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that individuals tend to rely on gist, or bottom-line meaning, in the text when making decisions. In this work, we delineate the process of developing GisPy, an open-source tool in Python for measuring the Gist Inference Score (GIS) in text. Evaluation of GisPy on documents in three benchmarks from the news and scientific text domains demonstrates that scores generated by our tool significantly distinguish low vs. high gist documents. Our tool is publicly available to use at: https://github.com/phosseini/GisPy.
△ Less
Submitted 25 May, 2022;
originally announced May 2022.
-
Facebook's Architecture Undermines Vaccine Misinformation Removal Efforts
Authors:
David A. Broniatowski,
Jiayan Gu,
Amelia M. Jamison,
Joseph R. Simons,
Lorien C. Abroms
Abstract:
Misinformation promotes distrust in science, undermines public health, and may drive civil unrest. Vaccine misinformation, in particular, has stalled efforts to overcome the COVID-19 pandemic, prompting social media platforms' attempts to reduce it. Some have questioned whether "soft" content moderation remedies -- e.g., flagging and downranking misinformation -- were successful, suggesting that t…
▽ More
Misinformation promotes distrust in science, undermines public health, and may drive civil unrest. Vaccine misinformation, in particular, has stalled efforts to overcome the COVID-19 pandemic, prompting social media platforms' attempts to reduce it. Some have questioned whether "soft" content moderation remedies -- e.g., flagging and downranking misinformation -- were successful, suggesting that the addition of "hard" content remedies -- e.g., deplatforming and content bans -- is necessary. We therefore examined whether Facebook's vaccine misinformation content removal policies were effective. Here, we show that Facebook's policies reduced the number of anti-vaccine posts but also caused several perverse effects: pro-vaccine content was also removed, engagement with remaining anti-vaccine content repeatedly recovered to pre-policy levels, and this content became more misinformative, more politically polarised, and more likely to be seen in users' newsfeeds. We explain these results as an unintended consequence of Facebook's design goal: promoting community formation. Members of communities dedicated to vaccine refusal appear to seek out misinformation from multiple sources. Community administrators make use of several channels afforded by the Facebook platform to disseminate misinformation. Our findings suggest the need to address how social media platform architecture enables community formation and mobilisation around misinformative topics when managing the spread of online content.
△ Less
Submitted 11 August, 2022; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Applying Word Embeddings to Measure Valence in Information Operations Targeting Journalists in Brazil
Authors:
David A. Broniatowski
Abstract:
Among the goals of information operations are to change the overall information environment vis-á-vis specific actors. For example, "trolling campaigns" seek to undermine the credibility of specific public figures, leading others to distrust them and intimidating these figures into silence. To accomplish these aims, information operations frequently make use of "trolls" -- malicious online actors…
▽ More
Among the goals of information operations are to change the overall information environment vis-á-vis specific actors. For example, "trolling campaigns" seek to undermine the credibility of specific public figures, leading others to distrust them and intimidating these figures into silence. To accomplish these aims, information operations frequently make use of "trolls" -- malicious online actors who target verbal abuse at these figures. In Brazil, in particular, allies of Brazil's current president have been accused of operating a "hate cabinet" -- a trolling operation that targets journalists who have alleged corruption by this politician and other members of his regime. Leading approaches to detecting harmful speech, such as Google's Perspective API, seek to identify specific messages with harmful content. While this approach is helpful in identifying content to downrank, flag, or remove, it is known to be brittle, and may miss attempts to introduce more subtle biases into the discourse. Here, we aim to develop a measure that might be used to assess how targeted information operations seek to change the overall valence, or appraisal, of specific actors. Preliminary results suggest known campaigns target female journalists more so than male journalists, and that these campaigns may leave detectable traces in overall Twitter discourse.
△ Less
Submitted 6 January, 2022;
originally announced January 2022.
-
Knowledge-Augmented Language Models for Cause-Effect Relation Classification
Authors:
Pedram Hosseini,
David A. Broniatowski,
Mona Diab
Abstract:
Previous studies have shown the efficacy of knowledge augmentation methods in pretrained language models. However, these methods behave differently across domains and downstream tasks. In this work, we investigate the augmentation of pretrained language models with commonsense knowledge in the cause-effect relation classification and commonsense causal reasoning tasks. After automatically verbaliz…
▽ More
Previous studies have shown the efficacy of knowledge augmentation methods in pretrained language models. However, these methods behave differently across domains and downstream tasks. In this work, we investigate the augmentation of pretrained language models with commonsense knowledge in the cause-effect relation classification and commonsense causal reasoning tasks. After automatically verbalizing ATOMIC2020, a wide coverage commonsense reasoning knowledge graph, and GLUCOSE, a dataset of implicit commonsense causal knowledge, we continually pretrain BERT and RoBERTa with the verbalized data. Then we evaluate the resulting models on cause-effect pair classification and answering commonsense causal reasoning questions. Our results show that continually pretrained language models augmented with commonsense knowledge outperform our baselines on two commonsense causal reasoning benchmarks, COPA and BCOPA-CE, and the Temporal and Causal Reasoning (TCR) dataset, without additional improvement in model architecture or using quality-enhanced data for fine-tuning.
△ Less
Submitted 1 June, 2022; v1 submitted 15 December, 2021;
originally announced December 2021.
-
Predicting Directionality in Causal Relations in Text
Authors:
Pedram Hosseini,
David A. Broniatowski,
Mona Diab
Abstract:
In this work, we test the performance of two bidirectional transformer-based language models, BERT and SpanBERT, on predicting directionality in causal pairs in the textual content. Our preliminary results show that predicting direction for inter-sentence and implicit causal relations is more challenging. And, SpanBERT performs better than BERT on causal samples with longer span length. We also in…
▽ More
In this work, we test the performance of two bidirectional transformer-based language models, BERT and SpanBERT, on predicting directionality in causal pairs in the textual content. Our preliminary results show that predicting direction for inter-sentence and implicit causal relations is more challenging. And, SpanBERT performs better than BERT on causal samples with longer span length. We also introduce CREST which is a framework for unifying a collection of scattered datasets of causal relations.
△ Less
Submitted 25 March, 2021;
originally announced March 2021.
-
A Multi-Modal Method for Satire Detection using Textual and Visual Cues
Authors:
Lily Li,
Or Levi,
Pedram Hosseini,
David A. Broniatowski
Abstract:
Satire is a form of humorous critique, but it is sometimes misinterpreted by readers as legitimate news, which can lead to harmful consequences. We observe that the images used in satirical news articles often contain absurd or ridiculous content and that image manipulation is used to create fictional scenarios. While previous work have studied text-based methods, in this work we propose a multi-m…
▽ More
Satire is a form of humorous critique, but it is sometimes misinterpreted by readers as legitimate news, which can lead to harmful consequences. We observe that the images used in satirical news articles often contain absurd or ridiculous content and that image manipulation is used to create fictional scenarios. While previous work have studied text-based methods, in this work we propose a multi-modal approach based on state-of-the-art visiolinguistic model ViLBERT. To this end, we create a new dataset consisting of images and headlines of regular and satirical news for the task of satire detection. We fine-tune ViLBERT on the dataset and train a convolutional neural network that uses an image forensics technique. Evaluation on the dataset shows that our proposed multi-modal approach outperforms image-only, text-only, and simple fusion baselines.
△ Less
Submitted 13 October, 2020;
originally announced October 2020.
-
The Role of Individual User Differences in Interpretable and Explainable Machine Learning Systems
Authors:
Lydia P. Gleaves,
Reva Schwartz,
David A. Broniatowski
Abstract:
There is increased interest in assisting non-expert audiences to effectively interact with machine learning (ML) tools and understand the complex output such systems produce. Here, we describe user experiments designed to study how individual skills and personality traits predict interpretability, explainability, and knowledge discovery from ML generated model output. Our work relies on Fuzzy Trac…
▽ More
There is increased interest in assisting non-expert audiences to effectively interact with machine learning (ML) tools and understand the complex output such systems produce. Here, we describe user experiments designed to study how individual skills and personality traits predict interpretability, explainability, and knowledge discovery from ML generated model output. Our work relies on Fuzzy Trace Theory, a leading theory of how humans process numerical stimuli, to examine how different end users will interpret the output they receive while interacting with the ML system. While our sample was small, we found that interpretability -- being able to make sense of system output -- and explainability -- understanding how that output was generated -- were distinct aspects of user experience. Additionally, subjects were more able to interpret model output if they possessed individual traits that promote metacognitive monitoring and editing, associated with more detailed, verbatim, processing of ML output. Finally, subjects who are more familiar with ML systems felt better supported by them and more able to discover new patterns in data; however, this did not necessarily translate to meaningful insights. Our work motivates the design of systems that explicitly take users' mental representations into account during the design process to more effectively support end user requirements.
△ Less
Submitted 14 September, 2020;
originally announced September 2020.
-
Twitter and Facebook posts about COVID-19 are less likely to spread false and low-credibility content compared to other health topics
Authors:
David A. Broniatowski,
Daniel Kerchner,
Fouzia Farooq,
Xiaolei Huang,
Amelia M. Jamison,
Mark Dredze,
Sandra Crouse Quinn
Abstract:
On February 2, 2020, the World Health Organization declared a COVID-19 social media "infodemic", with special attention to misinformation -- frequently understood as false claims. To understand the infodemic's scope and scale, we analyzed over 500 million posts from Twitter and Facebook about COVID-19 and other health topics, between March 8 and May 1, 2020. Following prior work, we assumed URL so…
▽ More
On February 2, 2020, the World Health Organization declared a COVID-19 social media "infodemic", with special attention to misinformation -- frequently understood as false claims. To understand the infodemic's scope and scale, we analyzed over 500 million posts from Twitter and Facebook about COVID-19 and other health topics, between March 8 and May 1, 2020. Following prior work, we assumed URL source credibility is a proxy for false content, but we also tested this assumption. Contrary to expectations, we found that messages about COVID-19 were more likely to contain links to more credible sources. Additionally, messages linking to government sources, and to news with intermediate credibility, were shared more often, on average, than links to non-credible sources. These results suggest that more ambiguous forms of misinformation about COVID-19 may be more likely to be disseminated through credible sources when compared to other health topics. Furthermore, the assumption that credibility is an adequate proxy for false content may overestimate the prevalence of false content online: less than 25% of posts linking to the least credible sources contained false content. Our results emphasize the importance of distinguishing between explicit falsehoods and more ambiguous forms of misinformation due to the search for meaning in an environment of scientific uncertainty.
△ Less
Submitted 26 March, 2021; v1 submitted 19 July, 2020;
originally announced July 2020.
-
Content analysis of Persian/Farsi Tweets during COVID-19 pandemic in Iran using NLP
Authors:
Pedram Hosseini,
Poorya Hosseini,
David A. Broniatowski
Abstract:
Iran, along with China, South Korea, and Italy was among the countries that were hit hard in the first wave of the COVID-19 spread. Twitter is one of the widely-used online platforms by Iranians inside and abroad for sharing their opinion, thoughts, and feelings about a wide range of issues. In this study, using more than 530,000 original tweets in Persian/Farsi on COVID-19, we analyzed the topics…
▽ More
Iran, along with China, South Korea, and Italy was among the countries that were hit hard in the first wave of the COVID-19 spread. Twitter is one of the widely-used online platforms by Iranians inside and abroad for sharing their opinion, thoughts, and feelings about a wide range of issues. In this study, using more than 530,000 original tweets in Persian/Farsi on COVID-19, we analyzed the topics discussed among users, who are mainly Iranians, to gauge and track the response to the pandemic and how it evolved over time. We applied a combination of manual annotation of a random sample of tweets and topic modeling tools to classify the contents and frequency of each category of topics. We identified the top 25 topics among which living experience under home quarantine emerged as a major talking point. We additionally categorized broader content of tweets that shows satire, followed by news, is the dominant tweet type among the Iranian users. While this framework and methodology can be used to track public response to ongoing developments related to COVID-19, a generalization of this framework can become a useful framework to gauge Iranian public reaction to ongoing policy measures or events locally and internationally.
△ Less
Submitted 17 May, 2020;
originally announced May 2020.
-
The Twitter Social Mobility Index: Measuring Social Distancing Practices from Geolocated Tweets
Authors:
Paiheng Xu,
Mark Dredze,
David A. Broniatowski
Abstract:
Social distancing is an important component of the response to the novel Coronavirus (COVID-19) pandemic. Minimizing social interactions and travel reduces the rate at which the infection spreads, and "flattens the curve" such that the medical system can better treat infected individuals. However, it remains unclear how the public will respond to these policies. This paper presents the Twitter Soc…
▽ More
Social distancing is an important component of the response to the novel Coronavirus (COVID-19) pandemic. Minimizing social interactions and travel reduces the rate at which the infection spreads, and "flattens the curve" such that the medical system can better treat infected individuals. However, it remains unclear how the public will respond to these policies. This paper presents the Twitter Social Mobility Index, a measure of social distancing and travel derived from Twitter data. We use public geolocated Twitter data to measure how much a user travels in a given week. We find a large reduction in travel in the United States after the implementation of social distancing policies, with larger reductions in states that were early adopters and smaller changes in states without policies. Our findings are presented on http://socialmobility.covid19dataresources.org and we will continue to update our analysis during the pandemic.
△ Less
Submitted 8 June, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Identifying Nuances in Fake News vs. Satire: Using Semantic and Linguistic Cues
Authors:
Or Levi,
Pedram Hosseini,
Mona Diab,
David A. Broniatowski
Abstract:
The blurry line between nefarious fake news and protected-speech satire has been a notorious struggle for social media platforms. Further to the efforts of reducing exposure to misinformation on social media, purveyors of fake news have begun to masquerade as satire sites to avoid being demoted. In this work, we address the challenge of automatically classifying fake news versus satire. Previous w…
▽ More
The blurry line between nefarious fake news and protected-speech satire has been a notorious struggle for social media platforms. Further to the efforts of reducing exposure to misinformation on social media, purveyors of fake news have begun to masquerade as satire sites to avoid being demoted. In this work, we address the challenge of automatically classifying fake news versus satire. Previous work have studied whether fake news and satire can be distinguished based on language differences. Contrary to fake news, satire stories are usually humorous and carry some political or social message. We hypothesize that these nuances could be identified using semantic and linguistic cues. Consequently, we train a machine learning method using semantic representation, with a state-of-the-art contextual language model, and with linguistic features based on textual coherence metrics. Empirical evaluation attests to the merits of our approach compared to the language-based baseline and sheds light on the nuances between fake news and satire. As avenues for future work, we consider studying additional linguistic features related to the humor aspect, and enriching the data with current news events, to help identify a political or social message.
△ Less
Submitted 5 November, 2019; v1 submitted 2 October, 2019;
originally announced October 2019.