Showing 1–7 of 7 results for author: Dinkov, Y

  1. arXiv:2108.12519

    cs.CL cs.IR cs.LG cs.SI

    Predicting the Factuality of Reporting of News Media Using Observations About User Attention in Their YouTube Channels

    Authors: Krasimira Bozhanova, Yoan Dinkov, Ivan Koychev, Maria Castaldo, Tommaso Venturini, Preslav Nakov

    Abstract: We propose a novel framework for predicting the factuality of reporting of news media outlets by studying the user attention cycles in their YouTube channels. In particular, we design a rich set of features derived from the temporal evolution of the number of views, likes, dislikes, and comments for a video, which we then aggregate to the channel level. We develop and release a dataset for the tas…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Factuality, disinformation, misinformation, fake news, Youtube channels, propaganda, attention cycles

    MSC Class: 68T50; ACM Class: I.2.7

    Journal ref: RANLP-2021

  2. arXiv:2103.17055

    cs.CL stat.ML

    A Neighbourhood Framework for Resource-Lean Content Flagging

    Authors: Sheikh Muhammad Sarwar, Dimitrina Zlatkova, Momchil Hardalov, Yoan Dinkov, Isabelle Augenstein, Preslav Nakov

    Abstract: We propose a novel framework for cross-lingual content flagging with limited target-language data, which significantly outperforms prior work in terms of predictive performance. The framework is based on a nearest-neighbour architecture. It is a modern instantiation of the vanilla k-nearest neighbour model, as we use Transformer representations in all its components. Our framework can adapt to new…

    Submitted 27 January, 2022; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted to appear in Transactions of the Association for Computational Linguistics (TACL) -- this is a pre-MIT Press publication version

  3. arXiv:2103.00153

    cs.CL cs.SI

    Detecting Harmful Content On Online Platforms: What Platforms Need Vs. Where Research Efforts Go

    Authors: Arnav Arora, Preslav Nakov, Momchil Hardalov, Sheikh Muhammad Sarwar, Vibha Nayak, Yoan Dinkov, Dimitrina Zlatkova, Kyle Dent, Ameya Bhatawdekar, Guillaume Bouchard, Isabelle Augenstein

    Abstract: The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self harm, and many other. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more…

    Submitted 6 June, 2023; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: The paper has been accepted for publication to ACM Computing Surveys (CSUR)

  4. arXiv:2011.03080

    cs.CL cs.AI cs.IR cs.LG

    EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering

    Authors: Momchil Hardalov, Todor Mihaylov, Dimitrina Zlatkova, Yoan Dinkov, Ivan Koychev, Preslav Nakov

    Abstract: We propose EXAMS -- a new benchmark dataset for cross-lingual and multilingual question answering for high school examinations. We collected more than 24,000 high-quality high school exam questions in 16 languages, covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others. EXAMS offers a fine-grained evaluation framework across multiple languages…

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: EMNLP 2020, 17 pages, 6 figures, 8 tables

  5. arXiv:2005.04518

    cs.CL cs.IR cs.LG

    What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context

    Authors: Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass, Preslav Nakov

    Abstract: Predicting the political bias and the factuality of reporting of entire news outlets are critical elements of media profiling, which is an understudied but an increasingly important research direction. The present level of proliferation of fake, biased, and propagandistic content online, has made it impossible to fact-check every single suspicious claim, either manually or automatically. Alternati…

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: Factuality of reporting, fact-checking, political ideology, media bias, disinformation, propaganda, social media, news media

    MSC Class: 68T50; ACM Class: I.2.7

    Journal ref: ACL-2020

  6. arXiv:1910.08948

    cs.CL cs.IR cs.SD eess.AS

    Predicting the Leading Political Ideology of YouTube Channels Using Acoustic, Textual, and Metadata Information

    Authors: Yoan Dinkov, Ahmed Ali, Ivan Koychev, Preslav Nakov

    Abstract: We address the problem of predicting the leading political ideology, i.e., left-center-right bias, for YouTube channels of news media. Previous work on the problem has focused exclusively on text and on analysis of the language used, topics discussed, sentiment, and the like. In contrast, here we study videos, which yields an interesting multimodal setup. Starting with gold annotations about the l…

    Submitted 20 October, 2019; originally announced October 2019.

    Comments: media bias, political ideology, Youtube channels, propaganda, disinformation, fake news

    MSC Class: 68T50; ACM Class: I.2.7

    Journal ref: INTERSPEECH-2019

  7. arXiv:1908.09785

    cs.CL cs.IR

    Detecting Toxicity in News Articles: Application to Bulgarian

    Authors: Yoan Dinkov, Ivan Koychev, Preslav Nakov

    Abstract: Online media aim for reaching ever bigger audience and for attracting ever longer attention span. This competition creates an environment that rewards sensational, fake, and toxic news. To help limit their spread and impact, we propose and develop a news toxicity detector that can recognize various types of toxic content. While previous research primarily focused on English, here we target Bulgari…

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: Fact-checking, source reliability, political ideology, news media, Bulgarian, RANLP-2019. arXiv admin note: text overlap with arXiv:1810.01765

    MSC Class: 68T50; ACM Class: I.2.7

    Journal ref: RANLP-2019