Skip to main content

Showing 1–6 of 6 results for author: Vogel, I

.
  1. arXiv:2504.07643  [pdf, other

    cs.IR cs.CL cs.CV

    CollEX -- A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections

    Authors: Florian Schneider, Narges Baba Ahmadi, Niloufar Baba Ahmadi, Iris Vogel, Martin Semmann, Chris Biemann

    Abstract: In this paper, we introduce CollEx, an innovative multimodal agentic Retrieval-Augmented Generation (RAG) system designed to enhance interactive exploration of extensive scientific collections. Given the overwhelming volume and inherent complexity of scientific collections, conventional search systems often lack necessary intuitiveness and interactivity, presenting substantial barriers for learner… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  2. arXiv:2407.08417  [pdf, other

    cs.LG

    Unveiling the Potential of BERTopic for Multilingual Fake News Analysis -- Use Case: Covid-19

    Authors: Karla Schäfer, Jeong-Eun Choi, Inna Vogel, Martin Steinebach

    Abstract: Topic modeling is frequently being used for analysing large text corpora such as news articles or social media data. BERTopic, consisting of sentence embedding, dimension reduction, clustering, and topic extraction, is the newest and currently the SOTA topic modeling method. However, current topic modeling methods have room for improvement because, as unsupervised methods, they require careful tun… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted at the Workshop on Representation Learning and Clustering (RLC) at the 17th ACM International WSDM Conference in 2024

  3. arXiv:2307.02377  [pdf, other

    cs.CL cs.LG

    Fraunhofer SIT at CheckThat! 2023: Tackling Classification Uncertainty Using Model Souping on the Example of Check-Worthiness Classification

    Authors: Raphael Frick, Inna Vogel, Jeong-Eun Choi

    Abstract: This paper describes the second-placed approach developed by the Fraunhofer SIT team in the CLEF-2023 CheckThat! lab Task 1B for English. Given a text snippet from a political debate, the aim of this task is to determine whether it should be assessed for check-worthiness. Detecting check-worthy statements aims to facilitate manual fact-checking efforts by prioritizing the claims that fact-checkers… ▽ More

    Submitted 27 July, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 9 pages

    Journal ref: CLEF 2023

  4. arXiv:2307.00610  [pdf, other

    cs.LG cs.CL cs.SI

    Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets

    Authors: Raphael Frick, Inna Vogel

    Abstract: The option of sharing images, videos and audio files on social media opens up new possibilities for distinguishing between false information and fake news on the Internet. Due to the vast amount of data shared every second on social media, not all data can be verified by a computer or a human expert. Here, a check-worthiness analysis can be used as a first step in the fact-checking pipeline and as… ▽ More

    Submitted 27 July, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: 8 pages

    Journal ref: CLEF 2023

  5. arXiv:2009.13859  [pdf, ps, other

    cs.CL

    Fake News Spreader Detection on Twitter using Character N-Grams. Notebook for PAN at CLEF 2020

    Authors: Inna Vogel, Meghana Meghana

    Abstract: The authors of fake news often use facts from verified news sources and mix them with misinformation to create confusion and provoke unrest among the readers. The spread of fake news can thereby have serious implications on our society. They can sway political elections, push down the stock price or crush reputations of corporations or public figures. Several websites have taken on the mission of… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: CLEF 2020 Labs and Workshops, Notebook Papers

  6. arXiv:2009.13367   

    cs.CL cs.IR

    Similarity Detection Pipeline for Crawling a Topic Related Fake News Corpus

    Authors: Inna Vogel, Jeong-Eun Choi, Meghana Meghana

    Abstract: Fake news detection is a challenging task aiming to reduce human time and effort to check the truthfulness of news. Automated approaches to combat fake news, however, are limited by the lack of labeled benchmark datasets, especially in languages other than English. Moreover, many publicly available corpora have specific limitations that make them difficult to use. To address this problem, our cont… ▽ More

    Submitted 1 March, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Further development done