Skip to main content

Showing 1–6 of 6 results for author: Dessi, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.07285  [pdf, ps, other

    cs.IR

    Research Knowledge Graphs: the Shifting Paradigm of Scholarly Information Representation

    Authors: Matthäus Zloch, Danilo Dessì, Jennifer D'Souza, Leyla Jael Castro, Benjamin Zapilko, Saurav Karmakar, Brigitte Mathiak, Markus Stocker, Wolfgang Otto, Sören Auer, Stefan Dietze

    Abstract: Sharing and reusing research artifacts, such as datasets, publications, or methods is a fundamental part of scientific activity, where heterogeneity of resources and metadata and the common practice of capturing information in unstructured publications pose crucial challenges. Reproducibility of research and finding state-of-the-art methods or data have become increasingly challenging. In this con… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: Extended Semantic Web Conference 2025, In-use track, 10 pages, 1 figure

  2. arXiv:2504.19536  [pdf, other

    cs.SI

    TeleScope: A Longitudinal Dataset for Investigating Online Discourse and Information Interaction on Telegram

    Authors: Susmita Gangopadhyay, Danilo Dessi, Dimitar Dimitrov, Stefan Dietze

    Abstract: Telegram is a globally popular instant messaging platform known for its strong emphasis on security, privacy, and unique social networking features. It has recently emerged as the host for various cross-domain analysis and research works, such as social media influence, propaganda studies, and extremism. This paper introduces TeleScope, an extensive dataset suite that, to our knowledge, is the lar… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: Accepted at ICWSM 2025

  3. arXiv:2501.04455  [pdf, other

    cs.CL cs.DL

    Hidden Entity Detection from GitHub Leveraging Large Language Models

    Authors: Lu Gan, Martin Blum, Danilo Dessi, Brigitte Mathiak, Ralf Schenkel, Stefan Dietze

    Abstract: Named entity recognition is an important task when constructing knowledge bases from unstructured data sources. Whereas entity detection methods mostly rely on extensive training data, Large Language Models (LLMs) have paved the way towards approaches that rely on zero-shot learning (ZSL) or few-shot learning (FSL) by taking advantage of the capabilities LLMs acquired during pretraining. Specifica… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: accepted by KDD2024 workshop DL4KG

  4. Triplètoile: Extraction of Knowledge from Microblogging Text

    Authors: Vanni Zavarella, Sergio Consoli, Diego Reforgiato Recupero, Gianni Fenu, Simone Angioni, Davide Buscaldi, Danilo Dessì, Francesco Osborne

    Abstract: Numerous methods and pipelines have recently emerged for the automatic extraction of knowledge graphs from documents such as scientific publications and patents. However, adapting these methods to incorporate alternative text sources like micro-blogging posts and news has proven challenging as they struggle to model open-domain entities and relations, typically found in these sources. In this pape… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 42 pages, 6 figures

    MSC Class: 68T01; 68T50 ACM Class: I.2.7; I.2.1

    Journal ref: Heliyon 10(12) (2024) e32479

  5. TF-IDF vs Word Embeddings for Morbidity Identification in Clinical Notes: An Initial Study

    Authors: Danilo Dessi, Rim Helaoui, Vivek Kumar, Diego Reforgiato Recupero, Daniele Riboni

    Abstract: Today, we are seeing an ever-increasing number of clinical notes that contain clinical results, images, and textual descriptions of patient's health state. All these data can be analyzed and employed to cater novel services that can help people and domain experts with their common healthcare tasks. However, many technologies such as Deep Learning and tools like Word Embeddings have started to be i… ▽ More

    Submitted 9 June, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: 12 pages, 2 figures, 2 tables, SmartPhil 2020-First Workshop on Smart Personal Health Interfaces, Associated to ACM IUI 2020

  6. Generating Knowledge Graphs by Employing Natural Language Processing and Machine Learning Techniques within the Scholarly Domain

    Authors: Danilo Dessì, Francesco Osborne, Diego Reforgiato Recupero, Davide Buscaldi, Enrico Motta

    Abstract: The continuous growth of scientific literature brings innovations and, at the same time, raises new challenges. One of them is related to the fact that its analysis has become difficult due to the high volume of published papers for which manual effort for annotations and management is required. Novel technological infrastructures are needed to help researchers, research policy makers, and compani… ▽ More

    Submitted 28 October, 2020; originally announced November 2020.

    Comments: Accepted for publication in Future Generation Computer Systems journal - Special Issue on Machine Learning and Knowledge Graphs