Showing 1–50 of 54 results for author: Peroni, S

  1. arXiv:2505.13276  [pdf, ps, other]

    cs.DL

    CHAD-KG: A Knowledge Graph for Representing Cultural Heritage Objects and Digitisation Paradata

    Authors: Sebastian Barzaghi, Arianna Moretti, Ivan Heibi, Silvio Peroni

    Abstract: This paper presents CHAD-KG, a knowledge graph designed to describe bibliographic metadata and digitisation paradata of cultural heritage objects in exhibitions, museums, and collections. It also documents the related data model and materialisation engine. Originally based on two tabular datasets, the data was converted into RDF according to CHAD-AP, an OWL application profile built on standards l…

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2504.12195  [pdf, other]

    cs.DL

    Validating and monitoring bibliographic and citation data in OpenCitations collections

    Authors: Ivan Heibi, Silvio Peroni, Elia Rizzetto

    Abstract: Purpose. The increasing emphasis on data quantity in research infrastructures has highlighted the need for equally robust mechanisms ensuring data quality, particularly in bibliographic and citation datasets. This paper addresses the challenge of maintaining high-quality open research information within OpenCitations, a community-guided Open Science Infrastructure, by introducing tools for validat…

    Submitted 16 April, 2025; originally announced April 2025.

    ACM Class: H.3.7

  3. arXiv:2503.13464  [pdf]

    cs.DL

    Mapping Research Data at the University of Bologna

    Authors: C. Basalti, G. Caldoni, S. Coppini, B. Gualandi, M. Marino, F. Masini, S. Peroni

    Abstract: Research data management (RDM) strategies and practices play a pivotal role in adhering to the paradigms of reproducibility and transparency by enabling research sharing in accordance with the principles of Open Science. Discipline-specificity is an essential factor when understanding RDM declinations, to tailor a comprehensive support service and to enhance interdisciplinarity. In this paper we…

    Submitted 26 February, 2025; originally announced March 2025.

    Comments: 32 pages, 12 figures

  4. arXiv:2503.13448  [pdf, other]

    cs.DL cs.CL

    Recent Developments in Deep Learning-based Author Name Disambiguation

    Authors: Francesca Cappelli, Giovanni Colavizza, Silvio Peroni

    Abstract: Author Name Disambiguation (AND) is a critical task for digital libraries aiming to link existing authors with their respective publications. Due to the lack of persistent identifiers used by researchers and the presence of intrinsic linguistic challenges, such as homonymy, the development of Deep Learning algorithms to address this issue has become widespread. Many AND deep learning methods have…

    Submitted 23 December, 2024; originally announced March 2025.

  5. arXiv:2501.16197  [pdf]

    cs.DL

    HERITRACE: A User-Friendly Semantic Data Editor with Change Tracking and Provenance Management for Cultural Heritage Institutions

    Authors: Arcangelo Massari, Silvio Peroni

    Abstract: HERITRACE is a data editor designed for galleries, libraries, archives and museums, aimed at simplifying data curation while enabling non-technical domain experts to manage data intuitively without losing its semantic integrity. While the semantic nature of RDF can pose a barrier to data curation due to its complexity, HERITRACE conceals this intricacy while preserving the advantages of semantic r…

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: 22 pages, 5 figures, 2 tables, submitted to Umanistica Digitale

  6. arXiv:2501.05821  [pdf]

    cs.DL

    Analysing the coverage of the University of Bologna's publication metadata in an existing source of open research information

    Authors: Erica Andreose, Salvatore Di Marzo, Ivan Heibi, Silvio Peroni, Leonardo Zilli

    Abstract: This study focuses on analysing the coverage of publications' metadata available in the Current Research Information System (CRIS) infrastructure of the University of Bologna (UNIBO), implemented by the IRIS platform, within an authoritative source of open research information, i.e. OpenCitations. The analysis considers data regarding the publication entities alongside the citation links. We preci…

    Submitted 10 January, 2025; originally announced January 2025.

  7. arXiv:2412.05880  [pdf]

    cs.GR cs.DL

    Leveraging virtual technologies to enhance museums and art collections: insights from project CHANGES

    Authors: Gianluca Genovese, Ivan Heibi, Silvio Peroni, Sofia Pescarin

    Abstract: We investigated the use of virtual technologies to digitise and enhance cultural heritage (CH), aligning with Open Science and FAIR principles. Through case studies in museums, we developed reproducible workflows, 3D models, and tools fostering accessibility, inclusivity, and sustainability of CH. Applications include interdisciplinary research, educational innovation, and CH preservation.

    Submitted 8 December, 2024; originally announced December 2024.

  8. The OpenCitations Index

    Authors: Ivan Heibi, Arianna Moretti, Silvio Peroni, Marta Soricetti

    Abstract: This article presents the OpenCitations Index, a collection of open citation data maintained by OpenCitations, an independent, not-for-profit infrastructure organisation for open scholarship dedicated to publishing open bibliographic and citation data using Semantic Web and Linked Open Data technologies. The collection involves citation data harvested from multiple sources. To address the possibil…

    Submitted 5 August, 2024; originally announced August 2024.

    Journal ref: Scientometrics (2024)

  9. CiteFusion: An Ensemble Framework for Citation Intent Classification Harnessing Dual-Model Binary Couples and SHAP Analyses

    Authors: Lorenzo Paolini, Sahar Vahdati, Angelo Di Iorio, Robert Wardenga, Ivan Heibi, Silvio Peroni

    Abstract: Understanding the motivations underlying scholarly citations is essential to evaluate research impact and promote transparent scholarly communication. This study introduces CiteFusion, an ensemble framework designed to address the multi-class Citation Intent Classification task on two benchmark datasets: SciCite and ACL-ARC. The framework employs a one-vs-all decomposition of the multi-class task…

    Submitted 11 June, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Submitted to Scientometrics Journal

  10. A Proposal for a FAIR Management of 3D Data in Cultural Heritage: The Aldrovandi Digital Twin Case

    Authors: Sebastian Barzaghi, Alice Bordignon, Bianca Gualandi, Ivan Heibi, Arcangelo Massari, Arianna Moretti, Silvio Peroni, Giulia Renda

    Abstract: In this article we analyse 3D models of cultural heritage with the aim of answering three main questions: what processes can be put in place to create a FAIR-by-design digital twin of a temporary exhibition? What are the main challenges in applying FAIR principles to 3D data in cultural heritage studies and how are they different from other types of data (e.g. images) from a data management perspe…

    Submitted 22 January, 2025; v1 submitted 2 July, 2024; originally announced July 2024.

    Journal ref: Data Intelligence, Vol. 6, Issue 4, 2024, pp. 1190-221, ISSN 2096-7004

  11. arXiv:2405.02113  [pdf]

    cs.DL

    A Workflow for GLAM Metadata Crosswalk

    Authors: Arianna Moretti, Ivan Heibi, Silvio Peroni

    Abstract: The acquisition of physical artifacts not only involves transferring existing information into the digital ecosystem but also generates information as a process itself, underscoring the importance of meticulous management of FAIR data and metadata. In addition, the diversity of objects within the cultural heritage domain is reflected in a multitude of descriptive models. The digitization process e…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Submitted to AIUCD conference 2024; 1 figure, 8 pages

  12. arXiv:2404.12069  [pdf, other]

    cs.DL

    Developing Application Profiles for Enhancing Data and Workflows in Cultural Heritage Digitisation Processes

    Authors: Sebastian Barzaghi, Ivan Heibi, Arianna Moretti, Silvio Peroni

    Abstract: As a result of the proliferation of 3D digitisation in the context of cultural heritage projects, digital assets and digitisation processes - being considered as proper research objects - must prioritise adherence to FAIR principles. Existing standards and ontologies, such as CIDOC CRM, play a crucial role in this regard, but they are often over-engineered for the need of a particular application…

    Submitted 2 August, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  13. Thinking Outside the Black Box: Insights from a Digital Exhibition in the Humanities

    Authors: Sebastian Barzaghi, Alice Bordignon, Bianca Gualandi, Silvio Peroni

    Abstract: One of the main goals of Open Science is to make research more reproducible. There is no consensus, however, on what exactly "reproducibility" is, as opposed for example to "replicability", and how it applies to different research fields. After a short review of the literature on reproducibility/replicability with a focus on the humanities, we describe how the creation of the digital twin of the t…

    Submitted 10 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted to the AIUCD2024 Conference: https://aiucd2024.unict.it/ - will be published in conference proceedings

    Journal ref: Di Silvestro, A.; Spampinato, D. (eds.) (2024) Proceedings del XIII Convegno Annuale AIUCD2024. ISBN 978-88-942535-8-0. In: Quaderni di Umanistica Digitale

  14. arXiv:2402.00477  [pdf]

    cs.DL

    HERITRACE: Tracing Evolution and Bridging Data for Streamlined Curatorial Work in the GLAM Domain

    Authors: Arcangelo Massari, Silvio Peroni

    Abstract: HERITRACE is a semantic data management system tailored for the GLAM sector. It is engineered to streamline data curation for non-technical users while also offering an efficient administrative interface for technical staff. The paper compares HERITRACE with other established platforms such as OmekaS, Semantic MediaWiki, Research Space, and CLEF, emphasizing its advantages in user friendliness, pr…

    Submitted 24 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 5 pages, 1 figure, submitted to AIUCD 2024

  15. arXiv:2312.16523  [pdf]

    cs.DL

    Mapping bibliographic metadata collections: the case of OpenCitations Meta and OpenAlex

    Authors: Elia Rizzetto, Silvio Peroni

    Abstract: This study describes the methodology and analyses the results of the process of mapping entities between two large open bibliographic metadata collections, OpenCitations Meta and OpenAlex. The primary objective of this mapping is to integrate OpenAlex internal identifiers into the existing metadata of bibliographic resources in OpenCitations Meta, thereby interlinking and aligning these collection…

    Submitted 27 December, 2023; originally announced December 2023.

  16. Saving temporary exhibitions in virtual environments: the Digital Renaissance of Ulisse Aldrovandi -- acquisition and digitisation of cultural heritage objects

    Authors: Roberto Balzani, Sebastian Barzaghi, Gabriele Bitelli, Federica Bonifazi, Alice Bordignon, Luca Cipriani, Simona Colitti, Federica Collina, Marilena Daquino, Francesca Fabbri, Bruno Fanini, Filippo Fantini, Daniele Ferdani, Giulia Fiorini, Elena Formia, Anna Forte, Federica Giacomini, Valentina Alena Girelli, Bianca Gualandi, Ivan Heibi, Alessandro Iannucci, Rachele Manganelli Del Fà, Arcangelo Massari, Arianna Moretti, Silvio Peroni, et al. (8 additional authors not shown)

    Abstract: As per the objectives of Project CHANGES, particularly its thematic sub-project on the use of virtual technologies for museums and art collections, our goal was to obtain a digital twin of the temporary exhibition on Ulisse Aldrovandi called "The Other Renaissance", and make it accessible to users online. After a preliminary study of the exhibition, focussing on acquisition constraints and related…

    Submitted 27 December, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  17. arXiv:2308.13573  [pdf]

    cs.DL

    Retractions in Arts and Humanities: an Analysis of the Retraction Notices

    Authors: Ivan Heibi, Silvio Peroni

    Abstract: The aim of this work is to understand the retraction phenomenon in the arts and humanities domain through an analysis of the retraction notices: formal documents stating and describing the retraction of a particular publication. The retractions and the corresponding notices are identified using the data provided by Retraction Watch. Our methodology for the analysis combines a metadata analysis and…

    Submitted 25 August, 2023; originally announced August 2023.

  18. arXiv:2307.01718  [pdf]

    cs.DB cs.DL

    A Prototype for a Controlled and Valid RDF Data Production Using SHACL

    Authors: Elia Rizzetto, Arcangelo Massari, Ivan Heibi, Silvio Peroni

    Abstract: The paper introduces a tool prototype that combines SHACL's capabilities with ad-hoc validation functions to create a controlled and user-friendly form interface for producing valid RDF data. The proposed tool is developed within the context of the OpenCitations Data Model (OCDM) use case. The paper discusses the current status of the tool, outlines the future steps required for achieving full fun…

    Submitted 4 July, 2023; originally announced July 2023.

  19. OpenCitations Meta

    Authors: Arcangelo Massari, Fabio Mariani, Ivan Heibi, Silvio Peroni, David Shotton

    Abstract: OpenCitations Meta is a new database that contains bibliographic metadata of scholarly publications involved in citations indexed by the OpenCitations infrastructure. It adheres to Open Science principles and provides data under a CC0 license for maximum reuse. The data can be accessed through a SPARQL endpoint, REST APIs, and dumps. OpenCitations Meta serves three important purposes. Firstly, it…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 26 pages, 7 figures

    Journal ref: Quantitative Science Studies 2024. 5 (1) 50-75

  20. arXiv:2305.08477  [pdf]

    cs.DL

    Representing provenance and track changes of cultural heritage metadata in RDF: a survey of existing approaches

    Authors: Arcangelo Massari, Silvio Peroni, Francesca Tomasi, Ivan Heibi

    Abstract: In the realm of Digital Humanities, the management of cultural heritage metadata is pivotal for ensuring data trustworthiness. Provenance information - contextual metadata detailing the origin and history of data - plays a crucial role in this process. However, tracking provenance and changes in metadata using the Resource Description Framework (RDF) presents significant challenges due to the limi…

    Submitted 22 September, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 23 pages, 4 figures, submitted to Digital Scholarship in the Humanities

  21. A maturity model for catalogues of semantic artefacts

    Authors: Oscar Corcho, Fajar J. Ekaputra, Ivan Heibi, Clement Jonquet, Andras Micsik, Silvio Peroni, Emanuele Storti

    Abstract: This work presents a maturity model for assessing catalogues of semantic artefacts, one of the keystones that permit semantic interoperability of systems. We defined the dimensions and related features to include in the maturity model by analysing the current literature and existing catalogues of semantic artefacts provided by experts. In addition, we assessed 26 different catalogues to demonstrat…

    Submitted 24 March, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Journal ref: Scientific Data, 11, 479

  22. arXiv:2210.02534  [pdf]

    cs.DB

    Performing live time-traversal queries via SPARQL on RDF datasets

    Authors: Arcangelo Massari, Silvio Peroni

    Abstract: This article introduces a methodology to perform live time-traversal SPARQL queries on RDF datasets and software based on this methodology that offers a solution to manage the provenance and change-tracking of entities described using RDF. These are crucial factors in ensuring verifiability and trust. Nevertheless, some of the most prominent knowledge bases - including DBpedia, Wikidata, Yago, and…

    Submitted 12 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 26 pages, 10 figures, 3 tables, submitted to the Journal of the Association for Information Science and Technology (JASIST)

  23. arXiv:2209.06091  [pdf]

    cs.DL

    Approaching Digital Humanities at the University: a Cultural Challenge

    Authors: Silvio Peroni, Francesca Tomasi

    Abstract: The University of Bologna has a long tradition in Digital Humanities, both at the level of research and teaching. In this article, we want to introduce some experiences in developing new educational models based on the idea of transversal learning, collaborative approaches and projects-oriented outputs, together with the definition of research fields within this vast domain, accompanied by practic…

    Submitted 27 November, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

  24. arXiv:2206.07476  [pdf]

    cs.DL

    OpenCitations, an open e-infrastructure to foster maximum reuse of citation data

    Authors: Chiara Di Giambattista, Ivan Heibi, Silvio Peroni, David Shotton

    Abstract: OpenCitations is an independent not-for-profit infrastructure organization for open scholarship dedicated to the publication of open bibliographic and citation data by the use of Semantic Web (Linked Data) technologies. OpenCitations collaborates with projects that are part of the Open Science ecosystem and complies with the UNESCO founding principles of Open Science, the I4OC recommendations, and…

    Submitted 15 June, 2022; originally announced June 2022.

  25. Enabling Portability and Reusability of Open Science Infrastructures

    Authors: Giuseppe Grieco, Ivan Heibi, Arcangelo Massari, Arianna Moretti, Silvio Peroni

    Abstract: This paper presents a methodology for designing a containerized and distributed open science infrastructure to simplify its reusability, replicability, and portability in different environments. The methodology is depicted in a step-by-step schema based on four main phases: (1) Analysis, (2) Design, (3) Definition, and (4) Managing and provisioning. We accompany the description of each step with e…

    Submitted 28 July, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 8 pages, 1 PostScript figure, submitted to TPDL 2022

    Journal ref: Linking Theory and Practice of Digital Libraries. TPDL 2022. Lecture Notes in Computer Science, vol 13541. Springer, Cham

  26. arXiv:2205.14677  [pdf]

    cs.DL

    Structured references from PDF articles: assessing the tools for bibliographic reference extraction and parsing

    Authors: Alessia Cioffi, Silvio Peroni

    Abstract: Many solutions have been provided to extract bibliographic references from PDF papers. Machine learning, rule-based and regular expressions approaches were among the most used methods adopted in tools for addressing this task. This work aims to identify and evaluate all and only the tools which, given a full-text paper in PDF format, can recognise, extract and parse bibliographic references. We id…

    Submitted 6 September, 2022; v1 submitted 29 May, 2022; originally announced May 2022.

  27. arXiv:2205.13419  [pdf]

    cs.DL

    The way we cite: common metadata used across disciplines for defining bibliographic references

    Authors: Erika Alves dos Santos, Silvio Peroni, Marcos Luiz Mucheroni

    Abstract: Current citation practices observed in articles are very noisy, confusing, and not standardised, making identifying the cited works problematic for humans and any reference extraction software. In this work, we want to investigate such citation practices for referencing different types of entities and, in particular, to understand the most used metadata in bibliographic references. We identified…

    Submitted 21 July, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  28. What do we mean by "data"? A proposed classification of data types in the arts and humanities

    Authors: Bianca Gualandi, Luca Pareschi, Silvio Peroni

    Abstract: Purpose: This article describes the interviews we conducted in late 2021 with 19 researchers at the Department of Classical Philology and Italian Studies at the University of Bologna. The main purpose was to shed light on the definition of the word "data" in the humanities domain, as far as FAIR data management practices are concerned, and on what researchers think of the term. Methodology: We inv…

    Submitted 8 November, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

  29. An analysis of citing and referencing habits across all scholarly disciplines: approaches and trends in bibliographic referencing and citing practices

    Authors: Erika Alves dos Santos, Silvio Peroni, Marcos Luiz Mucheroni

    Abstract: Purpose. In this study, we want to identify current possible causes for citing and referencing errors in scholarly literature to compare if something changed from the snapshot provided by Sweetland in his 1989 paper. Design/methodology/approach. We analysed reference elements, i.e. bibliographic references, mentions, quotations, and respective in-text reference pointers, from 729 articles published i…

    Submitted 10 June, 2023; v1 submitted 17 February, 2022; originally announced February 2022.

  30. arXiv:2201.09555  [pdf, other]

    cs.AI cs.CL cs.DL

    A Knowledge Graph Embeddings based Approach for Author Name Disambiguation using Literals

    Authors: Cristian Santini, Genet Asefa Gesese, Silvio Peroni, Aldo Gangemi, Harald Sack, Mehwish Alam

    Abstract: Scholarly data is growing continuously containing information about the articles from a plethora of venues including conferences, journals, etc. Many initiatives have been taken to make scholarly data available as Knowledge Graphs (KGs). These efforts to standardize these data and make them accessible have also led to many challenges such as exploration of scholarly articles, ambiguous authors, et…

    Submitted 1 June, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

  31. Identifying and correcting invalid citations due to DOI errors in Crossref data

    Authors: Alessia Cioffi, Sara Coppini, Arcangelo Massari, Arianna Moretti, Silvio Peroni, Cristian Santini, Nooshin Shahidzadeh Asadi

    Abstract: This work aims to identify classes of DOI mistakes by analysing the open bibliographic metadata available in Crossref, highlighting which publishers were responsible for such mistakes and how many of these incorrect DOIs could be corrected through automatic processes. By using a list of invalid cited DOIs gathered by OpenCitations while processing the OpenCitations Index of Crossref open DOI-to-DO…

    Submitted 7 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Journal ref: Scientometrics 127, 3593-3612 (2022)

  32. arXiv:2111.05223  [pdf]

    cs.DL cs.IR

    A quantitative and qualitative open citation analysis of retracted articles in the humanities

    Authors: Ivan Heibi, Silvio Peroni

    Abstract: In this article, we show and discuss the results of a quantitative and qualitative analysis of open citations to retracted publications in the humanities domain. Our study was conducted by selecting retracted papers in the humanities domain and marking their main characteristics (e.g., retraction reason). Then, we gathered the citing entities and annotated their basic metadata (e.g., title, venue,…

    Submitted 10 October, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

  33. arXiv:2110.02111  [pdf]

    cs.DL

    Open bibliographic data and the Italian National Scientific Qualification: measuring coverage of academic fields

    Authors: Federica Bologna, Angelo Di Iorio, Silvio Peroni, Francesco Poggi

    Abstract: The importance of open bibliographic repositories is widely accepted by the scientific community. For evaluation processes, however, there is still some skepticism: even if large repositories of open access articles and free publication indexes exist and are continuously growing, assessment procedures still rely on proprietary databases, mainly due to the richness of the data available in these pr…

    Submitted 13 May, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

  34. arXiv:2110.00307  [pdf, other]

    cs.DL

    The case for the Humanities Citation Index (HuCI): a citation index by the humanities, for the humanities

    Authors: Giovanni Colavizza, Silvio Peroni, Matteo Romanello

    Abstract: Citation indexes are by now part of the research infrastructure in use by most scientists: a necessary tool in order to cope with the increasing amounts of scientific literature being published. Commercial citation indexes are designed for the sciences and have uneven coverage and unsatisfactory characteristics for humanities scholars, while no comprehensive citation index is published by a public…

    Submitted 14 May, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

  35. arXiv:2108.12190  [pdf]

    cs.DL cs.SI

    A map of Digital Humanities research across bibliographic data sources

    Authors: Gianmarco Spinaci, Giovanni Colavizza, Silvio Peroni

    Abstract: Purpose. This study presents the results of an experiment we performed to measure the coverage of Digital Humanities (DH) publications in mainstream open and proprietary bibliographic data sources, by further highlighting the relations among DH and other disciplines. Methodology. We created a list of DH journals based on manual curation and bibliometric data. We used that list to identify DH publi…

    Submitted 1 March, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

  36. arXiv:2106.12320  [pdf, ps, other]

    cs.DL cs.IR cs.LG

    BiblioDAP: The 1st Workshop on Bibliographic Data Analysis and Processing

    Authors: Zeyd Boukhers, Philipp Mayr, Silvio Peroni

    Abstract: Automatic processing of bibliographic data becomes very important in digital libraries, data science and machine learning due to its importance in keeping pace with the significant increase of published papers every year from one side and to the inherent challenges from the other side. This processing has several aspects including but not limited to I) Automatic extraction of references from PDF d…

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: This workshop will be held in conjunction with KDD 2021

  37. arXiv:2106.05725  [pdf]

    cs.DL cs.CY

    Academics evaluating academics: a methodology to inform the review process on top of open citations

    Authors: Federica Bologna, Angelo Di Iorio, Silvio Peroni, Francesco Poggi

    Abstract: In the past, several works have investigated ways for combining quantitative and qualitative methods in research assessment exercises. In this work, we aim at introducing a methodology to explore whether citation-based metrics, calculated only considering open bibliographic and citation data, can yield insights on how human peer-review of research assessment exercises is conducted. To understand i…

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2103.07942

  38. A protocol to gather, characterize and analyze incoming citations of retracted articles

    Authors: Ivan Heibi, Silvio Peroni

    Abstract: In this article, we present a methodology which takes as input a collection of retracted articles, gathers the entities citing them, characterizes such entities according to multiple dimensions (disciplines, year of publication, sentiment, etc.), and applies a quantitative and qualitative analysis on the collected values. The methodology is composed of four phases: (1) identifying, retrieving, and…

    Submitted 3 June, 2021; originally announced June 2021.

  39. arXiv:2105.08599  [pdf]

    cs.DL

    Can we assess research using open scientific knowledge graphs? A case study within the Italian National Scientific Qualification

    Authors: Federica Bologna, Angelo Di Iorio, Silvio Peroni, Francesco Poggi

    Abstract: The need for open scientific knowledge graphs is ever increasing. While there are large repositories of open access articles and free publication indexes, there are still few free knowledge graphs exposing citation networks, and often their coverage is partial. Consequently, most evaluation processes based on citation counts rely on commercial citation databases. Things are changing thanks to the…

    Submitted 18 May, 2021; originally announced May 2021.

  40. Do open citations give insights on the qualitative peer-review evaluation in research assessments? An analysis of the Italian National Scientific Qualification

    Authors: Federica Bologna, Angelo Di Iorio, Silvio Peroni, Francesco Poggi

    Abstract: In the past, several works have investigated ways for combining quantitative and qualitative methods in research assessment exercises. Indeed, the Italian National Scientific Qualification (NSQ), i.e. the national assessment exercise which aims at deciding whether a scholar can apply to professorial academic positions as Associate Professor and Full Professor, adopts a quantitative and qualitative…

    Submitted 23 October, 2022; v1 submitted 14 March, 2021; originally announced March 2021.

  41. arXiv:2012.11475  [pdf]

    cs.DL

    A qualitative and quantitative analysis of open citations to retracted articles: the Wakefield et al.'s case

    Authors: Ivan Heibi, Silvio Peroni

    Abstract: In this article, we show the results of a quantitative and qualitative analysis of open citations on a popular and highly cited retracted paper: "Ileal-lymphoid-nodular hyperplasia, non-specific colitis, and pervasive developmental disorder in children" by Wakefield et al., published in 1998. The main purpose of our study is to understand the behavior of the publications citing retracted articles…

    Submitted 24 May, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

  42. arXiv:2011.13886  [pdf]

    cs.DL

    MITAO: a tool for enabling scholars in the Humanities to use Topic Modelling in their studies

    Authors: Ivan Heibi, Silvio Peroni, Luca Pareschi, Paolo Ferri

    Abstract: Automatic text analysis methods, such as Topic Modelling, are gaining much attention in Humanities. However, scholars need to have extensive coding skills to use such methods appropriately. The need of having this technical expertise prevents the broad adoption of these methods in Humanities research. In this paper, to help scholars in the Humanities to use Topic Modelling having no or limited cod…

    Submitted 27 November, 2020; originally announced November 2020.

  43. The Landscape of Ontology Reuse Approaches

    Authors: Valentina Anita Carriero, Marilena Daquino, Aldo Gangemi, Andrea Giovanni Nuzzolese, Silvio Peroni, Valentina Presutti, Francesca Tomasi

    Abstract: Ontology reuse aims to foster interoperability and facilitate knowledge reuse. Several approaches are typically evaluated by ontology engineers when bootstrapping a new project. However, current practices are often motivated by subjective, case-by-case decisions, which hamper the definition of a recommended behaviour. In this chapter we argue that to date there are no effective solutions for suppo…

    Submitted 25 November, 2020; originally announced November 2020.

  44. Citing and referencing habits in Medicine and Social Sciences journals in 2019

    Authors: Erika Alves dos Santos, Silvio Peroni, Marcos Luiz Mucheroni

    Abstract: This article explores citing and referencing systems in Social Sciences and Medicine articles from different theoretical and practical perspectives, considering bibliographic references as a facet of descriptive representation. The analysis of citing and referencing elements (i.e. bibliographic references, mentions, quotations, and respective in-text reference pointers) identified citing and refer…

    Submitted 20 January, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: Accepted for publication on 18 January 2021 in Journal of Documentation

  45. arXiv:2007.16079  [pdf]

    cs.DB

    Creating RESTful APIs over SPARQL endpoints using RAMOSE

    Authors: Marilena Daquino, Ivan Heibi, Silvio Peroni, David Shotton

    Abstract: Semantic Web technologies are widely used for storing RDF data and making them available on the Web through SPARQL endpoints, queryable using the SPARQL query language. While the use of SPARQL endpoints is strongly supported by Semantic Web experts, it hinders broader use of RDF data by common Web users, engineers and developers unfamiliar with Semantic Web technologies, who normally rely on Web R…

    Submitted 30 May, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

  46. arXiv:2005.11981  [pdf, other]

    cs.DL

    The OpenCitations Data Model

    Authors: Marilena Daquino, Silvio Peroni, David Shotton, Giovanni Colavizza, Behnam Ghavimi, Anne Lauscher, Philipp Mayr, Matteo Romanello, Philipp Zumstein

    Abstract: A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data supplier or context application. In this paper we presen…

    Submitted 24 August, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: ISWC 2020 Conference proceedings

  47. OpenCitations, an infrastructure organization for open scholarship

    Authors: Silvio Peroni, David Shotton

    Abstract: OpenCitations is an infrastructure organization for open scholarship dedicated to the publication of open citation data as Linked Open Data using Semantic Web technologies, thereby providing a disruptive alternative to traditional proprietary citation indexes. Open citation data are valuable for bibliometric analysis, increasing the reproducibility of large-scale analyses by enabling publication o…

    Submitted 9 December, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

  48. Nine Million Book Items and Eleven Million Citations: A Study of Book-Based Scholarly Communication Using OpenCitations

    Authors: Yongjun Zhu, Erjia Yan, Silvio Peroni, Chao Che

    Abstract: Books have been widely used to share information and contribute to human knowledge. However, the quantitative use of books as a method of scholarly communication is relatively unexamined compared to journal articles and conference papers. This study uses the COCI dataset (a comprehensive open citation dataset provided by OpenCitations) to explore books' roles in scholarly communication. The COCI d…

    Submitted 6 December, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  49. COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations

    Authors: Ivan Heibi, Silvio Peroni, David Shotton

    Abstract: In this paper, we present COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations (http://opencitations.net/index/coci). COCI is the first open citation index created by OpenCitations, in which we have applied the concept of citations as first-class data entities, and it contains more than 445 million DOI-to-DOI citation links derived from the data available in Crossref. These citation…

    Submitted 26 July, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: Submitted to Scientometrics (https://link.springer.com/journal/11192)

  50. The practice of self-citations: a longitudinal study

    Authors: Silvio Peroni, Paolo Ciancarini, Aldo Gangemi, Andrea Giovanni Nuzzolese, Francesco Poggi, Valentina Presutti

    Abstract: In this article, we discuss the outcomes of an experiment where we analysed whether and to what extent the introduction, in 2012, of the new research assessment exercise in Italy (a.k.a. Italian Scientific Habilitation) affected self-citation behaviours in the Italian research community. The Italian Scientific Habilitation attests to the scientific maturity of researchers and in Italy, as in many…

    Submitted 19 February, 2020; v1 submitted 14 March, 2019; originally announced March 2019.