Skip to main content

Showing 1–5 of 5 results for author: Pride, D

.
  1. arXiv:2503.08541  [pdf, other

    cs.IR

    LongEval at CLEF 2025: Longitudinal Evaluation of IR Model Performance

    Authors: Matteo Cancellieri, Alaa El-Ebshihy, Tobias Fink, Petra Galuščáková, Gabriela Gonzalez-Saez, Lorraine Goeuriot, David Iommi, Jüri Keller, Petr Knoth, Philippe Mulhem, Florina Piroi, David Pride, Philipp Schaer

    Abstract: This paper presents the third edition of the LongEval Lab, part of the CLEF 2025 conference, which continues to explore the challenges of temporal persistence in Information Retrieval (IR). The lab features two tasks designed to provide researchers with test data that reflect the evolving nature of user queries and document relevance over time. By evaluating how model performance degrades as test… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: Accepted for ECIR 2025. To be published in Advances in Information Retrieval - 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6-10, 2025, Proceedings

  2. arXiv:2501.10415  [pdf

    cs.DL cs.IR cs.LG cs.SE

    Making Software FAIR: A machine-assisted workflow for the research software lifecycle

    Authors: Petr Knoth, Laurent Romary, Patrice Lopez, Roberto Di Cosmo, Pavel Smrz, Tomasz Umerle, Melissa Harrison, Alain Monteil, Matteo Cancellieri, David Pride

    Abstract: A key issue hindering discoverability, attribution and reusability of open research software is that its existence often remains hidden within the manuscript of research papers. For these resources to become first-class bibliographic records, they first need to be identified and subsequently registered with persistent identifiers (PIDs) to be made FAIR (Findable, Accessible, Interoperable and Reus… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: 5 pages

  3. arXiv:2307.04683  [pdf, other

    cs.CL cs.AI

    CORE-GPT: Combining Open Access research and large language models for credible, trustworthy question answering

    Authors: David Pride, Matteo Cancellieri, Petr Knoth

    Abstract: In this paper, we present CORE-GPT, a novel question-answering platform that combines GPT-based language models and more than 32 million full-text open access scientific articles from CORE. We first demonstrate that GPT3.5 and GPT4 cannot be relied upon to provide references or citations for generated text. We then introduce CORE-GPT which delivers evidence-based answers to questions, along with c… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 12 pages, accepted submission to TPDL2023

  4. arXiv:1805.08529  [pdf, other

    cs.DL

    Peer review and citation data in predicting university rankings, a large-scale analysis

    Authors: David Pride, Petr Knoth

    Abstract: Most Performance-based Research Funding Systems (PRFS) draw on peer review and bibliometric indicators, two different methodologies which are sometimes combined. A common argument against the use of indicators in such research evaluation exercises is their low correlation at the article level with peer review judgments. In this study, we analyse 191,000 papers from 154 higher education institutes… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

    Comments: 12 pages, 7 tables, 2 figures. Submitted to TPDL2018

  5. arXiv:1707.04207  [pdf, ps, other

    cs.DL

    Incidental or influential? - Challenges in automatically detecting citation importance using publication full texts

    Authors: David Pride, Petr Knoth

    Abstract: This work looks in depth at several studies that have attempted to automate the process of citation importance classification based on the publications full text. We analyse a range of features that have been previously used in this task. Our experimental results confirm that the number of in text references are highly predictive of influence. Contrary to the work of Valenzuela et al. we find abst… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.