Skip to main content

Showing 1–5 of 5 results for author: Mulhem, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.08541  [pdf, other

    cs.IR

    LongEval at CLEF 2025: Longitudinal Evaluation of IR Model Performance

    Authors: Matteo Cancellieri, Alaa El-Ebshihy, Tobias Fink, Petra Galuščáková, Gabriela Gonzalez-Saez, Lorraine Goeuriot, David Iommi, Jüri Keller, Petr Knoth, Philippe Mulhem, Florina Piroi, David Pride, Philipp Schaer

    Abstract: This paper presents the third edition of the LongEval Lab, part of the CLEF 2025 conference, which continues to explore the challenges of temporal persistence in Information Retrieval (IR). The lab features two tasks designed to provide researchers with test data that reflect the evolving nature of user queries and document relevance over time. By evaluating how model performance degrades as test… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: Accepted for ECIR 2025. To be published in Advances in Information Retrieval - 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6-10, 2025, Proceedings

  2. arXiv:2303.03229  [pdf, other

    cs.IR

    LongEval-Retrieval: French-English Dynamic Test Collection for Continuous Web Search Evaluation

    Authors: Petra Galuščáková Romain Deveaud, Gabriela Gonzalez-Saez, Philippe Mulhem, Lorraine Goeuriot, Florina Piroi, Martin Popel

    Abstract: LongEval-Retrieval is a Web document retrieval benchmark that focuses on continuous retrieval evaluation. This test collection is intended to be used to study the temporal persistence of Information Retrieval systems and will be used as the test collection in the Longitudinal Evaluation of Model Performance Track (LongEval) at CLEF 2023. This benchmark simulates an evolving information system envi… ▽ More

    Submitted 27 April, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  3. arXiv:2004.11759  [pdf, other

    cs.IR

    Learning Term Discrimination

    Authors: Jibril Frej, Phillipe Mulhem, Didier Schwab, Jean-Pierre Chevallet

    Abstract: Document indexing is a key component for efficient information retrieval (IR). After preprocessing steps such as stemming and stop-word removal, document indexes usually store term-frequencies (tf). Along with tf (that only reflects the importance of a term in a document), traditional IR models use term discrimination values (TDVs) such as inverse document frequency (idf) to favor discriminative t… ▽ More

    Submitted 28 April, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Accepted to ACM SIGIR 2020

  4. arXiv:1911.07317  [pdf, ps, other

    cs.IR

    Quels corpus d'entraînement pour l'expansion de requêtes par plongement de mots : application à la recherche de microblogs culturels

    Authors: Philippe Mulhem, Lorraine Goeuriot, Massih-Reza Amini, Nayanika Dogra

    Abstract: We describe here an experimental framework and the results obtained on microblogs retrieval. We study the contribution one popular approach, i.e., words embeddings, and investigate the impact of the training set on the learned embedding. We focus on query expansion for the retrieval of tweets on the CLEF CMC 2016 corpus. Our results show that using embeddings trained on a corpus in the same domain… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: 23 pages. in French

  5. arXiv:1606.06991  [pdf, other

    cs.IR cs.CL

    Toward Word Embedding for Personalized Information Retrieval

    Authors: Nawal Ould-Amer, Philippe Mulhem, Mathias Gery

    Abstract: This paper presents preliminary works on using Word Embedding (word2vec) for query expansion in the context of Personalized Information Retrieval. Traditionally, word embeddings are learned on a general corpus, like Wikipedia. In this work we try to personalize the word embeddings learning, by achieving the learning on the user's profile. The word embeddings are then in the same context than the u… ▽ More

    Submitted 22 June, 2016; originally announced June 2016.