
Showing 1–4 of 4 results for author: Le-Khac, P H

  1. arXiv:2506.07731

    cs.AI

    NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models

    Authors: Mouadh Yagoubi, Yasser Dahou, Billel Mokeddem, Younes Belkada, Phuc H. Le-Khac, Basma El Amel Boussaha, Reda Alami, Jingwei Zuo, Damiano Marsili, Mugariya Farooq, Mounia Lalmas, Georgia Gkioxari, Patrick Gallinari, Philip Torr, Hakim Hacid

    Abstract: Existing benchmarks have proven effective for assessing the performance of fully trained large language models. However, we find striking differences in the early training stages of small models, where benchmarks often fail to provide meaningful or discriminative signals. To explore how these differences arise, this competition tackles the challenge of designing scientific knowledge evaluation tas…

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2212.10273

    cs.LG cs.AI math.NA

    Managing Large Dataset Gaps in Urban Air Quality Prediction: DCU-Insight-AQ at MediaEval 2022

    Authors: Dinh Viet Cuong, Phuc H. Le-Khac, Adam Stapleton, Elke Eichlemann, Mark Roantree, Alan F. Smeaton

    Abstract: Calculating an Air Quality Index (AQI) typically uses data streams from air quality sensors deployed at fixed locations and the calculation is a real time process. If one or a number of sensors are broken or offline, then the real time AQI value cannot be computed. Estimating AQI values for some point in the future is a predictive process and uses historical AQI values to train and build models. I…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 5 pages, 1 Figure, 1 Table

  3. arXiv:2012.15641

    cs.MM cs.AI cs.CV

    Investigating Memorability of Dynamic Media

    Authors: Phuc H. Le-Khac, Ayush K. Rai, Graham Healy, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The Predicting Media Memorability task in MediaEval'20 has some challenging aspects compared to previous years. In this paper, we identify the high-dynamic content in videos and the limited size of the dataset as the core challenges for the task, propose directions to overcome some of these challenges, and present our initial results in these directions.

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 3 pages, 1 figure, 1 table

    Journal ref: MediaEval Multimedia Benchmark Workshop Working Notes, 14-15 December 2020

  4. Contrastive Representation Learning: A Framework and Review

    Authors: Phuc H. Le-Khac, Graham Healy, Alan F. Smeaton

    Abstract: Contrastive Learning has recently received interest due to its success in self-supervised representation learning in the computer vision domain. However, the origins of Contrastive Learning date as far back as the 1990s and its development has spanned across many fields and domains including Metric Learning and natural language processing. In this paper we provide a comprehensive literature review…

    Submitted 27 October, 2020; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: 28 pages, 9 figures, update with the accepted version in IEEE Access