Showing 1–5 of 5 results for author: Bernabeu-Perez, P

  1. arXiv:2505.23575  [pdf, other]

    cs.AI cs.LG

    CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring

    Authors: Benjamin Arnav, Pablo Bernabeu-Pérez, Nathan Helm-Burger, Tim Kostolansky, Hannes Whittingham, Mary Phuong

    Abstract: As AI models are deployed with increasing autonomy, it is important to ensure they do not take harmful actions unnoticed. As a potential mitigation, we investigate Chain-of-Thought (CoT) monitoring, wherein a weaker trusted monitor model continuously oversees the intermediate reasoning steps of a more powerful but untrusted model. We compare CoT monitoring to action-only monitoring, where only fin…

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.04388  [pdf, ps, other]

    cs.CL cs.AI

    The Aloe Family Recipe for Open and Specialized Healthcare LLMs

    Authors: Dario Garcia-Gasulla, Jordi Bayarri-Planas, Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés

    Abstract: Purpose: With advancements in Large Language Models (LLMs) for healthcare, the need arises for competitive open-source models to protect the public interest. This work contributes to the field of open medical LLMs by optimizing key stages of data preprocessing and training, while showing how to improve model safety (through DPO) and efficacy (through RAG). The evaluation methodology used, which in…

    Submitted 28 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Comments: Follow-up work from arXiv:2405.01886

  3. arXiv:2502.06666  [pdf, other]

    cs.CL cs.AI

    Automatic Evaluation of Healthcare LLMs Beyond Question-Answering

    Authors: Anna Arias-Duart, Pablo Agustin Martin-Torres, Daniel Hinjos, Pablo Bernabeu-Perez, Lucia Urcelay Ganzabal, Marta Gonzalez Mallo, Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Sergio Alvarez-Napagao, Dario Garcia-Gasulla

    Abstract: Current Large Language Models (LLMs) benchmarks are often based on open-ended or close-ended QA evaluations, avoiding the requirement of human labor. Close-ended measurements evaluate the factuality of responses but lack expressiveness. Open-ended capture the model's capacity to produce discourse responses but are harder to assess for correctness. These two approaches are commonly used, either ind…

    Submitted 10 February, 2025; originally announced February 2025.

  4. arXiv:2409.14128  [pdf, other]

    cs.CV cs.AI cs.LG

    Present and Future Generalization of Synthetic Image Detectors

    Authors: Pablo Bernabeu-Perez, Enrique Lopez-Cuena, Dario Garcia-Gasulla

    Abstract: The continued release of increasingly realistic image generation models creates a demand for synthetic image detectors. To build effective detectors we must first understand how factors like data source diversity, training methodologies and image alterations affect their generalization capabilities. This work conducts a systematic analysis and uses its insights to develop practical guidelines for…

    Submitted 26 November, 2024; v1 submitted 21 September, 2024; originally announced September 2024.

    Comments: 21 pages, 12 figures

  5. arXiv:2405.01886  [pdf, other]

    cs.CL cs.AI

    Aloe: A Family of Fine-tuned Open Healthcare LLMs

    Authors: Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Jordi Bayarri-Planas, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés, Dario Garcia-Gasulla

    Abstract: As the capabilities of Large Language Models (LLMs) in healthcare and medicine continue to advance, there is a growing need for competitive open-source models that can safeguard public interest. With the increasing availability of highly competitive open base models, the impact of continued pre-training is increasingly uncertain. In this work, we explore the role of instruct tuning, model merging,…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Five appendices