Skip to main content

Showing 1–6 of 6 results for author: Gururajan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.04388  [pdf, ps, other

    cs.CL cs.AI

    The Aloe Family Recipe for Open and Specialized Healthcare LLMs

    Authors: Dario Garcia-Gasulla, Jordi Bayarri-Planas, Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés

    Abstract: Purpose: With advancements in Large Language Models (LLMs) for healthcare, the need arises for competitive open-source models to protect the public interest. This work contributes to the field of open medical LLMs by optimizing key stages of data preprocessing and training, while showing how to improve model safety (through DPO) and efficacy (through RAG). The evaluation methodology used, which in… ▽ More

    Submitted 28 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Comments: Follow-up work from arXiv:2405.01886

  2. arXiv:2502.13603  [pdf, other

    cs.CL cs.AI cs.LG

    Efficient Safety Retrofitting Against Jailbreaking for LLMs

    Authors: Dario Garcia-Gasulla, Adrian Tormos, Anna Arias-Duart, Daniel Hinjos, Oscar Molina-Sedano, Ashwin Kumar Gururajan, Maria Eugenia Cardello

    Abstract: Direct Preference Optimization (DPO) is an efficient alignment technique that steers LLMs towards preferable outputs by training on preference data, bypassing the need for explicit reward models. Its simplicity enables easy adaptation to various domains and safety requirements. This paper examines DPO's effectiveness in model safety against jailbreaking attacks while minimizing data requirements a… ▽ More

    Submitted 25 February, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

  3. arXiv:2502.06666  [pdf, other

    cs.CL cs.AI

    Automatic Evaluation of Healthcare LLMs Beyond Question-Answering

    Authors: Anna Arias-Duart, Pablo Agustin Martin-Torres, Daniel Hinjos, Pablo Bernabeu-Perez, Lucia Urcelay Ganzabal, Marta Gonzalez Mallo, Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Sergio Alvarez-Napagao, Dario Garcia-Gasulla

    Abstract: Current Large Language Models (LLMs) benchmarks are often based on open-ended or close-ended QA evaluations, avoiding the requirement of human labor. Close-ended measurements evaluate the factuality of responses but lack expressiveness. Open-ended capture the model's capacity to produce discourse responses but are harder to assess for correctness. These two approaches are commonly used, either ind… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  4. arXiv:2409.15127  [pdf, other

    cs.AI

    Pareto-Optimized Open-Source LLMs for Healthcare via Context Retrieval

    Authors: Jordi Bayarri-Planas, Ashwin Kumar Gururajan, Dario Garcia-Gasulla

    Abstract: This study leverages optimized context retrieval to enhance open-source Large Language Models (LLMs) for cost-effective, high performance healthcare AI. We demonstrate that this approach achieves state-of-the-art accuracy on medical question answering at a fraction of the cost of proprietary models, significantly improving the cost-accuracy Pareto frontier on the MedQA benchmark. Key contributions… ▽ More

    Submitted 3 April, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: 14 pages, 3 figures, 5 tables, Accepted for publication at the 21st International Conference on Artificial Intelligence Applications and Innovations (AIAI 2025)

    ACM Class: I.2.0; I.2.7

  5. arXiv:2405.01886  [pdf, other

    cs.CL cs.AI

    Aloe: A Family of Fine-tuned Open Healthcare LLMs

    Authors: Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Jordi Bayarri-Planas, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo Agustin Martin-Torres, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Sergio Alvarez-Napagao, Eduard Ayguadé-Parra, Ulises Cortés Dario Garcia-Gasulla

    Abstract: As the capabilities of Large Language Models (LLMs) in healthcare and medicine continue to advance, there is a growing need for competitive open-source models that can safeguard public interest. With the increasing availability of highly competitive open base models, the impact of continued pre-training is increasingly uncertain. In this work, we explore the role of instruct tuning, model merging,… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Five appendix

  6. arXiv:2106.05256  [pdf, other

    cs.CR

    URLTran: Improving Phishing URL Detection Using Transformers

    Authors: Pranav Maneriker, Jack W. Stokes, Edir Garcia Lazo, Diana Carutasu, Farid Tajaddodianfar, Arun Gururajan

    Abstract: Browsers often include security features to detect phishing web pages. In the past, some browsers evaluated an unknown URL for inclusion in a list of known phishing pages. However, as the number of URLs and known phishing pages continued to increase at a rapid pace, browsers started to include one or more machine learning classifiers as part of their security services that aim to better protect en… ▽ More

    Submitted 27 August, 2021; v1 submitted 9 June, 2021; originally announced June 2021.