Skip to main content

Showing 1–1 of 1 results for author: Krolik, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.01941  [pdf, other

    cs.CL cs.LG

    Towards Leveraging Large Language Models for Automated Medical Q&A Evaluation

    Authors: Jack Krolik, Herprit Mahal, Feroz Ahmad, Gaurav Trivedi, Bahador Saket

    Abstract: This paper explores the potential of using Large Language Models (LLMs) to automate the evaluation of responses in medical Question and Answer (Q\&A) systems, a crucial form of Natural Language Processing. Traditionally, human evaluation has been indispensable for assessing the quality of these responses. However, manual evaluation by medical professionals is time-consuming and costly. Our study e… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 10 pages, 3 figures, 3 tables

    ACM Class: I.2.7; J.3