Skip to main content

Showing 1–1 of 1 results for author: Ronn, N

.
  1. arXiv:2401.14493  [pdf, other

    cs.CL cs.HC cs.LG

    K-QA: A Real-World Medical Q&A Benchmark

    Authors: Itay Manes, Naama Ronn, David Cohen, Ran Ilan Ber, Zehavi Horowitz-Kugler, Gabriel Stanovsky

    Abstract: Ensuring the accuracy of responses provided by large language models (LLMs) is crucial, particularly in clinical settings where incorrect information may directly impact patient health. To address this challenge, we construct K-QA, a dataset containing 1,212 patient questions originating from real-world conversations held on K Health (an AI-driven clinical platform). We employ a panel of in-house… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: The data and the evaluation script are available at https://github.com/Itaymanes/K-QA. Results and model comparisons can be viewed at https://huggingface.co/spaces/Itaykhealth/K-QA