Skip to main content

Showing 1–3 of 3 results for author: Shotande, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11129  [pdf, ps, other

    cs.CL cs.AI

    Trustworthy AI for Medicine: Continuous Hallucination Detection and Elimination with CHECK

    Authors: Carlos Garcia-Fernandez, Luis Felipe, Monique Shotande, Muntasir Zitu, Aakash Tripathi, Ghulam Rasool, Issam El Naqa, Vivek Rudrapatna, Gilmer Valdes

    Abstract: Large language models (LLMs) show promise in healthcare, but hallucinations remain a major barrier to clinical use. We present CHECK, a continuous-learning framework that integrates structured clinical databases with a classifier grounded in information theory to detect both factual and reasoning-based hallucinations. Evaluated on 1500 questions from 100 pivotal clinical trials, CHECK reduced LLam… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  2. arXiv:2504.02874  [pdf, other

    cs.CL

    TheBlueScrubs-v1, a comprehensive curated medical dataset derived from the internet

    Authors: Luis Felipe, Carlos Garcia, Issam El Naqa, Monique Shotande, Aakash Tripathi, Vivek Rudrapatna, Ghulam Rasool, Danielle Bitterman, Gilmer Valdes

    Abstract: The need for robust and diverse data sets to train clinical large language models (cLLMs) is critical given that currently available public repositories often prove too limited in size or scope for comprehensive medical use. While resources like PubMed provide foundational medical literature, they capture only a narrow range of formal publications and omit the broader medical discourse on the inte… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: 22 pages, 8 figures, 10 tables

  3. arXiv:1907.00146  [pdf, other

    cs.DB cs.HC

    DataPop: Knowledge Base Population using Distributed Voice Enabled Devices

    Authors: Elena Montes, Monique Shotande, Daniel Helm, Christan Grant

    Abstract: Data scientists are constantly creating methods to efficiently and accurately populate big data sets for use in large-scale applications. Many recent efforts utilize crowd-sourcing and textual interfaces. In this paper, we propose a new method of curating data; namely, creating a multi-device Amazon Alexa Skill in the form of a research trivia game. Users experience a synchronized gaming experienc… ▽ More

    Submitted 29 June, 2019; originally announced July 2019.

    Comments: 7 pages, 2 references, unsubmitted