
Showing 1–2 of 2 results for author: Goff, N

  1. arXiv:2503.13508

    cs.CL cs.AI cs.CY

    It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education

    Authors: Shrutika Singh, Anton Alyakin, Daniel Alexander Alber, Jaden Stryker, Ai Phuong S Tong, Karl Sangwon, Nicolas Goff, Mathew de la Paz, Miguel Hernandez-Rovira, Ki Yun Park, Eric Claude Leuthardt, Eric Karl Oermann

    Abstract: The performance of Large Language Models (LLMs) on multiple-choice question (MCQ) benchmarks is frequently cited as proof of their medical capabilities. We hypothesized that LLM performance on medical MCQs may in part be illusory and driven by factors beyond medical content knowledge and reasoning capabilities. To assess this, we created a novel benchmark of free-response questions with paired MCQ…

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 14 pages, 5 figures

  2. arXiv:2412.10982

    cs.AI

    MedG-KRP: Medical Graph Knowledge Representation Probing

    Authors: Gabriel R. Rosenbaum, Lavender Yao Jiang, Ivaxi Sheth, Jaden Stryker, Anton Alyakin, Daniel Alexander Alber, Nicolas K. Goff, Young Joon Fred Kwon, John Markert, Mustafa Nasir-Moin, Jan Moritz Niehues, Karl L. Sangwon, Eunice Yang, Eric Karl Oermann

    Abstract: Large language models (LLMs) have recently emerged as powerful tools, finding many medical applications. LLMs' ability to coalesce vast amounts of information from many sources to generate a response-a process similar to that of a human expert-has led many to see potential in deploying LLMs for clinical use. However, medicine is a setting where accurate reasoning is paramount. Many researchers are…

    Submitted 16 December, 2024; v1 submitted 14 December, 2024; originally announced December 2024.

    Comments: Findings paper presented at Machine Learning for Health (ML4H) symposium 2024, December 15-16, 2024, Vancouver, Canada, 19 pages