Skip to main content

Showing 1–6 of 6 results for author: Kondziolka, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.23477  [pdf

    cs.CL

    Evaluating the performance and fragility of large language models on the self-assessment for neurological surgeons

    Authors: Krithik Vishwanath, Anton Alyakin, Mrigayu Ghosh, Jin Vivian Lee, Daniel Alexander Alber, Karl L. Sangwon, Douglas Kondziolka, Eric Karl Oermann

    Abstract: The Congress of Neurological Surgeons Self-Assessment for Neurological Surgeons (CNS-SANS) questions are widely used by neurosurgical residents to prepare for written board examinations. Recently, these questions have also served as benchmarks for evaluating large language models' (LLMs) neurosurgical knowledge. This study aims to assess the performance of state-of-the-art LLMs on neurosurgery boa… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 22 pages, 3 main figures, 3 supplemental figures

  2. arXiv:2504.01201  [pdf, other

    cs.CL cs.AI cs.HC

    Medical large language models are easily distracted

    Authors: Krithik Vishwanath, Anton Alyakin, Daniel Alexander Alber, Jin Vivian Lee, Douglas Kondziolka, Eric Karl Oermann

    Abstract: Large language models (LLMs) have the potential to transform medicine, but real-world clinical scenarios contain extraneous information that can hinder performance. The rise of assistive technologies like ambient dictation, which automatically generates draft notes from live patient encounters, has the potential to introduce additional noise making it crucial to assess the ability of LLM's to filt… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: 20 pages, 2 main figures, 6 extended figures

  3. arXiv:2502.19546  [pdf

    cs.AI cs.CL cs.HC

    Repurposing the scientific literature with vision-language models

    Authors: Anton Alyakin, Jaden Stryker, Daniel Alexander Alber, Karl L. Sangwon, Jin Vivian Lee, Brandon Duderstadt, Akshay Save, David Kurland, Spencer Frome, Shrutika Singh, Jeff Zhang, Eunice Yang, Ki Yun Park, Cordelia Orillac, Aly A. Valliani, Sean Neifert, Albert Liu, Aneek Patel, Christopher Livia, Darryl Lau, Ilya Laufer, Peter A. Rozman, Eveline Teresa Hidalgo, Howard Riina, Rui Feng , et al. (7 additional authors not shown)

    Abstract: Leading vision-language models (VLMs) are trained on general Internet content, overlooking scientific journals' rich, domain-specific knowledge. Training on specialty-specific literature could yield high-performance, task-specific tools, enabling generative AI to match generalist models in specialty publishing, educational, and clinical tasks. We created NeuroPubs, a multimodal dataset of 23,000 N… ▽ More

    Submitted 27 April, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  4. arXiv:2111.00340  [pdf

    cs.LG cs.CY

    Identifying and mitigating bias in algorithms used to manage patients in a pandemic

    Authors: Yifan Li, Garrett Yoon, Mustafa Nasir-Moin, David Rosenberg, Sean Neifert, Douglas Kondziolka, Eric Karl Oermann

    Abstract: Numerous COVID-19 clinical decision support systems have been developed. However many of these systems do not have the merit for validity due to methodological shortcomings including algorithmic bias. Methods Logistic regression models were created to predict COVID-19 mortality, ventilator status and inpatient status using a real-world dataset consisting of four hospitals in New York City and anal… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: 4 pages, 1 tables

  5. arXiv:2110.11872  [pdf

    cs.LG

    Patient level simulation and reinforcement learning to discover novel strategies for treating ovarian cancer

    Authors: Brian Murphy, Mustafa Nasir-Moin, Grace von Oiste, Viola Chen, Howard A Riina, Douglas Kondziolka, Eric K Oermann

    Abstract: The prognosis for patients with epithelial ovarian cancer remains dismal despite improvements in survival for other cancers. Treatment involves multiple lines of chemotherapy and becomes increasingly heterogeneous after first-line therapy. Reinforcement learning with real-world outcomes data has the potential to identify novel treatment strategies to improve overall survival. We design a reinforce… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  6. arXiv:2109.08227  [pdf, other

    eess.IV cs.CV cs.LG

    Stereo Video Reconstruction Without Explicit Depth Maps for Endoscopic Surgery

    Authors: Annika Brundyn, Jesse Swanson, Kyunghyun Cho, Doug Kondziolka, Eric Oermann

    Abstract: We introduce the task of stereo video reconstruction or, equivalently, 2D-to-3D video conversion for minimally invasive surgical video. We design and implement a series of end-to-end U-Net-based solutions for this task by varying the input (single frame vs. multiple consecutive frames), loss function (MSE, MAE, or perceptual losses), and network architecture. We evaluate these solutions by surveyi… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: 9 pages, 5 figures