Showing 1–6 of 6 results for author: Antaki, F

Searching in archive cs.
  1. arXiv:2504.11186

    cs.CL cs.AI

    Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items

    Authors: Minjie Zou, Sahana Srinivasan, Thaddaeus Wai Soon Lo, Ke Zou, Gabriel Dawei Yang, Xuguang Ai, Hyunjae Kim, Maxwell Singer, Fares Antaki, Kelvin Li, Robert Chang, Marcus Tan, David Ziyou Chen, Dianbo Liu, Qingyu Chen, Yih Chung Tham

    Abstract: Recent advances in reasoning-focused large language models (LLMs) mark a shift from general LLMs toward models designed for complex decision-making, a crucial aspect in medicine. However, their performance in specialized domains like ophthalmology remains underexplored. This study comprehensively evaluated and compared the accuracy and reasoning capabilities of four newly developed reasoning-focus…

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 83 pages, 6 figures, 3 tables, 9 supplementary figures, 7 supplementary tables

  2. arXiv:2502.08073

    cs.CY

    Large language models perpetuate bias in palliative care: development and analysis of the Palliative Care Adversarial Dataset (PCAD)

    Authors: Naomi Akhras, Fares Antaki, Fannie Mottet, Olivia Nguyen, Shyam Sawhney, Sabrina Bajwah, Joanna M Davies

    Abstract: Bias and inequity in palliative care disproportionately affect marginalised groups. Large language models (LLMs), such as GPT-4o, hold potential to enhance care but risk perpetuating biases present in their training data. This study aimed to systematically evaluate whether GPT-4o propagates biases in palliative care responses using adversarially designed datasets. In July 2024, GPT-4o was probed u…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: The complete PCAD datasets are available on Figshare: dx.doi.org/10.6084/m9.figshare.28396016

  3. arXiv:2501.13949

    cs.CL cs.AI

    Can OpenAI o1 Reason Well in Ophthalmology? A 6,990-Question Head-to-Head Evaluation Study

    Authors: Sahana Srinivasan, Xuguang Ai, Minjie Zou, Ke Zou, Hyunjae Kim, Thaddaeus Wai Soon Lo, Krithi Pushpanathan, Yiming Kong, Anran Li, Maxwell Singer, Kai Jin, Fares Antaki, David Ziyou Chen, Dianbo Liu, Ron A. Adelman, Qingyu Chen, Yih Chung Tham

    Abstract: Question: What is the performance and reasoning ability of OpenAI o1 compared to other large language models in addressing ophthalmology-specific questions? Findings: This study evaluated OpenAI o1 and five LLMs using 6,990 ophthalmological questions from MedMCQA. O1 achieved the highest accuracy (0.88) and macro-F1 score but ranked third in reasoning capabilities based on text-generation metric…

    Submitted 19 January, 2025; originally announced January 2025.

    Comments: 44 pages

  4. arXiv:2401.10883

    cs.HC

    RetinaVR: Democratizing Vitreoretinal Surgery Training with a Portable and Affordable Virtual Reality Simulator in the Metaverse

    Authors: Fares Antaki, Cédryk Doucet, Daniel Milad, Charles-Édouard Giguère, Benoit Ozell, Karim Hammamji

    Abstract: We developed and validated RetinaVR, an affordable and immersive virtual reality simulator for vitreoretinal surgery training, using the Meta Quest 2 VR headset. We focused on four core fundamental skills: core vitrectomy, peripheral shaving, membrane peeling, and endolaser application. The validation study involved 10 novice ophthalmology residents and 10 expert vitreoretinal surgeons. We demonst…

    Submitted 19 January, 2024; originally announced January 2024.

  5. arXiv:2109.09463

    eess.IV cs.CV cs.LG

    Predicting Visual Improvement after Macular Hole Surgery: a Cautionary Tale on Deep Learning with Very Limited Data

    Authors: M. Godbout, A. Lachance, F. Antaki, A. Dirani, A. Durand

    Abstract: We investigate the potential of machine learning models for the prediction of visual improvement after macular hole surgery from preoperative data (retinal images and clinical features). Collecting our own data for the task, we end up with only 121 total samples, putting our work in the very limited data regime. We explore a variety of deep learning methods for limited data to train deep computer…

    Submitted 14 November, 2021; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  6. Limitations of ROC on Imbalanced Data: Evaluation of LVAD Mortality Risk Scores

    Authors: Faezeh Movahedi, Rema Padman, James F. Antaki

    Abstract: Objective: This study illustrates the ambiguity of ROC in evaluating two classifiers of 90-day LVAD mortality. This paper also introduces the precision recall curve (PRC) as a supplemental metric that is more representative of LVAD classifiers performance in predicting the minority class. Background: In the LVAD domain, the receiver operating characteristic (ROC) is a commonly applied metric of…

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: Submitted to JACC Heart Failure

    Journal ref: The Journal of Thoracic and Cardiovascular Surgery. 2021 Jul 30