Skip to main content

Showing 1–11 of 11 results for author: Rousseau, J F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.15271  [pdf, other

    cs.CL cs.IR

    A MapReduce Approach to Effectively Utilize Long Context Information in Retrieval Augmented Language Models

    Authors: Gongbo Zhang, Zihan Xu, Qiao Jin, Fangyi Chen, Yilu Fang, Yi Liu, Justin F. Rousseau, Ziyang Xu, Zhiyong Lu, Chunhua Weng, Yifan Peng

    Abstract: While holding great promise for improving and facilitating healthcare, large language models (LLMs) struggle to produce up-to-date responses on evolving topics due to outdated knowledge or hallucination. Retrieval-augmented generation (RAG) is a pivotal innovation that improves the accuracy and relevance of LLM responses by integrating LLMs with a search engine and external sources of knowledge. H… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  2. arXiv:2401.08396  [pdf

    cs.CV cs.AI cs.CL

    Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine

    Authors: Qiao Jin, Fangyuan Chen, Yiliang Zhou, Ziyang Xu, Justin M. Cheung, Robert Chen, Ronald M. Summers, Justin F. Rousseau, Peiyun Ni, Marc J Landsman, Sally L. Baxter, Subhi J. Al'Aref, Yijia Li, Alex Chen, Josef A. Brejt, Michael F. Chiang, Yifan Peng, Zhiyong Lu

    Abstract: Recent studies indicate that Generative Pre-trained Transformer 4 with Vision (GPT-4V) outperforms human physicians in medical challenge tasks. However, these evaluations primarily focused on the accuracy of multi-choice questions alone. Our study extends the current scope by conducting a comprehensive analysis of GPT-4V's rationales of image comprehension, recall of medical knowledge, and step-by… ▽ More

    Submitted 31 August, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Journal ref: npj Digital Medicine, 2024

  3. arXiv:2305.19339  [pdf, other

    cs.CL cs.AI

    Less Likely Brainstorming: Using Language Models to Generate Alternative Hypotheses

    Authors: Liyan Tang, Yifan Peng, Yanshan Wang, Ying Ding, Greg Durrett, Justin F. Rousseau

    Abstract: A human decision-maker benefits the most from an AI assistant that corrects for their biases. For problems such as generating interpretation of a radiology report given findings, a system predicting only highly likely outcomes may be less useful, where such outcomes are already obvious to the user. To alleviate biases in human decision-making, it is worth considering a broad differential diagnosis… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL (Findings) 2023

  4. arXiv:2303.00517  [pdf

    cs.LG cs.AI cs.CY

    Analyzing Impact of Socio-Economic Factors on COVID-19 Mortality Prediction Using SHAP Value

    Authors: Redoan Rahman, Jooyeong Kang, Justin F Rousseau, Ying Ding

    Abstract: This paper applies multiple machine learning (ML) algorithms to a dataset of de-identified COVID-19 patients provided by the COVID-19 Research Database. The dataset consists of 20,878 COVID-positive patients, among which 9,177 patients died in the year 2020. This paper aims to understand and interpret the association of socio-economic characteristics of patients with their mortality instead of max… ▽ More

    Submitted 27 February, 2023; originally announced March 2023.

    Comments: 10 pages, 10 figures, American Medical Informatics Association(AMIA) Annual Conference 2022, Washington DC, USA, Nov 5-9, 2022

    Journal ref: AMIA 2022 Annual Symposium

  5. arXiv:2302.08605  [pdf

    cs.LG cs.AI cs.CY

    Using Explainable AI to Cross-Validate Socio-economic Disparities Among Covid-19 Patient Mortality

    Authors: Li Shi, Redoan Rahman, Esther Melamed, Jacek Gwizdka, Justin F. Rousseau, Ying Ding

    Abstract: This paper applies eXplainable Artificial Intelligence (XAI) methods to investigate the socioeconomic disparities in COVID patient mortality. An Extreme Gradient Boosting (XGBoost) prediction model is built based on a de-identified Austin area hospital dataset to predict the mortality of COVID-19 patients. We apply two XAI methods, Shapley Additive exPlanations (SHAP) and Locally Interpretable Mod… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: AMIA 2023 Informatics Summit, March 13-16, Seattle, WA, USA. 10 pages

    Journal ref: AMIA 2023 Informatics Summit

  6. arXiv:2212.02675  [pdf, other

    cs.CV

    Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit Imbalances

    Authors: Ajay Jaiswal, Tianlong Chen, Justin F. Rousseau, Yifan Peng, Ying Ding, Zhangyang Wang

    Abstract: Deep neural networks (DNNs) have rapidly become a \textit{de facto} choice for medical image understanding tasks. However, DNNs are notoriously fragile to the class imbalance in image classification. We further point out that such imbalance fragility can be amplified when it comes to more sophisticated tasks such as pathology localization, as imbalances in such problems can have highly complex and… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted in WACV 2023

  7. arXiv:2210.08388  [pdf, other

    cs.CV cs.LG

    RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging

    Authors: Ajay Jaiswal, Kumar Ashutosh, Justin F Rousseau, Yifan Peng, Zhangyang Wang, Ying Ding

    Abstract: AI-powered Medical Imaging has recently achieved enormous attention due to its ability to provide fast-paced healthcare diagnoses. However, it usually suffers from a lack of high-quality datasets due to high annotation cost, inter-observer variability, human annotator error, and errors in computer-generated labels. Deep learning models trained on noisy labelled datasets are sensitive to the noise… ▽ More

    Submitted 2 December, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: Accepted in ICDM 2022

  8. arXiv:2210.08122  [pdf, other

    cs.LG cs.SI

    Old can be Gold: Better Gradient Flow can Make Vanilla-GCNs Great Again

    Authors: Ajay Jaiswal, Peihao Wang, Tianlong Chen, Justin F. Rousseau, Ying Ding, Zhangyang Wang

    Abstract: Despite the enormous success of Graph Convolutional Networks (GCNs) in modeling graph-structured data, most of the current GCNs are shallow due to the notoriously challenging problems of over-smoothening and information squashing along with conventional difficulty caused by vanishing gradients and over-fitting. Previous works have been primarily focused on the study of over-smoothening and over-sq… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Advances in Neural Information Processing Systems (NeurIPS), 2022

  9. arXiv:2205.12854  [pdf, other

    cs.CL cs.AI

    Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors

    Authors: Liyan Tang, Tanya Goyal, Alexander R. Fabbri, Philippe Laban, Jiacheng Xu, Semih Yavuz, Wojciech Kryściński, Justin F. Rousseau, Greg Durrett

    Abstract: The propensity of abstractive summarization models to make factual errors has been studied extensively, including design of metrics to detect factual errors and annotation of errors in current systems' outputs. However, the ever-evolving nature of summarization systems, metrics, and annotated benchmarks makes factuality evaluation a moving target, and drawing clear comparisons among metrics has be… ▽ More

    Submitted 25 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to ACL 2023

  10. arXiv:2110.14787  [pdf, other

    eess.IV cs.CV

    SCALP -- Supervised Contrastive Learning for Cardiopulmonary Disease Classification and Localization in Chest X-rays using Patient Metadata

    Authors: Ajay Jaiswal, Tianhao Li, Cyprian Zander, Yan Han, Justin F. Rousseau, Yifan Peng, Ying Ding

    Abstract: Computer-aided diagnosis plays a salient role in more accessible and accurate cardiopulmonary diseases classification and localization on chest radiography. Millions of people get affected and die due to these diseases without an accurate and timely diagnosis. Recently proposed contrastive learning heavily relies on data augmentation, especially positive data augmentation. However, generating clin… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  11. arXiv:2005.04308  [pdf

    cs.DL

    Building a PubMed knowledge graph

    Authors: Jian Xu, Sunkyu Kim, Min Song, Minbyul Jeong, Donghyeon Kim, Jaewoo Kang, Justin F. Rousseau, Xin Li, Weijia Xu, Vetle I. Torvik, Yi Bu, Chongyan Chen, Islam Akef Ebeid, Daifeng Li, Ying Ding

    Abstract: PubMed is an essential resource for the medical domain, but useful concepts are either difficult to extract or are ambiguated, which has significantly hindered knowledge discovery. To address this issue, we constructed a PubMed knowledge graph (PKG) by extracting bio-entities from 29 million PubMed abstracts, disambiguating author names, integrating funding data through the National Institutes of… ▽ More

    Submitted 15 May, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: 19 pages, 5 figures, 14 tables