Skip to main content

Showing 1–13 of 13 results for author: Kass-Hout, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.18503  [pdf, ps, other

    cs.CV

    Focus on What Matters: Enhancing Medical Vision-Language Models with Automatic Attention Alignment Tuning

    Authors: Aofei Chang, Le Huang, Alex James Boyd, Parminder Bhatia, Taha Kass-Hout, Cao Xiao, Fenglong Ma

    Abstract: Medical Large Vision-Language Models (Med-LVLMs) often exhibit suboptimal attention distribution on visual inputs, leading to hallucinated or inaccurate outputs. Existing mitigation methods primarily rely on inference-time interventions, which are limited in attention adaptation or require additional supervision. To address this, we propose A$^3$Tune, a novel fine-tuning framework for Automatic At… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL2025 (main)

  2. arXiv:2505.17100  [pdf, ps, other

    cs.CL

    Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector

    Authors: Haoyan Yang, Runxue Bao, Cao Xiao, Jun Ma, Parminder Bhatia, Shangqian Gao, Taha Kass-Hout

    Abstract: LLM-as-a-Judge has emerged as a promising tool for automatically evaluating generated outputs, but its reliability is often undermined by potential biases in judgment. Existing efforts to mitigate these biases face key limitations: in-context learning-based methods fail to address rooted biases due to the evaluator's limited capacity for self-reflection, whereas fine-tuning is not applicable to al… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2503.04639  [pdf, other

    cs.CV cs.LG

    Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation

    Authors: Aishik Konwer, Zhijian Yang, Erhan Bas, Cao Xiao, Prateek Prasanna, Parminder Bhatia, Taha Kass-Hout

    Abstract: Foundational models such as the Segment Anything Model (SAM) are gaining traction in medical imaging segmentation, supporting multiple downstream tasks. However, such models are supervised in nature, still relying on large annotated datasets or prompts supplied by experts. Conventional techniques such as active learning to alleviate such limitations are limited in scope and still necessitate conti… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: Accepted to CVPR 2025

  4. arXiv:2503.02157  [pdf, other

    cs.CV cs.AI

    MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models

    Authors: Aofei Chang, Le Huang, Parminder Bhatia, Taha Kass-Hout, Fenglong Ma, Cao Xiao

    Abstract: Large Vision Language Models (LVLMs) are becoming increasingly important in the medical domain, yet Medical LVLMs (Med-LVLMs) frequently generate hallucinations due to limited expertise and the complexity of medical applications. Existing benchmarks fail to effectively evaluate hallucinations based on their underlying causes and lack assessments of mitigation strategies. To address this gap, we in… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: Preprint, under review

  5. arXiv:2412.19634  [pdf, other

    stat.ML cs.LG

    Deep Linear Hawkes Processes

    Authors: Yuxin Chang, Alex Boyd, Cao Xiao, Taha Kass-Hout, Parminder Bhatia, Padhraic Smyth, Andrew Warrington

    Abstract: Marked temporal point processes (MTPPs) are used to model sequences of different types of events with irregular arrival times, with broad applications ranging from healthcare and social networks to finance. We address shortcomings in existing point process models by drawing connections between modern deep state-space models (SSMs) and linear Hawkes processes (LHPs), culminating in an MTPP that we… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

  6. arXiv:2410.23605  [pdf, other

    cs.CL

    Dynamic Uncertainty Ranking: Enhancing Retrieval-Augmented In-Context Learning for Long-Tail Knowledge in LLMs

    Authors: Shuyang Yu, Runxue Bao, Parminder Bhatia, Taha Kass-Hout, Jiayu Zhou, Cao Xiao

    Abstract: Large language models (LLMs) can learn vast amounts of knowledge from diverse domains during pre-training. However, long-tail knowledge from specialized domains is often scarce and underrepresented, rarely appearing in the models' memorization. Prior work has shown that in-context learning (ICL) with retriever augmentation can help LLMs better capture long-tail knowledge, reducing their reliance o… ▽ More

    Submitted 7 February, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: Accepted by NAACL 2025

  7. arXiv:2410.12831  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Segment as You Wish -- Free-Form Language-Based Segmentation for Medical Images

    Authors: Longchao Da, Rui Wang, Xiaojian Xu, Parminder Bhatia, Taha Kass-Hout, Hua Wei, Cao Xiao

    Abstract: Medical imaging is crucial for diagnosing a patient's health condition, and accurate segmentation of these images is essential for isolating regions of interest to ensure precise diagnosis and treatment planning. Existing methods primarily rely on bounding boxes or point-based prompts, while few have explored text-related prompts, despite clinicians often describing their observations and instruct… ▽ More

    Submitted 29 June, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: 19 pages, 9 as main content. The paper was accepted to KDD2025

    MSC Class: 68T45; 68U10; 92C55 ACM Class: I.2.7; I.4.9; H.3.3; I.2.6

  8. arXiv:2410.04585  [pdf, other

    cs.CL

    Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval

    Authors: Pengcheng Jiang, Cao Xiao, Minhao Jiang, Parminder Bhatia, Taha Kass-Hout, Jimeng Sun, Jiawei Han

    Abstract: Large language models (LLMs) have demonstrated significant potential in clinical decision support. Yet LLMs still suffer from hallucinations and lack fine-grained contextual medical knowledge, limiting their high-stake healthcare applications such as clinical diagnosis. Traditional retrieval-augmented generation (RAG) methods attempt to address these limitations but frequently retrieve sparse or i… ▽ More

    Submitted 20 April, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

    Comments: ICLR 2025 Camera-Ready

  9. arXiv:2310.18642  [pdf

    cs.CV cs.AI

    One-shot Localization and Segmentation of Medical Images with Foundation Models

    Authors: Deepa Anand, Gurunath Reddy M, Vanika Singhal, Dattesh D. Shanbhag, Shriram KS, Uday Patil, Chitresh Bhushan, Kavitha Manickam, Dawei Gui, Rakesh Mullick, Avinash Gopal, Parminder Bhatia, Taha Kass-Hout

    Abstract: Recent advances in Vision Transformers (ViT) and Stable Diffusion (SD) models with their ability to capture rich semantic features of the image have been used for image correspondence tasks on natural images. In this paper, we examine the ability of a variety of pre-trained ViT (DINO, DINOv2, SAM, CLIP) and SD models, trained exclusively on natural images, for solving the correspondence problems o… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023 R0-FoMo Workshop

  10. arXiv:2306.01631  [pdf, other

    cs.LG cs.AI q-bio.QM

    Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations

    Authors: Pengcheng Jiang, Cao Xiao, Tianfan Fu, Parminder Bhatia, Taha Kass-Hout, Jimeng Sun, Jiawei Han

    Abstract: Molecular representation learning is vital for various downstream applications, including the analysis and prediction of molecular properties and side effects. While Graph Neural Networks (GNNs) have been a popular framework for modeling molecular data, they often struggle to capture the full complexity of molecular representations. In this paper, we introduce a novel method called GODE, which acc… ▽ More

    Submitted 16 February, 2025; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: AAAI 2025

  11. arXiv:2107.11094  [pdf, other

    cs.CL

    Improving Early Sepsis Prediction with Multi Modal Learning

    Authors: Fred Qin, Vivek Madan, Ujjwal Ratan, Zohar Karnin, Vishaal Kapoor, Parminder Bhatia, Taha Kass-Hout

    Abstract: Sepsis is a life-threatening disease with high morbidity, mortality and healthcare costs. The early prediction and administration of antibiotics and intravenous fluids is considered crucial for the treatment of sepsis and can save potentially millions of lives and billions in health care costs. Professional clinical care practitioners have proposed clinical criterion which aid in early detection o… ▽ More

    Submitted 23 July, 2021; originally announced July 2021.

  12. arXiv:2007.09186  [pdf, other

    cs.IR

    AWS CORD-19 Search: A Neural Search Engine for COVID-19 Literature

    Authors: Parminder Bhatia, Lan Liu, Kristjan Arumae, Nima Pourdamghani, Suyog Deshpande, Ben Snively, Mona Mona, Colby Wise, George Price, Shyam Ramaswamy, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang, Taha Kass-Hout

    Abstract: Coronavirus disease (COVID-19) has been declared as a pandemic by WHO with thousands of cases being reported each day. Numerous scientific articles are being published on the disease raising the need for a service which can organize, and query them in a reliable fashion. To support this cause we present AWS CORD-19 Search (ACS), a public, COVID-19 specific, neural search engine that is powered by… ▽ More

    Submitted 7 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

  13. arXiv:1811.12276  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Hospital Mortality Prediction with Medical Named Entities and Multimodal Learning

    Authors: Mengqi Jin, Mohammad Taha Bahadori, Aaron Colak, Parminder Bhatia, Busra Celikkaya, Ram Bhakta, Selvan Senthivel, Mohammed Khalilia, Daniel Navarro, Borui Zhang, Tiberiu Doman, Arun Ravi, Matthieu Liger, Taha Kass-hout

    Abstract: Clinical text provides essential information to estimate the acuity of a patient during hospital stays in addition to structured clinical data. In this study, we explore how clinical text can complement a clinical predictive learning task. We leverage an internal medical natural language processing service to perform named entity extraction and negation detection on clinical notes and compose sele… ▽ More

    Submitted 3 December, 2018; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216