Showing 1–5 of 5 results for author: Shing, H

  1. arXiv:2506.00448  [pdf, ps, other]

    cs.CL

    Fact-Controlled Diagnosis of Hallucinations in Medical Text Summarization

    Authors: Suhas BN, Han-Chin Shing, Lei Xu, Mitch Strong, Jon Burnsky, Jessica Ofor, Jordan R. Mason, Susan Chen, Sundararajan Srinivasan, Chaitanya Shivade, Jack Moriarty, Joseph Paul Cohen

    Abstract: Hallucinations in large language models (LLMs) during summarization of patient-clinician dialogues pose significant risks to patient care and clinical decision-making. However, the phenomenon remains understudied in the clinical domain, with uncertainty surrounding the applicability of general-domain hallucination detectors. The rarity and randomness of hallucinations further complicate their inve…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: https://github.com/amazon-science/acibench-hallucination-annotations

  2. arXiv:2208.07444  [pdf, other]

    cs.LG cs.CL

    Entity Anchored ICD Coding

    Authors: Jay DeYoung, Han-Chin Shing, Luyang Kong, Christopher Winestock, Chaitanya Shivade

    Abstract: Medical coding is a complex task, requiring assignment of a subset of over 72,000 ICD codes to a patient's notes. Modern natural language processing approaches to these tasks have been challenged by the length of the input and size of the output space. We limit our model inputs to a small window around medical entities found in our documents. From those local contexts, we build contextualized repr…

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Accepted to American Medical Informatics Association (AMIA) 2022 Annual Symposium

  3. arXiv:2204.10290  [pdf, other]

    cs.CL

    Learning to Revise References for Faithful Summarization

    Authors: Griffin Adams, Han-Chin Shing, Qing Sun, Christopher Winestock, Kathleen McKeown, Noémie Elhadad

    Abstract: In real-world scenarios with naturally occurring datasets, reference summaries are noisy and may contain information that cannot be inferred from the source text. On large news corpora, removing low quality samples has been shown to reduce model hallucinations. Yet, for smaller, and/or noisier corpora, filtering is detrimental to performance. To improve reference quality while retaining all data,…

    Submitted 11 October, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Findings of EMNLP 2022

  4. arXiv:2104.13498  [pdf, other]

    cs.CL cs.LG

    Towards Clinical Encounter Summarization: Learning to Compose Discharge Summaries from Prior Notes

    Authors: Han-Chin Shing, Chaitanya Shivade, Nima Pourdamghani, Feng Nan, Philip Resnik, Douglas Oard, Parminder Bhatia

    Abstract: The records of a clinical encounter can be extensive and complex, thus placing a premium on tools that can extract and summarize relevant information. This paper introduces the task of generating discharge summaries for a clinical encounter. Summaries in this setting need to be faithful, traceable, and scale to multiple long documents, motivating the use of extract-then-abstract summarization casc…

    Submitted 27 April, 2021; originally announced April 2021.

  5. arXiv:1911.06848  [pdf, other]

    cs.CL cs.LG

    Assigning Medical Codes at the Encounter Level by Paying Attention to Documents

    Authors: Han-Chin Shing, Guoli Wang, Philip Resnik

    Abstract: The vast majority of research in computer assisted medical coding focuses on coding at the document level, but a substantial proportion of medical coding in the real world involves coding at the level of clinical encounters, each of which is typically represented by a potentially large set of documents. We introduce encounter-level document attention networks, which use hierarchical attention to e…

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract