Skip to main content

Showing 1–2 of 2 results for author: Butgul, M

.
  1. arXiv:2503.20794  [pdf, other

    cs.CL cs.CR cs.IR cs.LG

    Can Zero-Shot Commercial APIs Deliver Regulatory-Grade Clinical Text DeIdentification?

    Authors: Veysel Kocaman, Muhammed Santas, Yigit Gul, Mehmet Butgul, David Talby

    Abstract: We evaluate the performance of four leading solutions for de-identification of unstructured medical text - Azure Health Data Services, AWS Comprehend Medical, OpenAI GPT-4o, and John Snow Labs - on a ground truth dataset of 48 clinical documents annotated by medical experts. The analysis, conducted at both entity-level and token-level, suggests that John Snow Labs' Medical Language Models solution… ▽ More

    Submitted 31 March, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

    Comments: 14 pages, accepted at Text2Story Workshop at ECIR 2025

    ACM Class: H.3; F.2.2; I.2.7

  2. arXiv:2503.17425  [pdf, other

    cs.CL cs.IR cs.LG

    Beyond Negation Detection: Comprehensive Assertion Detection Models for Clinical NLP

    Authors: Veysel Kocaman, Yigit Gul, M. Aytug Kaya, Hasham Ul Haq, Mehmet Butgul, Cabir Celik, David Talby

    Abstract: Assertion status detection is a critical yet often overlooked component of clinical NLP, essential for accurately attributing extracted medical facts. Past studies have narrowly focused on negation detection, leading to underperforming commercial solutions such as AWS Medical Comprehend, Azure AI Text Analytics, and GPT-4o due to their limited domain adaptation. To address this gap, we developed s… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: accepted at Text2Story Workshop at ECIR 2025

    MSC Class: H.3 ACM Class: H.3