
Showing 1–5 of 5 results for author: Low, C H

  1. arXiv:2506.02555  [pdf, other]

    cs.CV

    SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence

    Authors: Zhitao Zeng, Zhu Zhuo, Xiaojun Jia, Erli Zhang, Junde Wu, Jiaan Zhang, Yuxuan Wang, Chang Han Low, Jian Jiang, Zilong Zheng, Xiaochun Cao, Yutong Ban, Qi Dou, Yang Liu, Yueming Jin

    Abstract: Foundation models have achieved transformative success across biomedical domains by enabling holistic understanding of multimodal data. However, their application in surgery remains underexplored. Surgical intelligence presents unique challenges - requiring surgical visual perception, temporal analysis, and reasoning. Existing general-purpose vision-language models fail to address these needs due…

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 29 pages, 5 figures

    MSC Class: 68T45; ACM Class: I.2.10

  2. arXiv:2503.18968  [pdf, other]

    cs.AI

    MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow

    Authors: Ziyue Wang, Junde Wu, Linghan Cai, Chang Han Low, Xihong Yang, Qiaxuan Li, Yueming Jin

    Abstract: In modern medicine, clinical diagnosis relies on the comprehensive analysis of primarily textual and visual data, drawing on medical expertise to ensure systematic and rigorous reasoning. Recent advances in large Vision-Language Models (VLMs) and agent-based methods hold great potential for medical diagnosis, thanks to the ability to effectively integrate multi-modal patient data. However, they of…

    Submitted 22 May, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  3. arXiv:2503.10265  [pdf, other]

    cs.AI cs.RO

    SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence

    Authors: Chang Han Low, Ziyue Wang, Tianyi Zhang, Zhitao Zeng, Zhu Zhuo, Evangelos B. Mazomenos, Yueming Jin

    Abstract: Integration of Vision-Language Models (VLMs) in surgical intelligence is hindered by hallucinations, domain knowledge gaps, and limited understanding of task interdependencies within surgical scenes, undermining clinical reliability. While recent VLMs demonstrate strong general reasoning and thinking capabilities, they still lack the domain expertise and task-awareness required for precise surgica…

    Submitted 13 March, 2025; originally announced March 2025.

  4. arXiv:2503.03152  [pdf, other]

    eess.IV q-bio.QM

    UnPuzzle: A Unified Framework for Pathology Image Analysis

    Authors: Dankai Liao, Sicheng Chen, Nuwa Xi, Qiaochu Xue, Jieyu Li, Lingxuan Hou, Zeyu Liu, Chang Han Low, Yufeng Wu, Yiling Liu, Yanqin Jiang, Dandan Li, Shangqing Lyu

    Abstract: Pathology image analysis plays a pivotal role in medical diagnosis, with deep learning techniques significantly advancing diagnostic accuracy and research. While numerous studies have been conducted to address specific pathological tasks, the lack of standardization in pre-processing methods and model/database architectures complicates fair comparisons across different approaches. This highlights…

    Submitted 28 March, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: 11 pages, 2 figures

  5. arXiv:2402.16664  [pdf, other]

    cs.IR

    LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery

    Authors: Yuyang Du, Kexin Chen, Yue Zhan, Chang Han Low, Tao You, Mobarakol Islam, Ziyu Guo, Yueming Jin, Guangyong Chen, Pheng-Ann Heng

    Abstract: Visual question answering (VQA) is crucial for promoting surgical education. In practice, the needs of trainees are constantly evolving, such as learning more surgical types, adapting to different robots, and learning new surgical instruments and techniques for various surgeries. However, patient data privacy often restricts the availability of old data when updating the model, necessitating an ex…

    Submitted 23 October, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted by the 2024 IEEE International Conference on Robotics and Automation (ICRA)