Skip to main content

Showing 1–25 of 25 results for author: Verjans, J

.
  1. arXiv:2505.15123  [pdf, ps, other

    cs.CV cs.AI

    Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding

    Authors: Ta Duc Huy, Duy Anh Huynh, Yutong Xie, Yuankai Qi, Qi Chen, Phi Le Nguyen, Sen Kim Tran, Son Lam Phung, Anton van den Hengel, Zhibin Liao, Minh-Son To, Johan W. Verjans, Vu Minh Hieu Phan

    Abstract: Visual grounding (VG) is the capability to identify the specific regions in an image associated with a particular text description. In medical imaging, VG enhances interpretability by highlighting relevant pathological features corresponding to textual descriptions, improving model transparency and trustworthiness for wider adoption of deep learning models in clinical practice. Current models stru… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Under Review

  2. arXiv:2505.00744  [pdf, other

    cs.CV

    Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs

    Authors: Dung Nguyen, Minh Khoi Ho, Huy Ta, Thanh Tam Nguyen, Qi Chen, Kumar Rav, Quy Duong Dang, Satwik Ramchandre, Son Lam Phung, Zhibin Liao, Minh-Son To, Johan Verjans, Phi Le Nguyen, Vu Minh Hieu Phan

    Abstract: Medical Large Multi-modal Models (LMMs) have demonstrated remarkable capabilities in medical data interpretation. However, these models frequently generate hallucinations contradicting source evidence, particularly due to inadequate localization reasoning. This work reveals a critical limitation in current medical LMMs: instead of analyzing relevant pathological regions, they often rely on linguis… ▽ More

    Submitted 21 May, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

    Comments: Accepted at Joint Conference on Artificial Intelligence (IJCAI) 2025

  3. arXiv:2503.06873  [pdf, other

    cs.CV cs.AI cs.LG

    Interactive Medical Image Analysis with Concept-based Similarity Reasoning

    Authors: Ta Duc Huy, Sen Kim Tran, Phan Nguyen, Nguyen Hoang Tran, Tran Bao Sam, Anton van den Hengel, Zhibin Liao, Johan W. Verjans, Minh-Son To, Vu Minh Hieu Phan

    Abstract: The ability to interpret and intervene model decisions is important for the adoption of computer-aided diagnosis methods in clinical workflows. Recent concept-based methods link the model predictions with interpretable concepts and modify their activation scores to interact with the model. However, these concepts are at the image level, which hinders the model from pinpointing the exact patches th… ▽ More

    Submitted 11 March, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

    Comments: Accepted CVPR2025

    Journal ref: CVPR 2025

  4. arXiv:2411.12195  [pdf, other

    cs.CV

    A Survey of Medical Vision-and-Language Applications and Their Techniques

    Authors: Qi Chen, Ruoshan Zhao, Sinuo Wang, Vu Minh Hieu Phan, Anton van den Hengel, Johan Verjans, Zhibin Liao, Minh-Son To, Yong Xia, Jian Chen, Yutong Xie, Qi Wu

    Abstract: Medical vision-and-language models (MVLMs) have attracted substantial interest due to their capability to offer a natural language interface for interpreting complex medical data. Their applications are versatile and have the potential to improve diagnostic accuracy and decision-making for individual patients while also contributing to enhanced public health monitoring, disease surveillance, and p… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  5. arXiv:2408.02001  [pdf, other

    cs.CV

    AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis

    Authors: Townim F. Chowdhury, Vu Minh Hieu Phan, Kewen Liao, Minh-Son To, Yutong Xie, Anton van den Hengel, Johan W. Verjans, Zhibin Liao

    Abstract: The integration of vision-language models such as CLIP and Concept Bottleneck Models (CBMs) offers a promising approach to explaining deep neural network (DNN) decisions using concepts understandable by humans, addressing the black-box concern of DNNs. While CLIP provides both explainability and zero-shot classification capability, its pre-training on generic image and text data may limit its clas… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: Accepted at MICCAI 2024, the 27th International Conference on Medical Image Computing and Computer Assisted Intervention

  6. arXiv:2407.14825  [pdf

    physics.optics physics.bio-ph physics.med-ph

    3D-printed axicon enables extended depth-of-focus intravascular optical coherence tomography

    Authors: Pavel Ruchka, Alok Kushwaha, Jessica A. Marathe, Lei Xiang, Rouyan Chen, Rodney Kirk, Joanne T. M. Tan, Christina A. Bursill, Johan Verjans, Simon Thiele, Robert Fitridge, Robert A. McLaughlin, Peter J. Psaltis, Harald Giessen, Jiawen Li

    Abstract: A fundamental challenge in endoscopy is how to fabricate a small fiber-optic probe that can achieve comparable function to probes with large, complicated optics (e.g., high resolution and extended depth of focus). To achieve high resolution over an extended depth of focus (DOF), the application of needle-like beams has been proposed. However, existing methods using miniaturized needle beam designs… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  7. arXiv:2406.18967  [pdf, other

    cs.CV

    Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis

    Authors: Vu Minh Hieu Phan, Yutong Xie, Bowen Zhang, Yuankai Qi, Zhibin Liao, Antonios Perperidis, Son Lam Phung, Johan W. Verjans, Minh-Son To

    Abstract: Unpaired medical image synthesis aims to provide complementary information for an accurate clinical diagnostics, and address challenges in obtaining aligned multi-modal medical scans. Transformer-based models excel in imaging translation tasks thanks to their ability to capture long-range dependencies. Although effective in supervised training settings, their performance falters in unpaired image… ▽ More

    Submitted 28 August, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: MICCAI version before camera ready

  8. arXiv:2404.02388  [pdf, other

    cs.CV

    CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation

    Authors: Townim Faisal Chowdhury, Kewen Liao, Vu Minh Hieu Phan, Minh-Son To, Yutong Xie, Kevin Hung, David Ross, Anton van den Hengel, Johan W. Verjans, Zhibin Liao

    Abstract: Deep Neural Networks (DNNs) are widely used for visual classification tasks, but their complex computation process and black-box nature hinder decision transparency and interpretability. Class activation maps (CAMs) and recent variants provide ways to visually explain the DNN decision-making process by displaying 'attention' heatmaps of the DNNs. Nevertheless, the CAM explanation only offers relat… ▽ More

    Submitted 4 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  9. arXiv:2403.07636  [pdf, other

    cs.CV

    Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework

    Authors: Vu Minh Hieu Phan, Yutong Xie, Yuankai Qi, Lingqiao Liu, Liyang Liu, Bowen Zhang, Zhibin Liao, Qi Wu, Minh-Son To, Johan W. Verjans

    Abstract: Medical vision language pre-training (VLP) has emerged as a frontier of research, enabling zero-shot pathological recognition by comparing the query image with the textual descriptions for each disease. Due to the complex semantics of biomedical texts, current methods struggle to align medical images with key pathological findings in unstructured reports. This leads to the misalignment with the ta… ▽ More

    Submitted 31 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR2024. Pre-print before final camera-ready version

    Journal ref: CVPR2024

  10. arXiv:2311.06956  [pdf, other

    cs.CV

    SegReg: Segmenting OARs by Registering MR Images and CT Annotations

    Authors: Zeyu Zhang, Xuyin Qi, Bowen Zhang, Biao Wu, Hien Le, Bora Jeong, Zhibin Liao, Yunxiang Liu, Johan Verjans, Minh-Son To, Richard Hartley

    Abstract: Organ at risk (OAR) segmentation is a critical process in radiotherapy treatment planning such as head and neck tumors. Nevertheless, in clinical practice, radiation oncologists predominantly perform OAR segmentations manually on CT scans. This manual process is highly time-consuming and expensive, limiting the number of patients who can receive timely radiotherapy. Additionally, CT scans offer lo… ▽ More

    Submitted 1 March, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted to ISBI 2024

  11. arXiv:2307.16143  [pdf, other

    eess.IV cs.CV

    Structure-Preserving Synthesis: MaskGAN for Unpaired MR-CT Translation

    Authors: Minh Hieu Phan, Zhibin Liao, Johan W. Verjans, Minh-Son To

    Abstract: Medical image synthesis is a challenging task due to the scarcity of paired data. Several methods have applied CycleGAN to leverage unpaired data, but they often generate inaccurate mappings that shift the anatomy. This problem is further exacerbated when the images from the source and target modalities are heavily misaligned. Recently, current methods have aimed to address this issue by incorpora… ▽ More

    Submitted 31 July, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: Accepted to MICCAI 2023

    Journal ref: MICCAI 2023

  12. arXiv:2303.01099  [pdf, other

    cs.CV cs.AI

    Multi-Head Multi-Loss Model Calibration

    Authors: Adrian Galdran, Johan Verjans, Gustavo Carneiro, Miguel A. González Ballester

    Abstract: Delivering meaningful uncertainty estimates is essential for a successful deployment of machine learning models in the clinical practice. A central aspect of uncertainty quantification is the ability of a model to return predictions that are well-aligned with the actual probability of the model being correct, also known as model calibration. Although many methods have been proposed to improve cali… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Under review

  13. arXiv:2203.12121  [pdf, other

    cs.CV

    Contrastive Transformer-based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection

    Authors: Yu Tian, Guansong Pang, Fengbei Liu, Yuyuan Liu, Chong Wang, Yuanhong Chen, Johan W Verjans, Gustavo Carneiro

    Abstract: Current polyp detection methods from colonoscopy videos use exclusively normal (i.e., healthy) training images, which i) ignore the importance of temporal information in consecutive video frames, and ii) lack knowledge about the polyps. Consequently, they often have high detection errors, especially on challenging polyp cases (e.g., small, flat, or partially visible polyps). In this work, we formu… ▽ More

    Submitted 18 May, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: MICCAI 2022 Early Accept

  14. arXiv:2203.11725  [pdf, other

    eess.IV cs.CV

    Unsupervised Anomaly Detection in Medical Images with a Memory-augmented Multi-level Cross-attentional Masked Autoencoder

    Authors: Yu Tian, Guansong Pang, Yuyuan Liu, Chong Wang, Yuanhong Chen, Fengbei Liu, Rajvinder Singh, Johan W Verjans, Mengyu Wang, Gustavo Carneiro

    Abstract: Unsupervised anomaly detection (UAD) aims to find anomalous images by optimising a detector using a training set that contains only normal images. UAD approaches can be based on reconstruction methods, self-supervised approaches, and Imagenet pre-trained models. Reconstruction methods, which detect anomalies from image reconstruction errors, are advantageous because they do not rely on the design… ▽ More

    Submitted 21 August, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted to MICCAI MLMI2023

  15. CNN Attention Guidance for Improved Orthopedics Radiographic Fracture Classification

    Authors: Zhibin Liao, Kewen Liao, Haifeng Shen, Marouska F. van Boxel, Jasper Prijs, Ruurd L. Jaarsma, Job N. Doornberg, Anton van den Hengel, Johan W. Verjans

    Abstract: Convolutional neural networks (CNNs) have gained significant popularity in orthopedic imaging in recent years due to their ability to solve fracture classification problems. A common criticism of CNNs is their opaque learning and reasoning process, making it difficult to trust machine diagnosis and the subsequent adoption of such algorithms in clinical setting. This is especially true when the CNN… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: 12 pages, Published in IEEE Journal of Biomedical and Health Informatics

  16. Mutual information neural estimation for unsupervised multi-modal registration of brain images

    Authors: Gerard Snaauw, Michele Sasdelli, Gabriel Maicas, Stephan Lau, Johan Verjans, Mark Jenkinson, Gustavo Carneiro

    Abstract: Many applications in image-guided surgery and therapy require fast and reliable non-linear, multi-modal image registration. Recently proposed unsupervised deep learning-based registration methods have demonstrated superior performance compared to iterative methods in just a fraction of the time. Most of the learning-based methods have focused on mono-modal image registration. The extension to mult… ▽ More

    Submitted 6 October, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: 4 pages, 4 figures, 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), oral presentation

    Journal ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2022, pp. 3510-3513

  17. arXiv:2109.01303  [pdf, other

    eess.IV cs.CV

    Self-supervised Pseudo Multi-class Pre-training for Unsupervised Anomaly Detection and Segmentation in Medical Images

    Authors: Yu Tian, Fengbei Liu, Guansong Pang, Yuanhong Chen, Yuyuan Liu, Johan W. Verjans, Rajvinder Singh, Gustavo Carneiro

    Abstract: Unsupervised anomaly detection (UAD) methods are trained with normal (or healthy) images only, but during testing, they are able to classify normal and abnormal (or disease) images. UAD is an important medical image analysis (MIA) method to be applied in disease screening problems because the training sets available for those problems usually contain only normal images. However, the exclusive reli… ▽ More

    Submitted 14 August, 2023; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: Accepted to Medical Image Analysis

  18. arXiv:2103.03423  [pdf, other

    cs.CV

    Constrained Contrastive Distribution Learning for Unsupervised Anomaly Detection and Localisation in Medical Images

    Authors: Yu Tian, Guansong Pang, Fengbei Liu, Yuanhong chen, Seon Ho Shin, Johan W. Verjans, Rajvinder Singh, Gustavo Carneiro

    Abstract: Unsupervised anomaly detection (UAD) learns one-class classifiers exclusively with normal (i.e., healthy) images to detect any abnormal (i.e., unhealthy) samples that do not conform to the expected normal patterns. UAD has two main advantages over its fully supervised counterpart. Firstly, it is able to directly leverage large datasets available from health screening programs that contain mostly n… ▽ More

    Submitted 30 June, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted at MICCAI 2021

  19. arXiv:2101.10030  [pdf, other

    cs.CV

    Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning

    Authors: Yu Tian, Guansong Pang, Yuanhong Chen, Rajvinder Singh, Johan W. Verjans, Gustavo Carneiro

    Abstract: Anomaly detection with weakly supervised video-level labels is typically formulated as a multiple instance learning (MIL) problem, in which we aim to identify snippets containing abnormal events, with each video represented as a bag of video snippets. Although current methods show effective detection performance, their recognition of the positive instances, i.e., rare abnormal snippets in the abno… ▽ More

    Submitted 5 August, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted to ICCV 2021

  20. arXiv:2101.03285  [pdf, other

    cs.CV cs.LG

    Detecting, Localising and Classifying Polyps from Colonoscopy Videos using Deep Learning

    Authors: Yu Tian, Leonardo Zorron Cheng Tao Pu, Yuyuan Liu, Gabriel Maicas, Johan W. Verjans, Alastair D. Burt, Seon Ho Shin, Rajvinder Singh, Gustavo Carneiro

    Abstract: In this paper, we propose and analyse a system that can automatically detect, localise and classify polyps from colonoscopy videos. The detection of frames with polyps is formulated as a few-shot anomaly classification problem, where the training set is highly imbalanced with the large majority of frames consisting of normal images and a small minority comprising frames with polyps. Colonoscopy vi… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

    Comments: Preprint to submit to IEEE journals

  21. arXiv:2008.02699  [pdf, other

    cs.CV cs.LG eess.IV

    Pairwise Relation Learning for Semi-supervised Gland Segmentation

    Authors: Yutong Xie, Jianpeng Zhang, Zhibin Liao, Chunhua Shen, Johan Verjans, Yong Xia

    Abstract: Accurate and automated gland segmentation on histology tissue images is an essential but challenging task in the computer-aided diagnosis of adenocarcinoma. Despite their prevalence, deep learning models always require a myriad number of densely annotated training images, which are difficult to obtain due to extensive labor and associated expert costs related to histology image annotations. In thi… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted by MICCAI2020

  22. arXiv:2006.14811  [pdf, other

    cs.CV

    Few-Shot Anomaly Detection for Polyp Frames from Colonoscopy

    Authors: Yu Tian, Gabriel Maicas, Leonardo Zorron Cheng Tao Pu, Rajvinder Singh, Johan W. Verjans, Gustavo Carneiro

    Abstract: Anomaly detection methods generally target the learning of a normal image distribution (i.e., inliers showing healthy cases) and during testing, samples relatively far from the learned distribution are classified as anomalies (i.e., outliers showing disease cases). These approaches tend to be sensitive to outliers that lie relatively close to inliers (e.g., a colonoscopy image with a small polyp).… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: Accept at MICCAI 2020

  23. arXiv:2003.12338  [pdf, other

    eess.IV cs.CV

    Viral Pneumonia Screening on Chest X-ray Images Using Confidence-Aware Anomaly Detection

    Authors: Jianpeng Zhang, Yutong Xie, Guansong Pang, Zhibin Liao, Johan Verjans, Wenxin Li, Zongji Sun, Jian He, Yi Li, Chunhua Shen, Yong Xia

    Abstract: Cluster of viral pneumonia occurrences during a short period of time may be a harbinger of an outbreak or pandemic, like SARS, MERS, and recent COVID-19. Rapid and accurate detection of viral pneumonia using chest X-ray can be significantly useful in large-scale screening and epidemic prevention, particularly when other chest imaging modalities are less available. Viral pneumonia often have divers… ▽ More

    Submitted 1 December, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

    Comments: Accepted to IEEE Trans. Medical Imaging. 12 pages

  24. arXiv:1910.10345  [pdf, other

    eess.IV cs.CV

    Unsupervised Dual Adversarial Learning for Anomaly Detection in Colonoscopy Video Frames

    Authors: Yuyuan Liu, Yu Tian, Gabriel Maicas, Leonardo Z. C. T. Pu, Rajvinder Singh, Johan W. Verjans, Gustavo Carneiro

    Abstract: The automatic detection of frames containing polyps from a colonoscopy video sequence is an important first step for a fully automated colonoscopy analysis tool. Typically, such detection system is built using a large annotated data set of frames with and without polyps, which is expensive to be obtained. In this paper, we introduce a new system that detects frames containing polyps as anomalies f… ▽ More

    Submitted 6 February, 2021; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: Accepted by ISBI 2020

  25. arXiv:1810.10117  [pdf, other

    cs.CV

    End-to-End Diagnosis and Segmentation Learning from Cardiac Magnetic Resonance Imaging

    Authors: Gerard Snaauw, Dong Gong, Gabriel Maicas, Anton van den Hengel, Wiro J. Niessen, Johan Verjans, Gustavo Carneiro

    Abstract: Cardiac magnetic resonance (CMR) is used extensively in the diagnosis and management of cardiovascular disease. Deep learning methods have proven to deliver segmentation results comparable to human experts in CMR imaging, but there have been no convincing results for the problem of end-to-end segmentation and diagnosis from CMR. This is in part due to a lack of sufficiently large datasets required… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: submitted to 2019 IEEE International Symposium on Biomedical Imaging (ISBI)