Skip to main content

Showing 1–50 of 83 results for author: Yaqub, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.23145  [pdf, ps, other

    cs.LG cs.CR cs.CV

    Forget-MI: Machine Unlearning for Forgetting Multimodal Information in Healthcare Settings

    Authors: Shahad Hardan, Darya Taratynova, Abdelmajid Essofi, Karthik Nandakumar, Mohammad Yaqub

    Abstract: Privacy preservation in AI is crucial, especially in healthcare, where models rely on sensitive patient data. In the emerging field of machine unlearning, existing methodologies struggle to remove patient data from trained multimodal architectures, which are widely used in healthcare. We propose Forget-MI, a novel machine unlearning method for multimodal medical data, by establishing loss function… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  2. arXiv:2506.22900  [pdf, ps, other

    cs.CV cs.CL

    MOTOR: Multimodal Optimal Transport via Grounded Retrieval in Medical Visual Question Answering

    Authors: Mai A. Shaaban, Tausifa Jan Saleem, Vijay Ram Papineni, Mohammad Yaqub

    Abstract: Medical visual question answering (MedVQA) plays a vital role in clinical decision-making by providing contextually rich answers to image-based queries. Although vision-language models (VLMs) are widely used for this task, they often generate factually incorrect answers. Retrieval-augmented generation addresses this challenge by providing information from external sources, but risks retrieving irr… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  3. arXiv:2506.21237  [pdf, ps, other

    cs.CV

    DiMPLe -- Disentangled Multi-Modal Prompt Learning: Enhancing Out-Of-Distribution Alignment with Invariant and Spurious Feature Separation

    Authors: Umaima Rahman, Mohammad Yaqub, Dwarikanath Mahapatra

    Abstract: We introduce DiMPLe (Disentangled Multi-Modal Prompt Learning), a novel approach to disentangle invariant and spurious features across vision and language modalities in multi-modal learning. Spurious correlations in visual data often hinder out-of-distribution (OOD) performance. Unlike prior methods focusing solely on image features, DiMPLe disentangles features within and across modalities while… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  4. arXiv:2506.12006  [pdf, ps, other

    eess.IV cs.CV

    crossMoDA Challenge: Evolution of Cross-Modality Domain Adaptation Techniques for Vestibular Schwannoma and Cochlea Segmentation from 2021 to 2023

    Authors: Navodini Wijethilake, Reuben Dorent, Marina Ivory, Aaron Kujawa, Stefan Cornelissen, Patrick Langenhuizen, Mohamed Okasha, Anna Oviedova, Hexin Dong, Bogyeong Kang, Guillaume Sallé, Luyi Han, Ziyuan Zhao, Han Liu, Tao Yang, Shahad Hardan, Hussain Alasmawi, Santosh Sanjeev, Yuzhou Zhuang, Satoshi Kondo, Maria Baldeon Calisto, Shaikh Muhammad Uzair Noman, Cancan Chen, Ipek Oguz, Rongguo Zhang , et al. (14 additional authors not shown)

    Abstract: The cross-Modality Domain Adaptation (crossMoDA) challenge series, initiated in 2021 in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), focuses on unsupervised cross-modality segmentation, learning from contrast-enhanced T1 (ceT1) and transferring to T2 MRI. The task is an extreme example of domain shift chosen to serve as a mea… ▽ More

    Submitted 24 June, 2025; v1 submitted 13 June, 2025; originally announced June 2025.

  5. arXiv:2505.15425  [pdf, other

    cs.CV

    On the Robustness of Medical Vision-Language Models: Are they Truly Generalizable?

    Authors: Raza Imam, Rufael Marew, Mohammad Yaqub

    Abstract: Medical Vision-Language Models (MVLMs) have achieved par excellence generalization in medical image analysis, yet their performance under noisy, corrupted conditions remains largely untested. Clinical imaging is inherently susceptible to acquisition artifacts and noise; however, existing evaluations predominantly assess generally clean datasets, overlooking robustness -- i.e., the model's ability… ▽ More

    Submitted 23 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Dataset and Code is available at https://github.com/BioMedIA-MBZUAI/RobustMedCLIP Accepted at: Medical Image Understanding and Analysis (MIUA) 2025

  6. arXiv:2505.07527  [pdf, ps, other

    cs.LG

    Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning

    Authors: Hu Wang, Congbo Ma, Ian Reid, Mohammad Yaqub

    Abstract: Reward baseline is important for Reinforcement Learning (RL) algorithms to reduce variance in policy gradient estimates. Recently, for language modeling, Group Relative Policy Optimization (GRPO) is proposed to compute the advantage for each output by subtracting the mean reward, as the baseline, for all outputs in the group. However, it can lead to inaccurate advantage estimates in environments w… ▽ More

    Submitted 21 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

  7. arXiv:2504.15865  [pdf, other

    cs.CV cs.AI cs.LG

    MedNNS: Supernet-based Medical Task-Adaptive Neural Network Search

    Authors: Lotfi Abdelkrim Mecharbat, Ibrahim Almakky, Martin Takac, Mohammad Yaqub

    Abstract: Deep learning (DL) has achieved remarkable progress in the field of medical imaging. However, adapting DL models to medical tasks remains a significant challenge, primarily due to two key factors: (1) architecture selection, as different tasks necessitate specialized model designs, and (2) weight initialization, which directly impacts the convergence speed and final performance of the models. Alth… ▽ More

    Submitted 23 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

  8. arXiv:2504.13645  [pdf, other

    cs.CV cs.LG

    Efficient Parameter Adaptation for Multi-Modal Medical Image Segmentation and Prognosis

    Authors: Numan Saeed, Shahad Hardan, Muhammad Ridzuan, Nada Saadi, Karthik Nandakumar, Mohammad Yaqub

    Abstract: Cancer detection and prognosis relies heavily on medical imaging, particularly CT and PET scans. Deep Neural Networks (DNNs) have shown promise in tumor segmentation by fusing information from these modalities. However, a critical bottleneck exists: the dependency on CT-PET data concurrently for training and inference, posing a challenge due to the limited availability of PET scans. Hence, there i… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  9. arXiv:2504.07117  [pdf, other

    q-bio.TO cs.AI

    RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation

    Authors: Nuren Zhaksylyk, Ibrahim Almakky, Jay Paranjape, S. Swaroop Vedula, Shameema Sikder, Vishal M. Patel, Mohammad Yaqub

    Abstract: Accurate surgical instrument segmentation is essential in cataract surgery for tasks such as skill assessment and workflow optimization. However, limited annotated data makes it difficult to develop fully automatic models. Prompt-based methods like SAM2 offer flexibility yet remain highly sensitive to the point prompt placement, often leading to inconsistent segmentations. We address this issue by… ▽ More

    Submitted 25 March, 2025; originally announced April 2025.

  10. arXiv:2503.16055  [pdf, other

    eess.IV cs.CV

    SALT: Singular Value Adaptation with Low-Rank Transformation

    Authors: Abdelrahman Elsayed, Sarim Hashmi, Mohammed Elseiagy, Hu Wang, Mohammad Yaqub, Ibrahim Almakky

    Abstract: The complex nature of medical image segmentation calls for models that are specifically designed to capture detailed, domain-specific features. Large foundation models offer considerable flexibility, yet the cost of fine-tuning these models remains a significant barrier. Parameter-Efficient Fine-Tuning (PEFT) methods, such as Low-Rank Adaptation (LoRA), efficiently update model weights with low-ra… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  11. arXiv:2503.08515  [pdf, other

    cs.CV physics.med-ph

    Segmentation-Guided CT Synthesis with Pixel-Wise Conformal Uncertainty Bounds

    Authors: David Vallmanya Poch, Yorick Estievenart, Elnura Zhalieva, Sukanya Patra, Mohammad Yaqub, Souhaib Ben Taieb

    Abstract: Accurate dose calculations in proton therapy rely on high-quality CT images. While planning CTs (pCTs) serve as a reference for dosimetric planning, Cone Beam CT (CBCT) is used throughout Adaptive Radiotherapy (ART) to generate sCTs for improved dose calculations. Despite its lower cost and reduced radiation exposure advantages, CBCT suffers from severe artefacts and poor image quality, making it… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: MICCAI 2025 Conference Submission. Follows the required LNCS format. 12 pages including references. Contains 4 figures and 1 table

  12. arXiv:2502.20516  [pdf, other

    cs.CV

    In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models

    Authors: Hu Wang, Ibrahim Almakky, Congbo Ma, Numan Saeed, Mohammad Yaqub

    Abstract: Model merging is an effective strategy to merge multiple models for enhancing model performances, and more efficient than ensemble learning as it will not introduce extra computation into inference. However, limited research explores if the merging process can occur within one model and enhance the model's robustness, which is particularly critical in the medical image domain. In the paper, we are… ▽ More

    Submitted 16 May, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

  13. arXiv:2502.14807  [pdf, other

    eess.IV cs.AI cs.CV

    FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis

    Authors: Fadillah Maani, Numan Saeed, Tausifa Saleem, Zaid Farooq, Hussain Alasmawi, Werner Diehl, Ameera Mohammad, Gareth Waring, Saudabi Valappi, Leanne Bricker, Mohammad Yaqub

    Abstract: Foundation models are becoming increasingly effective in the medical domain, offering pre-trained models on large datasets that can be readily adapted for downstream tasks. Despite progress, fetal ultrasound images remain a challenging domain for foundation models due to their inherent complexity, often requiring substantial additional training and facing limitations due to the scarcity of paired… ▽ More

    Submitted 7 April, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  14. Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images

    Authors: Sevim Cengiz, Ibraheem Hamdi, Mohammad Yaqub

    Abstract: Fetal gestational age (GA) is vital clinical information that is estimated during pregnancy in order to assess fetal growth. This is usually performed by measuring the crown-rump-length (CRL) on an ultrasound image in the Dating scan which is then correlated with fetal age and growth trajectory. A major issue when performing the CRL measurement is ensuring that the image is acquired at the correct… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

    Comments: 9 pages, 2 figures

    Journal ref: Springer Lecture Notes in Computer Science (LNCS) - 2022

  15. Breaking Down the Hierarchy: A New Approach to Leukemia Classification

    Authors: Ibraheem Hamdi, Hosam El-Gendy, Ahmed Sharshar, Mohamed Saeed, Muhammad Ridzuan, Shahrukh K. Hashmi, Naveed Syed, Imran Mirza, Shakir Hussain, Amira Mahmoud Abdalla, Mohammad Yaqub

    Abstract: The complexities inherent to leukemia, multifaceted cancer affecting white blood cells, pose considerable diagnostic and treatment challenges, primarily due to reliance on laborious morphological analyses and expert judgment that are susceptible to errors. Addressing these challenges, this study presents a refined, comprehensive strategy leveraging advanced deep-learning techniques for the classif… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

    Comments: 9 pages, 11 figures

    Journal ref: Lecture Notes in Computer Science (LNCS,volume 14313) - 2023

  16. arXiv:2502.06019  [pdf, other

    cs.CV

    Noise is an Efficient Learner for Zero-Shot Vision-Language Models

    Authors: Raza Imam, Asif Hanif, Jian Zhang, Khaled Waleed Dawoud, Yova Kementchedjhieva, Mohammad Yaqub

    Abstract: Recently, test-time adaptation has garnered attention as a method for tuning models without labeled data. The conventional modus operandi for adapting pre-trained vision-language models (VLMs) during test-time primarily focuses on tuning learnable prompts; however, this approach overlooks potential distribution shifts in the visual representations themselves. In this work, we address this limitati… ▽ More

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: Our code is available at https://github.com/Razaimam45/TNT

  17. arXiv:2501.17699  [pdf, other

    eess.IV cs.AI cs.CV

    PulmoFusion: Advancing Pulmonary Health with Efficient Multi-Modal Fusion

    Authors: Ahmed Sharshar, Yasser Attia, Mohammad Yaqub, Mohsen Guizani

    Abstract: Traditional remote spirometry lacks the precision required for effective pulmonary monitoring. We present a novel, non-invasive approach using multimodal predictive models that integrate RGB or thermal video data with patient metadata. Our method leverages energy-efficient Spiking Neural Networks (SNNs) for the regression of Peak Expiratory Flow (PEF) and classification of Forced Expiratory Volume… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Journal ref: (ISBI 2025) 2025 IEEE International Symposium on Biomedical Imaging

  18. Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps

    Authors: Malik Abdul Manan, Feng Jinchao, Muhammad Yaqub, Shahzad Ahmed, Syed Muhammad Ali Imran, Imran Shabir Chuhan, Haroon Ahmed Khan

    Abstract: Colorectal polyps are structural abnormalities of the gastrointestinal tract that can potentially become cancerous in some cases. The study introduces a novel framework for colorectal polyp segmentation named the Multi-Scale and Multi-Path Cascaded Convolution Network (MMCC-Net), aimed at addressing the limitations of existing models, such as inadequate spatial dependence representation and the ab… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Journal ref: Alexandria Engineering Journal Volume 105, October 2024, Pages 341-359

  19. arXiv:2411.15872  [pdf, other

    eess.IV cs.CV

    Optimizing Brain Tumor Segmentation with MedNeXt: BraTS 2024 SSA and Pediatrics

    Authors: Sarim Hashmi, Juan Lugo, Abdelrahman Elsayed, Dinesh Saggurthi, Mohammed Elseiagy, Alikhan Nurkamal, Jaskaran Walia, Fadillah Adamsyah Maani, Mohammad Yaqub

    Abstract: Identifying key pathological features in brain MRIs is crucial for the long-term survival of glioma patients. However, manual segmentation is time-consuming, requiring expert intervention and is susceptible to human error. Therefore, significant research has been devoted to developing machine learning methods that can accurately segment tumors in 3D multimodal brain MRI scans. Despite their progre… ▽ More

    Submitted 26 November, 2024; v1 submitted 24 November, 2024; originally announced November 2024.

  20. arXiv:2411.09263  [pdf, other

    cs.LG cs.CV

    Rethinking Weight-Averaged Model-merging

    Authors: Hu Wang, Congbo Ma, Ibrahim Almakky, Ian Reid, Gustavo Carneiro, Mohammad Yaqub

    Abstract: Model-merging has emerged as a powerful approach in deep learning, capable of enhancing model performance without any training. However, the underlying mechanisms that explain its effectiveness remain largely unexplored. In this paper, we investigate this technique from three novel perspectives to empirically provide deeper insights into why and how weight-averaged model-merging~\cite{wortsman2022… ▽ More

    Submitted 16 May, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

  21. arXiv:2411.04155  [pdf, other

    eess.IV cs.CV cs.LG

    MINDSETS: Multi-omics Integration with Neuroimaging for Dementia Subtyping and Effective Temporal Study

    Authors: Salma Hassan, Dawlat Akaila, Maryam Arjemandi, Vijay Papineni, Mohammad Yaqub

    Abstract: In the complex realm of cognitive disorders, Alzheimer's disease (AD) and vascular dementia (VaD) are the two most prevalent dementia types, presenting entangled symptoms yet requiring distinct treatment approaches. The crux of effective treatment in slowing neurodegeneration lies in early, accurate diagnosis, as this significantly assists doctors in determining the appropriate course of action. H… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  22. arXiv:2411.03861  [pdf, other

    cs.CV cs.CR

    FedSECA: Sign Election and Coordinate-wise Aggregation of Gradients for Byzantine Tolerant Federated Learning

    Authors: Joseph Geo Benjamin, Mothilal Asokan, Mohammad Yaqub, Karthik Nandakumar

    Abstract: One of the most common defense strategies against Byzantine clients in federated learning (FL) is to employ a robust aggregator mechanism that makes the training more resilient. While many existing Byzantine robust aggregators provide theoretical convergence guarantees and are empirically effective against certain categories of attacks, we observe that certain high-strength attacks can subvert the… ▽ More

    Submitted 8 April, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: Accepted in 4th Workshop on Federated Learning for Computer Vision (FedVision-2025), held in conjunction with CVPR-2025

  23. arXiv:2410.01003  [pdf, other

    cs.CV

    Y-CA-Net: A Convolutional Attention Based Network for Volumetric Medical Image Segmentation

    Authors: Muhammad Hamza Sharif, Muzammal Naseer, Mohammad Yaqub, Min Xu, Mohsen Guizani

    Abstract: Recent attention-based volumetric segmentation (VS) methods have achieved remarkable performance in the medical domain which focuses on modeling long-range dependencies. However, for voxel-wise prediction tasks, discriminative local features are key components for the performance of the VS models which is missing in attention-based VS methods. Aiming at resolving this issue, we deliberately incorp… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  24. arXiv:2410.00986  [pdf, other

    eess.IV cs.CV

    TransResNet: Integrating the Strengths of ViTs and CNNs for High Resolution Medical Image Segmentation via Feature Grafting

    Authors: Muhammad Hamza Sharif, Dmitry Demidov, Asif Hanif, Mohammad Yaqub, Min Xu

    Abstract: High-resolution images are preferable in medical imaging domain as they significantly improve the diagnostic capability of the underlying method. In particular, high resolution helps substantially in improving automatic image segmentation. However, most of the existing deep learning-based techniques for medical image segmentation are optimized for input images having small spatial dimensions and p… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: The 33rd British Machine Vision Conference 2022

  25. arXiv:2409.19901  [pdf, other

    cs.LG stat.ML

    SurvCORN: Survival Analysis with Conditional Ordinal Ranking Neural Network

    Authors: Muhammad Ridzuan, Numan Saeed, Fadillah Adamsyah Maani, Karthik Nandakumar, Mohammad Yaqub

    Abstract: Survival analysis plays a crucial role in estimating the likelihood of future events for patients by modeling time-to-event data, particularly in healthcare settings where predictions about outcomes such as death and disease recurrence are essential. However, this analysis poses challenges due to the presence of censored data, where time-to-event information is missing for certain data points. Yet… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

  26. arXiv:2409.02729  [pdf, other

    cs.CV

    Can language-guided unsupervised adaptation improve medical image classification using unpaired images and texts?

    Authors: Umaima Rahman, Raza Imam, Mohammad Yaqub, Boulbaba Ben Amor, Dwarikanath Mahapatra

    Abstract: In medical image classification, supervised learning is challenging due to the scarcity of labeled medical images. To address this, we leverage the visual-textual alignment within Vision-Language Models (VLMs) to enable unsupervised learning of a medical image classifier. In this work, we propose \underline{Med}ical \underline{Un}supervised \underline{A}daptation (\texttt{MedUnA}) of VLMs, where t… ▽ More

    Submitted 29 March, 2025; v1 submitted 3 September, 2024; originally announced September 2024.

    Comments: Conference paper at International Symposium on Biomedical Imaging (ISBI) 2025

  27. arXiv:2407.21739  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation

    Authors: Mothilal Asokan, Joseph Geo Benjamin, Mohammad Yaqub, Karthik Nandakumar

    Abstract: Adapting foundation models for medical image analysis requires finetuning them on a considerable amount of data because of extreme distribution shifts between natural (source) data used for pretraining and medical (target) data. However, collecting task-specific medical data for such finetuning at a central location raises many privacy concerns. Although Federated learning (FL) provides an effecti… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  28. arXiv:2407.21738  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Leveraging Self-Supervised Learning for Fetal Cardiac Planes Classification using Ultrasound Scan Videos

    Authors: Joseph Geo Benjamin, Mothilal Asokan, Amna Alhosani, Hussain Alasmawi, Werner Gerhard Diehl, Leanne Bricker, Karthik Nandakumar, Mohammad Yaqub

    Abstract: Self-supervised learning (SSL) methods are popular since they can address situations with limited annotated data by directly utilising the underlying data distribution. However, the adoption of such methods is not explored enough in ultrasound (US) imaging, especially for fetal assessment. We investigate the potential of dual-encoder SSL in utilizing unlabelled US video data to improve the perform… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: Simplifying Medical Ultrasound: 4th International Workshop, ASMUS 2023, Held in Conjunction with MICCAI 2023, Vancouver, BC, Canada, October 8, 2023, Proceedings

  29. arXiv:2406.08486  [pdf, other

    eess.IV cs.CV

    On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models

    Authors: Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan, Fahad Shahbaz Khan

    Abstract: Volumetric medical segmentation models have achieved significant success on organ and tumor-based segmentation tasks in recent years. However, their vulnerability to adversarial attacks remains largely unexplored, raising serious concerns regarding the real-world deployment of tools employing such models in the healthcare sector. This underscores the importance of investigating the robustness of e… ▽ More

    Submitted 2 September, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted at British Machine Vision Conference 2024

  30. arXiv:2405.13482  [pdf, other

    cs.CV

    Continual Learning in Medical Imaging: A Survey and Practical Analysis

    Authors: Mohammad Areeb Qazi, Anees Ur Rehman Hashmi, Santosh Sanjeev, Ibrahim Almakky, Numan Saeed, Camila Gonzalez, Mohammad Yaqub

    Abstract: Deep Learning has shown great success in reshaping medical imaging, yet it faces numerous challenges hindering widespread application. Issues like catastrophic forgetting and distribution shifts in the continuously evolving data stream increase the gap between research and applications. Continual Learning offers promise in addressing these hurdles by enabling the sequential acquisition of new know… ▽ More

    Submitted 1 October, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures

  31. arXiv:2405.07155  [pdf, other

    cs.CV

    Meta-Learned Modality-Weighted Knowledge Distillation for Robust Multi-Modal Learning with Missing Data

    Authors: Hu Wang, Salma Hassan, Yuyuan Liu, Congbo Ma, Yuanhong Chen, Yutong Xie, Mostafa Salem, Yu Tian, Jodie Avery, Louise Hull, Ian Reid, Mohammad Yaqub, Gustavo Carneiro

    Abstract: In multi-modal learning, some modalities are more influential than others, and their absence can have a significant impact on classification/segmentation accuracy. Addressing this challenge, we propose a novel approach called Meta-learned Modality-weighted Knowledge Distillation (MetaKD), which enables multi-modal models to maintain high accuracy even when key modalities are missing. MetaKD adapti… ▽ More

    Submitted 6 March, 2025; v1 submitted 12 May, 2024; originally announced May 2024.

  32. arXiv:2405.02852  [pdf, other

    eess.IV cs.CV

    On Enhancing Brain Tumor Segmentation Across Diverse Populations with Convolutional Neural Networks

    Authors: Fadillah Maani, Anees Ur Rehman Hashmi, Numan Saeed, Mohammad Yaqub

    Abstract: Brain tumor segmentation is a fundamental step in assessing a patient's cancer progression. However, manual segmentation demands significant expert time to identify tumors in 3D multimodal brain MRI scans accurately. This reliance on manual segmentation makes the process prone to intra- and inter-observer variability. This work proposes a brain tumor segmentation method as part of the BraTS-GoAT c… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  33. arXiv:2404.14099  [pdf, other

    cs.CV

    DynaMMo: Dynamic Model Merging for Efficient Class Incremental Learning for Medical Images

    Authors: Mohammad Areeb Qazi, Ibrahim Almakky, Anees Ur Rehman Hashmi, Santosh Sanjeev, Mohammad Yaqub

    Abstract: Continual learning, the ability to acquire knowledge from new data while retaining previously learned information, is a fundamental challenge in machine learning. Various approaches, including memory replay, knowledge distillation, model regularization, and dynamic network expansion, have been proposed to address this issue. Thus far, dynamic network expansion methods have achieved state-of-the-ar… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  34. arXiv:2404.13704  [pdf, other

    eess.IV cs.CV cs.LG

    PEMMA: Parameter-Efficient Multi-Modal Adaptation for Medical Image Segmentation

    Authors: Nada Saadi, Numan Saeed, Mohammad Yaqub, Karthik Nandakumar

    Abstract: Imaging modalities such as Computed Tomography (CT) and Positron Emission Tomography (PET) are key in cancer detection, inspiring Deep Neural Networks (DNN) models that merge these scans for tumor segmentation. When both CT and PET scans are available, it is common to combine them as two channels of the input to the segmentation model. However, this method requires both scan types during training… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  35. arXiv:2403.18996  [pdf, other

    cs.CV

    Envisioning MedCLIP: A Deep Dive into Explainability for Medical Vision-Language Models

    Authors: Anees Ur Rehman Hashmi, Dwarikanath Mahapatra, Mohammad Yaqub

    Abstract: Explaining Deep Learning models is becoming increasingly important in the face of daily emerging multimodal models, particularly in safety-critical domains like medical imaging. However, the lack of detailed investigations into the performance of explainability methods on these models is widening the gap between their development and safe deployment. In this work, we analyze the performance of var… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  36. arXiv:2403.16594  [pdf, other

    eess.IV cs.CV cs.LG

    EDUE: Expert Disagreement-Guided One-Pass Uncertainty Estimation for Medical Image Segmentation

    Authors: Kudaibergen Abutalip, Numan Saeed, Ikboljon Sobirov, Vincent Andrearczyk, Adrien Depeursinge, Mohammad Yaqub

    Abstract: Deploying deep learning (DL) models in medical applications relies on predictive performance and other critical factors, such as conveying trustworthy predictive uncertainty. Uncertainty estimation (UE) methods provide potential solutions for evaluating prediction reliability and improving the model confidence calibration. Despite increasing interest in UE, challenges persist, such as the need for… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  37. MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis

    Authors: Mai A. Shaaban, Adnan Khan, Mohammad Yaqub

    Abstract: Chest X-ray images are commonly used for predicting acute and chronic cardiopulmonary conditions, but efforts to integrate them with structured clinical data face challenges due to incomplete electronic health records (EHR). This paper introduces MedPromptX, the first clinical decision support system that integrates multimodal large language models (MLLMs), few-shot prompting (FP) and visual groun… ▽ More

    Submitted 27 January, 2025; v1 submitted 22 March, 2024; originally announced March 2024.

  38. arXiv:2403.13343  [pdf, other

    cs.CV

    TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation

    Authors: Santosh Sanjeev, Fadillah Adamsyah Maani, Arsen Abzhanov, Vijay Ram Papineni, Ibrahim Almakky, Bartłomiej W. Papież, Mohammad Yaqub

    Abstract: With the emergence of vision language models in the medical imaging domain, numerous studies have focused on two dominant research activities: (1) report generation from Chest X-rays (CXR), and (2) synthetic scan generation from text or reports. Despite some research incorporating multi-view CXRs into the generative process, prior patient scans and reports have been generally disregarded. This can… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  39. arXiv:2403.13341  [pdf, other

    cs.CV cs.AI

    FissionFusion: Fast Geometric Generation and Hierarchical Souping for Medical Image Analysis

    Authors: Santosh Sanjeev, Nuren Zhaksylyk, Ibrahim Almakky, Anees Ur Rehman Hashmi, Mohammad Areeb Qazi, Mohammad Yaqub

    Abstract: The scarcity of well-annotated medical datasets requires leveraging transfer learning from broader datasets like ImageNet or pre-trained models like CLIP. Model soups averages multiple fine-tuned models aiming to improve performance on In-Domain (ID) tasks and enhance robustness against Out-of-Distribution (OOD) datasets. However, applying these methods to the medical imaging domain faces challeng… ▽ More

    Submitted 3 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  40. arXiv:2403.13078  [pdf, other

    cs.CV cs.AI cs.HC

    HuLP: Human-in-the-Loop for Prognosis

    Authors: Muhammad Ridzuan, Mai Kassem, Numan Saeed, Ikboljon Sobirov, Mohammad Yaqub

    Abstract: This paper introduces HuLP, a Human-in-the-Loop for Prognosis model designed to enhance the reliability and interpretability of prognostic models in clinical contexts, especially when faced with the complexities of missing covariates and outcomes. HuLP offers an innovative approach that enables human expert intervention, empowering clinicians to interact with and correct models' predictions, thus… ▽ More

    Submitted 9 July, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  41. arXiv:2403.11646  [pdf, other

    cs.CV

    MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks

    Authors: Ibrahim Almakky, Santosh Sanjeev, Anees Ur Rehman Hashmi, Mohammad Areeb Qazi, Hu Wang, Mohammad Yaqub

    Abstract: Transfer learning has become a powerful tool to initialize deep learning models to achieve faster convergence and higher performance. This is especially useful in the medical imaging analysis domain, where data scarcity limits possible performance gains for deep learning models. Some advancements have been made in boosting the transfer learning performance gain by merging models starting from the… ▽ More

    Submitted 15 April, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

  42. arXiv:2403.10603  [pdf, other

    cs.CV cs.AI cs.LG

    SurvRNC: Learning Ordered Representations for Survival Prediction using Rank-N-Contrast

    Authors: Numan Saeed, Muhammad Ridzuan, Fadillah Adamsyah Maani, Hussain Alasmawi, Karthik Nandakumar, Mohammad Yaqub

    Abstract: Predicting the likelihood of survival is of paramount importance for individuals diagnosed with cancer as it provides invaluable information regarding prognosis at an early stage. This knowledge enables the formulation of effective treatment plans that lead to improved patient outcomes. In the past few years, deep learning models have provided a feasible solution for assessing medical images, elec… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  43. arXiv:2403.10164  [pdf, other

    cs.CV cs.AI cs.LG

    CoReEcho: Continuous Representation Learning for 2D+time Echocardiography Analysis

    Authors: Fadillah Adamsyah Maani, Numan Saeed, Aleksandr Matsun, Mohammad Yaqub

    Abstract: Deep learning (DL) models have been advancing automatic medical image analysis on various modalities, including echocardiography, by offering a comprehensive end-to-end training pipeline. This approach enables DL models to regress ejection fraction (EF) directly from 2D+time echocardiograms, resulting in superior performance. However, the end-to-end training pipeline makes the learned representati… ▽ More

    Submitted 16 September, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  44. arXiv:2403.09400  [pdf, other

    cs.CV

    ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization

    Authors: Aleksandr Matsun, Numan Saeed, Fadillah Adamsyah Maani, Mohammad Yaqub

    Abstract: Medical data often exhibits distribution shifts, which cause test-time performance degradation for deep learning models trained using standard supervised learning pipelines. This challenge is addressed in the field of Domain Generalization (DG) with the sub-field of Single Domain Generalization (SDG) being specifically interesting due to the privacy- or logistics-related issues often associated wi… ▽ More

    Submitted 31 October, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: A flaw was found in the results acquisition

  45. arXiv:2403.09262  [pdf, other

    eess.IV cs.CV

    Advanced Tumor Segmentation in Medical Imaging: An Ensemble Approach for BraTS 2023 Adult Glioma and Pediatric Tumor Tasks

    Authors: Fadillah Maani, Anees Ur Rehman Hashmi, Mariam Aljuboory, Numan Saeed, Ikboljon Sobirov, Mohammad Yaqub

    Abstract: Automated segmentation proves to be a valuable tool in precisely detecting tumors within medical images. The accurate identification and segmentation of tumor types hold paramount importance in diagnosing, monitoring, and treating highly fatal brain tumors. The BraTS challenge serves as a platform for researchers to tackle this issue by participating in open challenges focused on tumor segmentatio… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  46. arXiv:2403.09240  [pdf, ps, other

    eess.IV cs.CV

    XReal: Realistic Anatomy and Pathology-Aware X-ray Generation via Controllable Diffusion Model

    Authors: Anees Ur Rehman Hashmi, Ibrahim Almakky, Mohammad Areeb Qazi, Santosh Sanjeev, Vijay Ram Papineni, Jagalpathy Jagdish, Mohammad Yaqub

    Abstract: Large-scale generative models have demonstrated impressive capabilities in producing visually compelling images, with increasing applications in medical imaging. However, they continue to grapple with hallucination challenges and the generation of anatomically inaccurate outputs. These limitations are mainly due to the reliance on textual inputs and lack of spatial control over the generated image… ▽ More

    Submitted 22 October, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  47. Fine-Tuned Large Language Models for Symptom Recognition from Spanish Clinical Text

    Authors: Mai A. Shaaban, Abbas Akkasi, Adnan Khan, Majid Komeili, Mohammad Yaqub

    Abstract: The accurate recognition of symptoms in clinical reports is significantly important in the fields of healthcare and biomedical natural language processing. These entities serve as essential building blocks for clinical information extraction, enabling retrieval of critical medical insights from vast amounts of textual data. Furthermore, the ability to identify and categorize these entities is fund… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  48. arXiv:2401.02723   

    cs.LG cs.CV

    Predicting Traffic Flow with Federated Learning and Graph Neural with Asynchronous Computations Network

    Authors: Muhammad Yaqub, Shahzad Ahmad, Malik Abdul Manan, Imran Shabir Chuhan

    Abstract: Real-time traffic flow prediction holds significant importance within the domain of Intelligent Transportation Systems (ITS). The task of achieving a balance between prediction precision and computational efficiency presents a significant challenge. In this article, we present a novel deep-learning method called Federated Learning and Asynchronous Graph Convolutional Network (FLAGCN). Our framewor… ▽ More

    Submitted 5 April, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: I request to withdraw my paper from arXiv due to significant updates and improvements identified post-submission. These enhancements will substantially elevate the work's quality and impact. I plan to resubmit the revised paper upon completion of these updates. Thank you for accommodating this request

  49. arXiv:2311.09607  [pdf, other

    eess.IV cs.CV

    Multi-Task Learning Approach for Unified Biometric Estimation from Fetal Ultrasound Anomaly Scans

    Authors: Mohammad Areeb Qazi, Mohammed Talha Alam, Ibrahim Almakky, Werner Gerhard Diehl, Leanne Bricker, Mohammad Yaqub

    Abstract: Precise estimation of fetal biometry parameters from ultrasound images is vital for evaluating fetal growth, monitoring health, and identifying potential complications reliably. However, the automated computerized segmentation of the fetal head, abdomen, and femur from ultrasound images, along with the subsequent measurement of fetal biometrics, remains challenging. In this work, we propose a mult… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 Pages, 4 Figures, The 4th International Conference on Medical Imaging and Computer-Aided Diagnosis

  50. arXiv:2310.19411  [pdf

    eess.IV cs.CV cs.LG

    Intelligent Breast Cancer Diagnosis with Heuristic-assisted Trans-Res-U-Net and Multiscale DenseNet using Mammogram Images

    Authors: Muhammad Yaqub, Feng Jinchao

    Abstract: Breast cancer (BC) significantly contributes to cancer-related mortality in women, underscoring the criticality of early detection for optimal patient outcomes. A mammography is a key tool for identifying and diagnosing breast abnormalities; however, accurately distinguishing malignant mass lesions remains challenging. To address this issue, we propose a novel deep learning approach for BC screeni… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 22 pages, 17 figures, 4 Tables and Appendix A: Supplementary Material