Search | arXiv e-print repository

arXiv:2504.19203 [pdf]

Improving Generalization in MRI-Based Deep Learning Models for Total Knee Replacement Prediction

Authors: Ehsan Karami, Hamid Soltanian-Zadeh

Abstract: Knee osteoarthritis (KOA) is a common joint disease that causes pain and mobility issues. While MRI-based deep learning models have demonstrated superior performance in predicting total knee replacement (TKR) and disease progression, their generalizability remains challenging, particularly when applied to imaging data from different sources. In this study, we have shown that replacing batch normal… ▽ More Knee osteoarthritis (KOA) is a common joint disease that causes pain and mobility issues. While MRI-based deep learning models have demonstrated superior performance in predicting total knee replacement (TKR) and disease progression, their generalizability remains challenging, particularly when applied to imaging data from different sources. In this study, we have shown that replacing batch normalization with instance normalization, using data augmentation, and applying contrastive loss improves model generalization in a baseline deep learning model for knee osteoarthritis (KOA) prediction. We trained and evaluated our model using MRI data from the Osteoarthritis Initiative (OAI) database, considering sagittal fat-suppressed intermediate-weighted turbo spin-echo (FS-IW-TSE) images as the source domain and sagittal fat-suppressed three-dimensional (3D) dual-echo in steady state (DESS) images as the target domain. The results demonstrate a statistically significant improvement in classification accuracy across both domains, with our approach outperforming the baseline model. △ Less

Submitted 29 April, 2025; v1 submitted 27 April, 2025; originally announced April 2025.

arXiv:2410.00779 [pdf]

Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading

Authors: Mostafa Hajighasemlou, Samad Sheikhaei, Hamid Soltanian-Zadeh

Abstract: Artificial intelligence algorithms have demonstrated their image classification and segmentation ability in the past decade. However, artificial intelligence algorithms perform less for actual clinical data than those used for simulations. This research aims to present a novel hybrid learning model using self-supervised learning and knowledge distillation, which can achieve sufficient generalizati… ▽ More Artificial intelligence algorithms have demonstrated their image classification and segmentation ability in the past decade. However, artificial intelligence algorithms perform less for actual clinical data than those used for simulations. This research aims to present a novel hybrid learning model using self-supervised learning and knowledge distillation, which can achieve sufficient generalization and robustness. The self-attention mechanism and tokens employed in ViT, besides the local-to-global learning approach used in the hybrid model, enable the proposed algorithm to extract a high-dimensional and high-quality feature space from images. To demonstrate the proposed neural network's capability in classifying and extracting feature spaces from medical images, we use it on a dataset of Diabetic Retinopathy images, specifically the EyePACS dataset. This dataset is more complex structurally and challenging regarding damaged areas than other medical images. For the first time in this study, self-supervised learning and knowledge distillation are used to classify this dataset. In our algorithm, for the first time among all self-supervised learning and knowledge distillation models, the test dataset is 50% larger than the training dataset. Unlike many studies, we have not removed any images from the dataset. Finally, our algorithm achieved an accuracy of 79.1% in the linear classifier and 74.36% in the k-NN algorithm for multiclass classification. Compared to a similar state-of-the-art model, our results achieved higher accuracy and more effective representation spaces. △ Less

Submitted 11 December, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

arXiv:2110.03002 [pdf]

Multi-Scale Convolutional Neural Network for Automated AMD Classification using Retinal OCT Images

Authors: Saman Sotoudeh-Paima, Ata Jodeiri, Fedra Hajizadeh, Hamid Soltanian-Zadeh

Abstract: Age-related macular degeneration (AMD) is the most common cause of blindness in developed countries, especially in people over 60 years of age. The workload of specialists and the healthcare system in this field has increased in recent years mainly due to the prevalence of population aging worldwide and the chronic nature of AMD. Recent developments in deep learning have provided a unique opportun… ▽ More Age-related macular degeneration (AMD) is the most common cause of blindness in developed countries, especially in people over 60 years of age. The workload of specialists and the healthcare system in this field has increased in recent years mainly due to the prevalence of population aging worldwide and the chronic nature of AMD. Recent developments in deep learning have provided a unique opportunity to develop fully automated diagnosis frameworks. Considering the presence of AMD-related retinal pathologies in varying sizes in OCT images, our objective was to propose a multi-scale convolutional neural network (CNN) capable of distinguishing pathologies using receptive fields with various sizes. The multi-scale CNN was designed based on the feature pyramid network (FPN) structure and was used to diagnose normal and two common clinical characteristics of dry and wet AMD, namely drusen and choroidal neovascularization (CNV). The proposed method was evaluated on a national dataset gathered at Noor Eye Hospital (NEH) and the UCSD public dataset. Experimental results show the superior performance of our proposed multi-scale structure over several well-known OCT classification frameworks. This feature combination strategy has proved to be effective on all tested backbone models, with improvements ranging from 0.4% to 3.3%. In addition, gradual learning has proven to improve performance in two consecutive stages. In the first stage, the performance was boosted from 87.2%+-2.5% to 92.0%+-1.6% using pre-trained ImageNet weights. In the second stage, another performance boost from 92.0%+-1.6% to 93.4%+-1.4% was observed due to fine-tuning the previous model on the UCSD dataset. Lastly, generating heatmaps provided additional proof for the effectiveness of our multi-scale structure, enabling the detection of retinal pathologies appearing in different sizes. △ Less

Submitted 4 February, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

arXiv:2103.00087 [pdf]

CXR-Net: An Artificial Intelligence Pipeline for Quick Covid-19 Screening of Chest X-Rays

Authors: Haikal Abdulah, Benjamin Huber, Sinan Lal, Hassan Abdallah, Luigi L. Palese, Hamid Soltanian-Zadeh, Domenico L. Gatti

Abstract: CXR-Net is a two-module Artificial Intelligence pipeline for the quick detection of SARS-CoV-2 from chest X-rays (CXRs). Module 1 was trained on a public dataset of 6395 CXRs with radiologist annotated lung contours to generate masks of the lungs that overlap the heart and large vasa. Module 2 is a hybrid convnet in which the first convolutional layer with learned coefficients is replaced by a lay… ▽ More CXR-Net is a two-module Artificial Intelligence pipeline for the quick detection of SARS-CoV-2 from chest X-rays (CXRs). Module 1 was trained on a public dataset of 6395 CXRs with radiologist annotated lung contours to generate masks of the lungs that overlap the heart and large vasa. Module 2 is a hybrid convnet in which the first convolutional layer with learned coefficients is replaced by a layer with fixed coefficients provided by the Wavelet Scattering Transform (WST). Module 2 takes as inputs the patients CXRs and corresponding lung masks calculated by Module 1, and produces as outputs a class assignment (Covid vs. non-Covid) and high resolution heat maps that identify the SARS associated lung regions. Module 2 was trained on a dataset of CXRs from non-Covid and RT-PCR confirmed Covid patients acquired at the Henry Ford Health System (HFHS) Hospital in Detroit. All non-Covid CXRs were from pre-Covid era (2018-2019), and included images from both normal lungs and lungs affected by non-Covid pathologies. Training and test sets consisted of 2265 CXRs (1417 Covid negative, 848 Covid positive), and 1532 CXRs (945 Covid negative, 587 Covid positive), respectively. Six distinct cross-validation models, each trained on 1887 images and validated against 378 images, were combined into an ensemble model that was used to classify the CXR images of the test set with resulting Accuracy = 0.789, Precision = 0.739, Recall = 0.693, F1 score = 0.715, ROC(AUC) = 0.852. △ Less

Submitted 26 February, 2021; originally announced March 2021.

Comments: 16 pages, 14 figures. arXiv admin note: substantial text overlap with arXiv:2011.08655

MSC Class: 68Txx (Primary); 68T07 (Secondary); 92B20; 68T45 ACM Class: I.2; I.4; I.5; J.3

arXiv:2011.08655 [pdf]

Lung Segmentation in Chest X-rays with Res-CR-Net

Authors: Haikal Abdulah, Benjamin Huber, Sinan Lal, Hassan Abdallah, Hamid Soltanian-Zadeh, Domenico L. Gatti

Abstract: Deep Neural Networks (DNN) are widely used to carry out segmentation tasks in biomedical images. Most DNNs developed for this purpose are based on some variation of the encoder-decoder U-Net architecture. Here we show that Res-CR-Net, a new type of fully convolutional neural network, which was originally developed for the semantic segmentation of microscopy images, and which does not adopt a U-Net… ▽ More Deep Neural Networks (DNN) are widely used to carry out segmentation tasks in biomedical images. Most DNNs developed for this purpose are based on some variation of the encoder-decoder U-Net architecture. Here we show that Res-CR-Net, a new type of fully convolutional neural network, which was originally developed for the semantic segmentation of microscopy images, and which does not adopt a U-Net architecture, is very effective at segmenting the lung fields in chest X-rays from either healthy patients or patients with a variety of lung pathologies. △ Less

Submitted 13 November, 2020; originally announced November 2020.

Comments: 8 pages, 5 figures

MSC Class: 68Txx (Primary); 68T07 (Secondary); 92B20; 68T45 ACM Class: I.2; I.4; J.3

arXiv:2008.03387 [pdf]

Comparative Evaluation Of Three Methods Of Automatic Segmentation Of Brain Structures Using 426 Cases

Authors: Mohammad-Parsa Hosseini, Esmaeil Davoodi, Evangelia Bouzos, Kost Elisevich, Hamid Soltanian-Zadeh

Abstract: Segmentation of brain structures in a large dataset of magnetic resonance images (MRI) necessitates automatic segmentation instead of manual tracing. Automatic segmentation methods provide a much-needed alternative to manual segmentation which is both labor intensive and time-consuming. Among brain structures, the hippocampus presents a challenging segmentation task due to its irregular shape, sma… ▽ More Segmentation of brain structures in a large dataset of magnetic resonance images (MRI) necessitates automatic segmentation instead of manual tracing. Automatic segmentation methods provide a much-needed alternative to manual segmentation which is both labor intensive and time-consuming. Among brain structures, the hippocampus presents a challenging segmentation task due to its irregular shape, small size, and unclear edges. In this work, we use T1-weighted MRI of 426 subjects to validate the approach and compare three automatic segmentation methods: FreeSurfer, LocalInfo, and ABSS. Four evaluation measures are used to assess agreement between automatic and manual segmentation of the hippocampus. ABSS outperformed the others based on the Dice coefficient, precision, Hausdorff distance, ASSD, RMS, similarity, sensitivity, and volume agreement. Moreover, comparison of the segmentation results, acquired using 1.5T and 3T MRI systems, showed that ABSS is more sensitive than the others to the field inhomogeneity of 3T MRI. △ Less

Submitted 7 August, 2020; originally announced August 2020.

arXiv:1907.08705 [pdf]

doi 10.1371/journal.pone.0226048

Enhancing performance of subject-specific models via subject-independent information for SSVEP-based BCIs

Authors: Mohammad Hadi Mehdizavareh, Sobhan Hemati, Hamid Soltanian-Zadeh

Abstract: Recently, brain-computer interface (BCI) systems developed based on steady-state visual evoked potential (SSVEP) have attracted much attention due to their high information transfer rate (ITR) and increasing number of targets. However, SSVEP-based methods can be improved in terms of their accuracy and target detection time. We propose a new method based on canonical correlation analysis (CCA) to i… ▽ More Recently, brain-computer interface (BCI) systems developed based on steady-state visual evoked potential (SSVEP) have attracted much attention due to their high information transfer rate (ITR) and increasing number of targets. However, SSVEP-based methods can be improved in terms of their accuracy and target detection time. We propose a new method based on canonical correlation analysis (CCA) to integrate subject-specific models and subject-independent information and enhance BCI performance. We propose to use training data of other subjects to optimize hyperparameters for CCA-based model of a specific subject. An ensemble version of the proposed method is also developed for a fair comparison with ensemble task-related component analysis (TRCA). The proposed method is compared with TRCA and extended CCA methods. A publicly available, 35-subject SSVEP benchmark dataset is used for comparison studies and performance is quantified by classification accuracy and ITR. The ITR of the proposed method is higher than those of TRCA and extended CCA. The proposed method outperforms extended CCA in all conditions and TRCA for time windows greater than 0.3 s. The proposed method also outperforms TRCA when there are limited training blocks and electrodes. This study illustrates that adding subject-independent information to subject-specific models can improve performance of SSVEP-based BCIs. △ Less

Submitted 15 January, 2020; v1 submitted 19 July, 2019; originally announced July 2019.

Comments: 22 pages, 8 figures, 1 table, 1 appendix, published in PLOS ONE journal. This is a draft version. The published version is available in the following link: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0226048

Journal ref: PLOS ONE 15(1): e0226048 (2020)

Showing 1–7 of 7 results for author: Soltanian-Zadeh, H