-
Inversion of Magnetic Data using Learned Dictionaries and Scale Space
Authors:
Shadab Ahamed,
Simon Ghyselincks,
Pablo Chang Huang Arias,
Julian Kloiber,
Yasin Ranjbar,
Jingrong Tang,
Niloufar Zakariaei,
Eldad Haber
Abstract:
Magnetic data inversion is an important tool in geophysics, used to infer subsurface magnetic susceptibility distributions from surface magnetic field measurements. This inverse problem is inherently ill-posed, characterized by non-unique solutions, depth ambiguity, and sensitivity to noise. Traditional inversion approaches rely on predefined regularization techniques to stabilize solutions, limit…
▽ More
Magnetic data inversion is an important tool in geophysics, used to infer subsurface magnetic susceptibility distributions from surface magnetic field measurements. This inverse problem is inherently ill-posed, characterized by non-unique solutions, depth ambiguity, and sensitivity to noise. Traditional inversion approaches rely on predefined regularization techniques to stabilize solutions, limiting their adaptability to complex or diverse geological scenarios. In this study, we propose an approach that integrates variable dictionary learning and scale-space methods to address these challenges. Our method employs learned dictionaries, allowing for adaptive representation of complex subsurface features that are difficult to capture with predefined bases. Additionally, we extend classical variational inversion by incorporating multi-scale representations through a scale-space framework, enabling the progressive introduction of structural detail while mitigating overfitting. We implement both fixed and dynamic dictionary learning techniques, with the latter introducing iteration-dependent dictionaries for enhanced flexibility. Using a synthetic dataset to simulate geological scenarios, we demonstrate significant improvements in reconstruction accuracy and robustness compared to conventional variational and dictionary-based methods. Our results highlight the potential of learned dictionaries, especially when coupled with scale-space dynamics, to improve model recovery and noise handling. These findings underscore the promise of our data-driven approach for advance magnetic data inversion and its applications in geophysical exploration, environmental assessment, and mineral prospecting. The code is publicly available at: https://github.com/ahxmeds/magnetic-inversion-dictionary.git.
△ Less
Submitted 27 February, 2025; v1 submitted 8 February, 2025;
originally announced February 2025.
-
AutoPET Challenge III: Testing the Robustness of Generalized Dice Focal Loss trained 3D Residual UNet for FDG and PSMA Lesion Segmentation from Whole-Body PET/CT Images
Authors:
Shadab Ahamed
Abstract:
Automated segmentation of cancerous lesions in PET/CT scans is a crucial first step in quantitative image analysis. However, training deep learning models for segmentation with high accuracy is particularly challenging due to the variations in lesion size, shape, and radiotracer uptake. These lesions can appear in different parts of the body, often near healthy organs that also exhibit considerabl…
▽ More
Automated segmentation of cancerous lesions in PET/CT scans is a crucial first step in quantitative image analysis. However, training deep learning models for segmentation with high accuracy is particularly challenging due to the variations in lesion size, shape, and radiotracer uptake. These lesions can appear in different parts of the body, often near healthy organs that also exhibit considerable uptake, making the task even more complex. As a result, creating an effective segmentation model for routine PET/CT image analysis is challenging. In this study, we utilized a 3D Residual UNet model and employed the Generalized Dice Focal Loss function to train the model on the AutoPET Challenge 2024 dataset. We conducted a 5-fold cross-validation and used an average ensembling technique using the models from the five folds. In the preliminary test phase for Task-1, the average ensemble achieved a mean Dice Similarity Coefficient (DSC) of 0.6687, mean false negative volume (FNV) of 10.9522 ml and mean false positive volume (FPV) 2.9684 ml. More details about the algorithm can be found on our GitHub repository: https://github.com/ahxmeds/autosegnet2024.git. The training code has been shared via the repository: https://github.com/ahxmeds/autopet2024.git.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
Thyroidiomics: An Automated Pipeline for Segmentation and Classification of Thyroid Pathologies from Scintigraphy Images
Authors:
Maziar Sabouri,
Shadab Ahamed,
Azin Asadzadeh,
Atlas Haddadi Avval,
Soroush Bagheri,
Mohsen Arabi,
Seyed Rasoul Zakavi,
Emran Askari,
Ali Rasouli,
Atena Aghaee,
Mohaddese Sehati,
Fereshteh Yousefirizi,
Carlos Uribe,
Ghasem Hajianfar,
Habib Zaidi,
Arman Rahmim
Abstract:
The objective of this study was to develop an automated pipeline that enhances thyroid disease classification using thyroid scintigraphy images, aiming to decrease assessment time and increase diagnostic accuracy. Anterior thyroid scintigraphy images from 2,643 patients were collected and categorized into diffuse goiter (DG), multinodal goiter (MNG), and thyroiditis (TH) based on clinical reports,…
▽ More
The objective of this study was to develop an automated pipeline that enhances thyroid disease classification using thyroid scintigraphy images, aiming to decrease assessment time and increase diagnostic accuracy. Anterior thyroid scintigraphy images from 2,643 patients were collected and categorized into diffuse goiter (DG), multinodal goiter (MNG), and thyroiditis (TH) based on clinical reports, and then segmented by an expert. A ResUNet model was trained to perform auto-segmentation. Radiomic features were extracted from both physician (scenario 1) and ResUNet segmentations (scenario 2), followed by omitting highly correlated features using Spearman's correlation, and feature selection using Recursive Feature Elimination (RFE) with XGBoost as the core. All models were trained under leave-one-center-out cross-validation (LOCOCV) scheme, where nine instances of algorithms were iteratively trained and validated on data from eight centers and tested on the ninth for both scenarios separately. Segmentation performance was assessed using the Dice similarity coefficient (DSC), while classification performance was assessed using metrics, such as precision, recall, F1-score, accuracy, area under the Receiver Operating Characteristic (ROC AUC), and area under the precision-recall curve (PRC AUC). ResUNet achieved DSC values of 0.84$\pm$0.03, 0.71$\pm$0.06, and 0.86$\pm$0.02 for MNG, TH, and DG, respectively. Classification in scenario 1 achieved an accuracy of 0.76$\pm$0.04 and a ROC AUC of 0.92$\pm$0.02 while in scenario 2, classification yielded an accuracy of 0.74$\pm$0.05 and a ROC AUC of 0.90$\pm$0.02. The automated pipeline demonstrated comparable performance to physician segmentations on several classification metrics across different classes, effectively reducing assessment time while maintaining high diagnostic accuracy. Code available at: https://github.com/ahxmeds/thyroidiomics.git.
△ Less
Submitted 22 July, 2024; v1 submitted 14 July, 2024;
originally announced July 2024.
-
A slice classification neural network for automated classification of axial PET/CT slices from a multi-centric lymphoma dataset
Authors:
Shadab Ahamed,
Yixi Xu,
Ingrid Bloise,
Joo H. O,
Carlos F. Uribe,
Rahul Dodhia,
Juan L. Ferres,
Arman Rahmim
Abstract:
Automated slice classification is clinically relevant since it can be incorporated into medical image segmentation workflows as a preprocessing step that would flag slices with a higher probability of containing tumors, thereby directing physicians attention to the important slices. In this work, we train a ResNet-18 network to classify axial slices of lymphoma PET/CT images (collected from two in…
▽ More
Automated slice classification is clinically relevant since it can be incorporated into medical image segmentation workflows as a preprocessing step that would flag slices with a higher probability of containing tumors, thereby directing physicians attention to the important slices. In this work, we train a ResNet-18 network to classify axial slices of lymphoma PET/CT images (collected from two institutions) depending on whether the slice intercepted a tumor (positive slice) in the 3D image or if the slice did not (negative slice). Various instances of the network were trained on 2D axial datasets created in different ways: (i) slice-level split and (ii) patient-level split; inputs of different types were used: (i) only PET slices and (ii) concatenated PET and CT slices; and different training strategies were employed: (i) center-aware (CAW) and (ii) center-agnostic (CAG). Model performances were compared using the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC), and various binary classification metrics. We observe and describe a performance overestimation in the case of slice-level split as compared to the patient-level split training. The model trained using patient-level split data with the network input containing only PET slices in the CAG training regime was the best performing/generalizing model on a majority of metrics. Our models were additionally more closely compared using the sensitivity metric on the positive slices from their respective test sets.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
A cascaded deep network for automated tumor detection and segmentation in clinical PET imaging of diffuse large B-cell lymphoma
Authors:
Shadab Ahamed,
Natalia Dubljevic,
Ingrid Bloise,
Claire Gowdy,
Patrick Martineau,
Don Wilson,
Carlos F. Uribe,
Arman Rahmim,
Fereshteh Yousefirizi
Abstract:
Accurate detection and segmentation of diffuse large B-cell lymphoma (DLBCL) from PET images has important implications for estimation of total metabolic tumor volume, radiomics analysis, surgical intervention and radiotherapy. Manual segmentation of tumors in whole-body PET images is time-consuming, labor-intensive and operator-dependent. In this work, we develop and validate a fast and efficient…
▽ More
Accurate detection and segmentation of diffuse large B-cell lymphoma (DLBCL) from PET images has important implications for estimation of total metabolic tumor volume, radiomics analysis, surgical intervention and radiotherapy. Manual segmentation of tumors in whole-body PET images is time-consuming, labor-intensive and operator-dependent. In this work, we develop and validate a fast and efficient three-step cascaded deep learning model for automated detection and segmentation of DLBCL tumors from PET images. As compared to a single end-to-end network for segmentation of tumors in whole-body PET images, our three-step model is more effective (improves 3D Dice score from 58.9% to 78.1%) since each of its specialized modules, namely the slice classifier, the tumor detector and the tumor segmentor, can be trained independently to a high degree of skill to carry out a specific task, rather than a single network with suboptimal performance on overall segmentation.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
PyTomography: A Python Library for Medical Image Reconstruction
Authors:
Lucas A. Polson,
Roberto Fedrigo,
Chenguang Li,
Maziar Sabouri,
Obed Dzikunu,
Shadab Ahamed,
Nikolaos Karakatsanis,
Peyman Sheikhzadeh,
Pedro Esquinas,
Arman Rahmim,
Carlos Uribe
Abstract:
There is a need for open-source libraries in emission tomography that (i) use modern and popular backend code to encourage community contributions and (ii) offer support for the multitude of reconstruction techniques available in recent literature, such as those that employ artificial intelligence. The purpose of this research was to create and evaluate a GPU-accelerated, open-source, and user-fri…
▽ More
There is a need for open-source libraries in emission tomography that (i) use modern and popular backend code to encourage community contributions and (ii) offer support for the multitude of reconstruction techniques available in recent literature, such as those that employ artificial intelligence. The purpose of this research was to create and evaluate a GPU-accelerated, open-source, and user-friendly image reconstruction library, designed to serve as a central platform for the development, validation, and deployment of various tomographic reconstruction algorithms. PyTomography was developed using Python and inherits the GPU-accelerated functionality of PyTorch and parallelproj for fast computations. Its flexible and modular design decouples system matrices, likelihoods, and reconstruction algorithms, simplifying the process of integrating new imaging modalities using various python tools. Example use cases demonstrate the software capabilities in parallel hole SPECT and listmode PET imaging. Overall, we have developed and publicly share PyTomography, a highly optimized and user-friendly software for medical image reconstruction, with a class hierarchy that fosters the development of novel imaging applications.
△ Less
Submitted 30 December, 2024; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Addressing computational challenges in physical system simulations with machine learning
Authors:
Sabber Ahamed,
Md Mesbah Uddin
Abstract:
In this paper, we present a machine learning-based data generator framework tailored to aid researchers who utilize simulations to examine various physical systems or processes. High computational costs and the resulting limited data often pose significant challenges to gaining insights into these systems or processes. Our approach involves a two-step process: initially, we train a supervised pred…
▽ More
In this paper, we present a machine learning-based data generator framework tailored to aid researchers who utilize simulations to examine various physical systems or processes. High computational costs and the resulting limited data often pose significant challenges to gaining insights into these systems or processes. Our approach involves a two-step process: initially, we train a supervised predictive model using a limited simulated dataset to predict simulation outcomes. Subsequently, a reinforcement learning agent is trained to generate accurate, simulation-like data by leveraging the supervised model. With this framework, researchers can generate more accurate data and know the outcomes without running high computational simulations, which enables them to explore the parameter space more efficiently and gain deeper insights into physical systems or processes. We demonstrate the effectiveness of the proposed framework by applying it to two case studies, one focusing on earthquake rupture physics and the other on new material development.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Convolutional neural network with a hybrid loss function for fully automated segmentation of lymphoma lesions in FDG PET images
Authors:
Fereshteh Yousefirizi,
Natalia Dubljevic,
Shadab Ahamed,
Ingrid Bloise,
Claire Gowdy,
Joo Hyun O,
Youssef Farag,
Rodrigue de Schaetzen,
Patrick Martineau,
Don Wilson,
Carlos F. Uribe,
Arman Rahmim
Abstract:
Segmentation of lymphoma lesions is challenging due to their varied sizes and locations in whole-body PET scans. This work presents a fully-automated segmentation technique using a multi-center dataset of diffuse large B-cell lymphoma (DLBCL) with heterogeneous characteristics. We utilized a dataset of [18F]FDG-PET scans (n=194) from two different imaging centers, including cases with primary medi…
▽ More
Segmentation of lymphoma lesions is challenging due to their varied sizes and locations in whole-body PET scans. This work presents a fully-automated segmentation technique using a multi-center dataset of diffuse large B-cell lymphoma (DLBCL) with heterogeneous characteristics. We utilized a dataset of [18F]FDG-PET scans (n=194) from two different imaging centers, including cases with primary mediastinal large B-cell lymphoma (PMBCL) (n=104). Automated brain and bladder removal approaches were utilized as preprocessing steps to tackle false positives caused by normal hypermetabolic uptake in these organs. Our segmentation model is a convolutional neural network (CNN) based on a 3D U-Net architecture that includes squeeze and excitation (SE) modules. Hybrid distribution, region, and boundary-based losses (Unified Focal and Mumford-Shah (MS)) were utilized that showed the best performance compared to other combinations (p<0.05). Cross-validation between different centers, DLBCL and PMBCL cases, and three random splits were applied on train/validation data. The ensemble of these six models achieved a Dice similarity coefficient (DSC) of 0.77 +- 0.08 and Hausdorff distance (HD) of 16.5 +-12.5. Our 3D U-net model with SE modules for segmentation with hybrid loss performed significantly better (p<0.05) as compared to the 3D U-Net (without SE modules) using the same loss function (Unified Focal and MS loss) (DSC= 0.64 +-0.21 and HD= 26.3 +- 18.7). Our model can facilitate a fully automated quantification pipeline in a multi-center context that opens the possibility for routine reporting of total metabolic tumor volume (TMTV) and other metrics shown useful for the management of lymphoma.
△ Less
Submitted 10 August, 2022; v1 submitted 30 July, 2022;
originally announced August 2022.
-
Exploring Basement Surface relationship of north-west Bengal Basin using satellite images and tectonic modeling
Authors:
Sabber Ahamed,
Delwar Hossain,
Jahangir Alam
Abstract:
The Bengal basin is one of the thickest sedimentary basins and is being constantly affected by the collision of the Indian plate with the Burma and Tibetan plates. The northwest part of the basin, our study area, is one of the least explored areas where the shallowest faulted basement is present. Controversies exist about the origin of the basement and its role to the formation of surface landform…
▽ More
The Bengal basin is one of the thickest sedimentary basins and is being constantly affected by the collision of the Indian plate with the Burma and Tibetan plates. The northwest part of the basin, our study area, is one of the least explored areas where the shallowest faulted basement is present. Controversies exist about the origin of the basement and its role to the formation of surface landforms. We analyze satellite images, Bouguer anomaly data, and develop a geodynamic model to explore the relationship between the faulted basement and surface landforms. Satellite images and gravity anomalies show a spatial correlation between the surface topography and basement fault structures. The elevated tracts and the low-lying flood plains are located on top of the gravity highs (horsts) and lows (grabens). The geodynamic model suggests that conjugate thrust faults may exist beneath the horsts that push the horst block upward. Our observations suggest the regional compression and basement faults have a more considerable influence on the development of surface landforms such as the uplifted tracts and the low-lying flood plains.
△ Less
Submitted 6 April, 2020;
originally announced April 2020.
-
Estimating uncertainty of earthquake rupture using Bayesian neural network
Authors:
Sabber Ahamed,
Md Mesbah Uddin
Abstract:
Bayesian neural networks (BNN) are the probabilistic model that combines the strengths of both neural network (NN) and stochastic processes. As a result, BNN can combat overfitting and perform well in applications where data is limited. Earthquake rupture study is such a problem where data is insufficient, and scientists have to rely on many trial and error numerical or physical models. Lack of re…
▽ More
Bayesian neural networks (BNN) are the probabilistic model that combines the strengths of both neural network (NN) and stochastic processes. As a result, BNN can combat overfitting and perform well in applications where data is limited. Earthquake rupture study is such a problem where data is insufficient, and scientists have to rely on many trial and error numerical or physical models. Lack of resources and computational expenses, often, it becomes hard to determine the reasons behind the earthquake rupture. In this work, a BNN has been used (1) to combat the small data problem and (2) to find out the parameter combinations responsible for earthquake rupture and (3) to estimate the uncertainty associated with earthquake rupture. Two thousand rupture simulations are used to train and test the model. A simple 2D rupture geometry is considered where the fault has a Gaussian geometric heterogeneity at the center, and eight parameters vary in each simulation. The test F1-score of BNN (0.8334), which is 2.34% higher than plain NN score. Results show that the parameters of rupture propagation have higher uncertainty than the rupture arrest. Normal stresses play a vital role in determining rupture propagation and are also the highest source of uncertainty, followed by the dynamic friction coefficient. Shear stress has a moderate role, whereas the geometric features such as the width and height of the fault are least significant and uncertain.
△ Less
Submitted 11 April, 2023; v1 submitted 21 November, 2019;
originally announced November 2019.
-
Machine Learning Approach to Earthquake Rupture Dynamics
Authors:
Sabber Ahamed,
Eric G. Daub
Abstract:
Simulating dynamic rupture propagation is challenging due to the uncertainties involved in the underlying physics of fault slip, stress conditions, and frictional properties of the fault. A trial and error approach is often used to determine the unknown parameters describing rupture, but running many simulations usually requires human review to determine how to adjust parameter values and is thus…
▽ More
Simulating dynamic rupture propagation is challenging due to the uncertainties involved in the underlying physics of fault slip, stress conditions, and frictional properties of the fault. A trial and error approach is often used to determine the unknown parameters describing rupture, but running many simulations usually requires human review to determine how to adjust parameter values and is thus not very efficient. To reduce the computational cost and improve our ability to determine reasonable stress and friction parameters, we take advantage of the machine learning approach. We develop two models for earthquake rupture propagation using the artificial neural network (ANN) and the random forest (RF) algorithms to predict if a rupture can break a geometric heterogeneity on a fault. We train the models using a database of 1600 dynamic rupture simulations computed numerically. Fault geometry, stress conditions, and friction parameters vary in each simulation. We cross-validate and test the predictive power of the models using an additional 400 simulated ruptures, respectively. Both RF and ANN models predict rupture propagation with more than 81% accuracy, and model parameters can be used to infer the underlying factors most important for rupture propagation. Both of the models are computationally efficient such that the 400 testings require a fraction of a second, leading to potential applications of dynamic rupture that have previously not been possible due to the computational demands of physics-based rupture simulations.
△ Less
Submitted 14 June, 2019;
originally announced June 2019.
-
Incorporating Deformation Energetics in Long-Term Tectonic Modeling
Authors:
Sabber Ahamed,
Eunseo Choi
Abstract:
The deformation-related energy budget is usually considered in the simplest form or even completely omitted from the energy balance equation. We derive a full energy balance equation that accounts not only for heat energy but also for mechanical (elastic, plastic and viscous) work. The derived equation is implemented in DES3D, an unstructured finite element solver for long-term tectonic deformatio…
▽ More
The deformation-related energy budget is usually considered in the simplest form or even completely omitted from the energy balance equation. We derive a full energy balance equation that accounts not only for heat energy but also for mechanical (elastic, plastic and viscous) work. The derived equation is implemented in DES3D, an unstructured finite element solver for long-term tectonic deformation. We verify the implementation by comparing numerical solutions to the corresponding semi-analytic solutions in three benchmarks extended from the classical oedometer test. Two of the benchmarks are designed to evaluate the temperature change in a Mohr-Coulomb elasto-plastic square governed by a simplified equation involving plastic power only and by the full temperature evolution equation, respectively. The third benchmark differs in that it computes thermal stresses associated with a prescribed uniform temperature increase. All the solutions from DES3D show relative error less than 0.1%. We also investigate the long-term effects of deformation energetics on the evolution of large offset normal faults. We find that the models considering the full energy balance equation tend to produce more secondary faults and an elongated core complex. Our results for the normal fault system confirm that persistent inelastic deformation has a significant impact on the long-term evolution of faults, motivating further exploration of the role of the full energy balance equation in other geodynamic systems.
△ Less
Submitted 24 December, 2018;
originally announced December 2018.