Search | arXiv e-print repository

arXiv:2501.13376 [pdf]

Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility

Authors: Gabrielle Hoyer, Michelle W Tong, Rupsa Bhattacharjee, Valentina Pedoia, Sharmila Majumdar

Abstract: Foundation models hold transformative potential for medical imaging, but their clinical utility requires rigorous evaluation to address their strengths and limitations. This study introduces an evaluation framework for assessing the clinical impact and translatability of SAM, MedSAM, and SAM2, using musculoskeletal MRI as a case study. We tested these models across zero-shot and finetuned paradigms to assess their ability to process diverse anatomical structures and effectuate clinically reliable biomarkers, including cartilage thickness, muscle volume, and disc height. We engineered a modular pipeline emphasizing scalability, clinical relevance, and workflow integration, reducing manual effort and aligning validation with end-user expectations. Hierarchical modeling revealed how dataset mixing, anatomical complexity, and MRI acquisition parameters influence performance, providing insights into the role of imaging refinements in improving segmentation accuracy. This work demonstrates how clinically focused evaluations can connect computational advancements with tangible applications, creating a pathway for foundation models to address medical challenges. By emphasizing interdisciplinary collaboration and aligning technical innovation with clinical priorities, our framework provides a roadmap for advancing machine learning technologies into scalable and impactful biomedical solutions.

Submitted 22 January, 2025; originally announced January 2025.

arXiv:2405.16715 [pdf]

Coil Reweighting to Suppress Motion Artifacts in Real-Time Exercise Cine Imaging

Authors: Chong Chen, Yingmin Liu, Yu Ding, Matthew Tong, Preethi Chandrasekaran, Christopher Crabtree, Syed M. Arshad, Yuchi Han, Rizwan Ahmad

Abstract: Background: Accelerated real-time cine (RT-Cine) imaging enables cardiac function assessment without the need for breath-holding. However, when performed during in-magnet exercise, RT-Cine images may exhibit significant motion artifacts. Methods: By projecting the time-averaged images to the subspace spanned by the coil sensitivity maps, we propose a coil reweighting (CR) method to automatically suppress a subset of receive coils that introduces a high level of artifacts in the reconstructed image. RT-Cine data collected at rest and during exercise from ten healthy volunteers and six patients were utilized to assess the performance of the proposed method. One short-axis and one two-chamber RT-Cine series reconstructed with and without CR from each subject were visually scored by two cardiologists in terms of the level of artifacts on a scale of 1 (worst) to 5 (best). Results: For healthy volunteers, applying CR to RT-Cine images collected at rest did not significantly change the image quality score (p=1). In contrast, for RT-Cine images collected during exercise, CR significantly improved the score from 3.9 to 4.68 (p<0.001). Similarly, in patients, CR did not significantly change the score for images collected at rest (p=0.031) but markedly improved the score from 3.15 to 4.42 (p<0.001) for images taken during exercise. Despite lower image quality scores in the patient cohort compared to healthy subjects, likely due to larger body habitus and the difficulty of limiting body motion during exercise, CR effectively suppressed motion artifacts, with all image series from the patient cohort receiving a score of four or higher. Conclusion: Using data from healthy subjects and patients, we demonstrate that the motion artifacts in the reconstructed RT-Cine images can be effectively suppressed significantly with the proposed CR method.

Submitted 26 May, 2024; originally announced May 2024.

arXiv:2402.17877 [pdf, other]

Accelerated Real-time Cine and Flow under In-magnet Staged Exercise

Authors: Preethi Chandrasekaran, Chong Chen, Yingmin Liu, Syed Murtaza Arshad, Christopher Crabtree, Matthew Tong, Yuchi Han, Rizwan Ahmad

Abstract: Background: Cardiovascular magnetic resonance imaging (CMR) is a well established imaging tool for diagnosing and managing cardiac conditions. The integration of exercise stress with CMR (ExCMR) can enhance its diagnostic capacity. Despite recent advances in CMR technology, quantitative ExCMR during exercise remains technically challenging due to motion artifacts and limited spatial and temporal resolution. Methods: This study investigated the feasibility of biventricular functional and hemodynamic assessment using real-time (RT) ExCMR during a staged exercise protocol in 24 healthy volunteers. We employed high acceleration rates and applied a coil reweighting technique to minimize motion blurring and artifacts. We further applied a beat-selection technique that identified beats from the end-expiratory phase to minimize the impact of respiration-induced through-plane motion on cardiac function quantification. Additionally, results from six patients were presented to demonstrate clinical feasibility. Results: Our findings indicated a consistent decrease in end-systolic volume and stable end-diastolic volume across exercise intensities, leading to increased stroke volume and ejection fraction. The selection of end-expiratory beats modestly enhanced the repeatability of cardiac function parameters, as shown by scan-rescan tests in nine volunteers. High scores from a blinded image quality assessment indicated that coil reweighting effectively minimized motion artifacts. Conclusions: This study demonstrated the feasibility of RT ExCMR with in-magnet exercise in healthy subjects and patients. Our results indicate that high acceleration rates, coil reweighting, and selection of respiratory phase-specific heartbeats enhance image quality and repeatability of quantitative RT ExCMR.

Submitted 18 April, 2025; v1 submitted 27 February, 2024; originally announced February 2024.

arXiv:2308.02088 [pdf, other]

doi 10.1002/mrm.30123

Motion-robust free-running volumetric cardiovascular MRI

Authors: Syed M. Arshad, Lee C. Potter, Chong Chen, Yingmin Liu, Preethi Chandrasekaran, Christopher Crabtree, Matthew S. Tong, Orlando P. Simonetti, Yuchi Han, Rizwan Ahmad

Abstract: PURPOSE: To present and assess an outlier mitigation method that makes free-running volumetric cardiovascular MRI (CMR) more robust to motion. METHODS: The proposed method, called compressive recovery with outlier rejection (CORe), models outliers in the measured data as an additive auxiliary variable. We enforce MR physics-guided group sparsity on the auxiliary variable, and jointly estimate it along with the image using an iterative algorithm. For evaluation, CORe is first compared to traditional compressed sensing (CS), robust regression (RR), and an existing outlier rejection method using two simulation studies. Then, CORe is compared to CS using seven three-dimensional (3D) cine, 12 rest four-dimensional (4D) flow, and eight stress 4D flow imaging datasets. RESULTS: Our simulation studies show that CORe outperforms CS, RR, and the existing outlier rejection method in terms of normalized mean square error and structural similarity index across 55 different realizations. The expert reader evaluation of 3D cine images demonstrates that CORe is more effective in suppressing artifacts while maintaining or improving image sharpness. Finally, 4D flow images show that CORe yields more reliable and consistent flow measurements, especially in the presence of involuntary subject motion or exercise stress. CONCLUSION: An outlier rejection method is presented and tested using simulated and measured data. This method can help suppress motion artifacts in a wide range of free-running CMR applications. CODE & DATA: Implementation code and datasets are available on GitHub at http://github.com/OSU-MR/motion-robust-CMR

Submitted 24 June, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

Journal ref: Magnetic Resonance in Medicine 92(3) (2024) 1248-1262

arXiv:2307.09728 [pdf, other]

Uncertainty-Driven Multi-Scale Feature Fusion Network for Real-time Image Deraining

Authors: Ming Tong, Xuefeng Yan, Yongzhen Wang

Abstract: Visual-based measurement systems are frequently affected by rainy weather due to the degradation caused by rain streaks in captured images, and existing imaging devices struggle to address this issue in real-time. While most efforts leverage deep networks for image deraining and have made progress, their large parameter sizes hinder deployment on resource-constrained devices. Additionally, these data-driven models often produce deterministic results, without considering their inherent epistemic uncertainty, which can lead to undesired reconstruction errors. Well-calibrated uncertainty can help alleviate prediction errors and assist measurement devices in mitigating risks and improving usability. Therefore, we propose an Uncertainty-Driven Multi-Scale Feature Fusion Network (UMFFNet) that learns the probability mapping distribution between paired images to estimate uncertainty. Specifically, we introduce an uncertainty feature fusion block (UFFB) that utilizes uncertainty information to dynamically enhance acquired features and focus on blurry regions obscured by rain streaks, reducing prediction errors. In addition, to further boost the performance of UMFFNet, we fused feature information from multiple scales to guide the network for efficient collaborative rain removal. Extensive experiments demonstrate that UMFFNet achieves significant performance improvements with few parameters, surpassing state-of-the-art image deraining methods.

Submitted 18 July, 2023; originally announced July 2023.

arXiv:2209.12266 [pdf, other]

Enforcing safety for vision-based controllers via Control Barrier Functions and Neural Radiance Fields

Authors: Mukun Tong, Charles Dawson, Chuchu Fan

Abstract: To navigate complex environments, robots must increasingly use high-dimensional visual feedback (e.g. images) for control. However, relying on high-dimensional image data to make control decisions raises important questions; particularly, how might we prove the safety of a visual-feedback controller? Control barrier functions (CBFs) are powerful tools for certifying the safety of feedback controllers in the state-feedback setting, but CBFs have traditionally been poorly-suited to visual feedback control due to the need to predict future observations in order to evaluate the barrier function. In this work, we solve this issue by leveraging recent advances in neural radiance fields (NeRFs), which learn implicit representations of 3D scenes and can render images from previously-unseen camera perspectives, to provide single-step visual foresight for a CBF-based controller. This novel combination is able to filter out unsafe actions and intervene to preserve safety. We demonstrate the effect of our controller in real-time simulation experiments where it successfully prevents the robot from taking dangerous actions.

Submitted 28 February, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

Comments: Accepted to ICRA 2023

arXiv:2202.00055 [pdf]

doi 10.1007/s10554-023-02966-z

Cardiac and respiratory motion extraction for MRI using Pilot Tone-a patient study

Authors: Chong Chen, Yingmin Liu, Orlando P. Simonetti, Matthew Tong, Ning Jin, Mario Bacher, Peter Speier, Rizwan Ahmad

Abstract: Background: Several studies have shown that both respiratory and cardiac motion can be extracted from the Pilot Tone (PT) signal successfully. However, most of these studies were performed in healthy volunteers. In addition, validating PT using ECG as a reference can be problematic because both PT and ECG tend to be unreliable in patients with arrhythmias. Purpose: We seek to evaluate the accuracy and reliability of the cardiac and respiratory signals extracted from PT in patients clinically referred for cardiovascular MRI with the image-derived signals as the reference. Methods: Twenty-three patients were scanned on a 1.5 T scanner using balanced steady-state free-precession real-time (RT) cine sequence. The PT signal was generated by a built-in PT transmitter integrated within the body array coil. For comparison, commercial ECG and BioMatrix (BM) respiratory sensor signals were synchronously recorded. Results: The respiratory motion extracted from PT correlated positively with the image-derived respiratory signal in all cases and showed a stronger correlation (absolute coefficient: 0.95 ± 0.09) than BM (0.72 ± 0.24). For the cardiac signal, PT trigger jitter (standard deviation of PT trigger locations relative to ECG triggers) ranged from 6.6 to 83.3 ms, with a median of 21.8 ms. The mean absolute difference between the PT and corresponding ECG cardiac cycle duration was less than 5% of the averaged ECG RR interval for 21 out of 23 patients. Overall, the performance of PT-based trigger extraction was comparable to that of ECG. We did not observe significant linear dependence (p>0.28) of PT delay and PT jitter on the patients' BMI or cardiac cycle duration. Conclusions: This study demonstrates the potential of PT to monitor both respiratory and cardiac motion in patients clinically referred for cardiovascular MRI.

Submitted 9 May, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

arXiv:2008.03410 [pdf, other]

OCMR (v1.0)--Open-Access Multi-Coil k-Space Dataset for Cardiovascular Magnetic Resonance Imaging

Authors: Chong Chen, Yingmin Liu, Philip Schniter, Matthew Tong, Karolina Zareba, Orlando Simonetti, Lee Potter, Rizwan Ahmad

Abstract: Cardiovascular MRI (CMR) is a non-invasive imaging modality that provides excellent soft-tissue contrast without the use of ionizing radiation. Physiological motions and limited speed of MRI data acquisition necessitate development of accelerated methods, which typically rely on undersampling. Recovering diagnostic quality CMR images from highly undersampled data has been an active area of research. Recently, several data acquisition and processing methods have been proposed to accelerate CMR. The availability of data to objectively evaluate and compare different reconstruction methods could expedite innovation and promote clinical translation of these methods. In this work, we introduce an open-access dataset, called OCMR, that provides multi-coil k-space data from 53 fully sampled and 212 prospectively undersampled cardiac cine series.

Submitted 12 August, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

arXiv:2008.03188 [pdf, other]

CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment

Authors: Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee, Kathy Yuet-Sheung Lee, Michael Chi-Fai Tong

Abstract: This paper describes the design and development of CUCHILD, a large-scale Cantonese corpus of child speech. The corpus contains spoken words collected from 1,986 child speakers aged from 3 to 6 years old. The speech materials include 130 words of 1 to 4 syllables in length. The speakers cover both typically developing (TD) children and children with speech disorder. The intended use of the corpus is to support scientific and clinical research, as well as technology development related to child speech assessment. The design of the corpus, including selection of words, participants recruitment, data acquisition process, and data pre-processing are described in detail. The results of acoustical analysis are presented to illustrate the properties of child speech. Potential applications of the corpus in automatic speech recognition, phonological error detection and speaker diarization are also discussed.

Submitted 7 August, 2020; originally announced August 2020.

Comments: Accepted to INTERSPEECH 2020, Shanghai, China

arXiv:2004.08982 [pdf, other]

doi 10.1002/mrm.28491

Fully Self-Gated Whole-Heart 4D Flow Imaging from a Five-Minute Scan

Authors: Aaron Pruitt, Adam Rich, Yingmin Liu, Ning Jin, Lee Potter, Matthew Tong, Saurabh Rajpal, Orlando Simonetti, Rizwan Ahmad

Abstract: Purpose: To develop and validate an acquisition and processing technique that enables fully self-gated 4D flow imaging with whole-heart coverage in a fixed five-minute scan. Theory and Methods: The data are acquired continuously using Cartesian sampling and sorted into respiratory and cardiac bins using the self-gating signal. The reconstruction is performed using a recently proposed Bayesian method called ReVEAL4D. ReVEAL4D is validated using data from eight healthy volunteers and two patients and compared with a compressed sensing technique, L1-SENSE. Results: Healthy subjects -- Compared to 2D phase-contrast MRI (2D-PC), flow quantification from ReVEAL4D shows no significant bias. In contrast, the peak velocity and peak flow rate for L1-SENSE are significantly underestimated. Compared to traditional parallel MRI-based 4D flow imaging, ReVEAL4D demonstrates small but significant biases in net flow and peak flow rate, with no significant bias in peak velocity. All three indices are significantly and more markedly underestimated by L1-SENSE. Patients -- Flow quantification from ReVEAL4D agrees well with the 2D-PC reference. In contrast, L1-SENSE markedly underestimated peak velocity. Conclusions: The combination of highly accelerated five-minute Cartesian acquisition, self-gating, and ReVEAL4D enables whole-heart 4D flow imaging with accurate flow quantification.

Submitted 5 August, 2020; v1 submitted 19 April, 2020; originally announced April 2020.

Showing 1–10 of 10 results for author: Tong, M