Skip to main content

Showing 1–50 of 54 results for author: Prince, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.24160  [pdf, ps, other

    eess.IV cs.CV

    Beyond the LUMIR challenge: The pathway to foundational registration models

    Authors: Junyu Chen, Shuwen Wei, Joel Honkamaa, Pekka Marttinen, Hang Zhang, Min Liu, Yichao Zhou, Zuopeng Tan, Zhuoyuan Wang, Yi Wang, Hongchao Zhou, Shunbo Hu, Yi Zhang, Qian Tao, Lukas Förner, Thomas Wendler, Bailiang Jian, Benedikt Wiestler, Tim Hable, Jin Kim, Dan Ruan, Frederic Madesta, Thilo Sentker, Wiebke Heyer, Lianrui Zuo , et al. (11 additional authors not shown)

    Abstract: Medical image challenges have played a transformative role in advancing the field, catalyzing algorithmic innovation and establishing new performance standards across diverse clinical applications. Image registration, a foundational task in neuroimaging pipelines, has similarly benefited from the Learn2Reg initiative. Building on this foundation, we introduce the Large-scale Unsupervised Brain MRI… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.18365  [pdf, ps, other

    eess.IV cs.CV

    Brightness-Invariant Tracking Estimation in Tagged MRI

    Authors: Zhangxing Bian, Shuwen Wei, Xiao Liang, Yuan-Chiao Lu, Samuel W. Remedios, Fangxu Xing, Jonghye Woo, Dzung L. Pham, Aaron Carass, Philip V. Bayly, Jiachen Zhuo, Ahmed Alshareef, Jerry L. Prince

    Abstract: Magnetic resonance (MR) tagging is an imaging technique for noninvasively tracking tissue motion in vivo by creating a visible pattern of magnetization saturation (tags) that deforms with the tissue. Due to longitudinal relaxation and progression to steady-state, the tags and tissue brightnesses change over time, which makes tracking with optical flow methods error-prone. Although Fourier methods… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Accepted by IPMI 2025

  3. arXiv:2505.15813  [pdf, ps, other

    cs.LG q-bio.NC

    Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex

    Authors: Muquan Yu, Mu Nan, Hossein Adeli, Jacob S. Prince, John A. Pyles, Leila Wehbe, Margaret M. Henderson, Michael J. Tarr, Andrew F. Luo

    Abstract: Understanding functional representations within higher visual cortex is a fundamental question in computational neuroscience. While artificial neural networks pretrained on large-scale datasets exhibit striking representational alignment with human neural responses, learning image-computable models of visual cortex relies on individual-level, large-scale fMRI datasets. The necessity for expensive,… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  4. arXiv:2503.12102  [pdf, other

    cs.CV

    A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI

    Authors: Paula Andrea Pérez-Toro, Tomás Arias-Vergara, Fangxu Xing, Xiaofeng Liu, Maureen Stone, Jiachen Zhuo, Juan Rafael Orozco-Arroyave, Elmar Nöth, Jana Hutter, Jerry L. Prince, Andreas Maier, Jonghye Woo

    Abstract: Understanding the relationship between vocal tract motion during speech and the resulting acoustic signal is crucial for aided clinical assessment and developing personalized treatment and rehabilitation strategies. Toward this goal, we introduce an audio-to-video generation framework for creating Real Time/cine-Magnetic Resonance Imaging (RT-/cine-MRI) visuals of the vocal tract from speech signa… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  5. arXiv:2503.11787  [pdf, ps, other

    cs.CV eess.IV

    ECLARE: Efficient cross-planar learning for anisotropic resolution enhancement

    Authors: Samuel W. Remedios, Shuwen Wei, Shuo Han, Jinwei Zhang, Aaron Carass, Kurt G. Schilling, Dzung L. Pham, Jerry L. Prince, Blake E. Dewey

    Abstract: In clinical imaging, magnetic resonance (MR) image volumes are often acquired as stacks of 2D slices with decreased scan times, improved signal-to-noise ratio, and image contrasts unique to 2D MR pulse sequences. While this is sufficient for clinical evaluation, automated algorithms designed for 3D analysis perform poorly on multi-slice 2D MR volumes, especially those with thick slices and gaps be… ▽ More

    Submitted 21 May, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  6. arXiv:2502.14753  [pdf, ps, other

    eess.IV cs.AI cs.CV

    MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders

    Authors: Maya Varma, Ashwin Kumar, Rogier van der Sluijs, Sophie Ostmeier, Louis Blankemeier, Pierre Chambon, Christian Bluethgen, Jip Prince, Curtis Langlotz, Akshay Chaudhari

    Abstract: Medical images are acquired at high resolutions with large fields of view in order to capture fine-grained features necessary for clinical decision-making. Consequently, training deep learning models on medical images can incur large computational costs. In this work, we address the challenge of downsizing medical images in order to improve downstream computational efficiency while preserving clin… ▽ More

    Submitted 2 June, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

    Comments: MIDL 2025 (Oral)

  7. arXiv:2502.12892  [pdf, other

    cs.CV

    Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

    Authors: Thomas Fel, Ekdeep Singh Lubana, Jacob S. Prince, Matthew Kowal, Victor Boutin, Isabel Papadimitriou, Binxu Wang, Martin Wattenberg, Demba Ba, Talia Konkle

    Abstract: Sparse Autoencoders (SAEs) have emerged as a powerful framework for machine learning interpretability, enabling the unsupervised decomposition of model representations into a dictionary of abstract, human-interpretable concepts. However, we reveal a fundamental limitation: existing SAEs exhibit severe instability, as identical models trained on similar datasets can produce sharply different dictio… ▽ More

    Submitted 23 May, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Journal ref: Proceedings of the 42nd International Conference on Machine Learning (ICML), 2025

  8. arXiv:2501.18834  [pdf

    eess.IV cs.AI cs.CV

    Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential

    Authors: Chenyu Gao, Kaiwen Xu, Michael E. Kim, Lianrui Zuo, Zhiyuan Li, Derek B. Archer, Timothy J. Hohman, Ann Zenobia Moore, Luigi Ferrucci, Lori L. Beason-Held, Susan M. Resnick, Christos Davatzikos, Jerry L. Prince, Bennett A. Landman

    Abstract: Defacing is often applied to head magnetic resonance image (MRI) datasets prior to public release to address privacy concerns. The alteration of facial and nearby voxels has provoked discussions about the true capability of these techniques to ensure privacy as well as their impact on downstream tasks. With advancements in deep generative models, the extent to which defacing can protect privacy is… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  9. arXiv:2410.09639  [pdf, other

    eess.IV cs.CV

    Unique MS Lesion Identification from MRI

    Authors: Carlos A. Rivas, Jinwei Zhang, Shuwen Wei, Samuel W. Remedios, Aaron Carass, Jerry L. Prince

    Abstract: Unique identification of multiple sclerosis (MS) white matter lesions (WMLs) is important to help characterize MS progression. WMLs are routinely identified from magnetic resonance images (MRIs) but the resultant total lesion load does not correlate well with EDSS; whereas mean unique lesion volume has been shown to correlate with EDSS. Our approach builds on prior work by incorporating Hessian ma… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: 5 pages, 5 figures, submitted to SPIE medical imaging conference

  10. arXiv:2407.10209  [pdf, other

    cs.CV

    Vector Field Attention for Deformable Image Registration

    Authors: Yihao Liu, Junyu Chen, Lianrui Zuo, Aaron Carass, Jerry L. Prince

    Abstract: Deformable image registration establishes non-linear spatial correspondences between fixed and moving images. Deep learning-based deformable registration methods have been widely studied in recent years due to their speed advantage over traditional algorithms as well as their better accuracy. Most existing deep learning-based methods require neural networks to encode location information in their… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  11. arXiv:2402.06984  [pdf, other

    cs.SD cs.CV cs.MM eess.AS eess.IV

    Speech motion anomaly detection via cross-modal translation of 4D motion fields from tagged MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Jiachen Zhuo, Maureen Stone, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: Understanding the relationship between tongue motion patterns during speech and their resulting speech acoustic outcomes -- i.e., articulatory-acoustic relation -- is of great importance in assessing speech quality and developing innovative treatment and rehabilitative strategies. This is especially important when evaluating and detecting abnormal articulatory features in patients with speech-rela… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024: Image Processing

  12. arXiv:2401.17571  [pdf, other

    eess.IV cs.CV

    Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?

    Authors: Zhangxing Bian, Ahmed Alshareef, Shuwen Wei, Junyu Chen, Yuli Wang, Jonghye Woo, Dzung L. Pham, Jiachen Zhuo, Aaron Carass, Jerry L. Prince

    Abstract: Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to SPIE Medical Imaging 2024 (oral)

  13. Super-resolution multi-contrast unbiased eye atlases with deep probabilistic refinement

    Authors: Ho Hin Lee, Adam M. Saunders, Michael E. Kim, Samuel W. Remedios, Lucas W. Remedios, Yucheng Tang, Qi Yang, Xin Yu, Shunxing Bao, Chloe Cho, Louise A. Mawn, Tonia S. Rex, Kevin L. Schey, Blake E. Dewey, Jeffrey M. Spraggins, Jerry L. Prince, Yuankai Huo, Bennett A. Landman

    Abstract: Purpose: Eye morphology varies significantly across the population, especially for the orbit and optic nerve. These variations limit the feasibility and robustness of generalizing population-wise features of eye organs to an unbiased spatial reference. Approach: To tackle these limitations, we propose a process for creating high-resolution unbiased eye atlases. First, to restore spatial details… ▽ More

    Submitted 14 November, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Published in SPIE Journal of Medical Imaging (https://doi.org/10.1117/1.JMI.11.6.064004). 27 pages, 6 figures

    Journal ref: J. Med. Imag. 11(6), 064004 (2024)

  14. arXiv:2312.04385  [pdf, other

    eess.IV cs.CV

    AniRes2D: Anisotropic Residual-enhanced Diffusion for 2D MR Super-Resolution

    Authors: Zejun Wu, Samuel W. Remedios, Blake E. Dewey, Aaron Carass, Jerry L. Prince

    Abstract: Anisotropic low-resolution (LR) magnetic resonance (MR) images are fast to obtain but hinder automated processing. We propose to use denoising diffusion probabilistic models (DDPMs) to super-resolve these 2D-acquired LR MR slices. This paper introduces AniRes2D, a novel approach combining DDPM with a residual prediction for 2D super-resolution (SR). Results demonstrate that AniRes2D outperforms se… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted for presentation at SPIE Medical Imaging 2024, Clinical and Biomedical Imaging

  15. arXiv:2312.01460  [pdf, other

    eess.IV cs.CV

    Towards an accurate and generalizable multiple sclerosis lesion segmentation model using self-ensembled lesion fusion

    Authors: Jinwei Zhang, Lianrui Zuo, Blake E. Dewey, Samuel W. Remedios, Dzung L. Pham, Aaron Carass, Jerry L. Prince

    Abstract: Automatic multiple sclerosis (MS) lesion segmentation using multi-contrast magnetic resonance (MR) images provides improved efficiency and reproducibility compared to manual delineation. Current state-of-the-art automatic MS lesion segmentation methods utilize modified U-Net-like architectures. However, in the literature, dedicated architecture modifications were always required to maximize their… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  16. arXiv:2309.14586  [pdf, other

    cs.SD cs.AI cs.CV eess.AS eess.SP

    Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: The tongue's intricate 3D structure, comprising localized functional units, plays a crucial role in the production of speech. When measured using tagged MRI, these functional units exhibit cohesive displacements and derived quantities that facilitate the complex process of speech production. Non-negative matrix factorization-based approaches have been shown to estimate the functional units through… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: MICCAI 2023 (Oral presentation)

  17. arXiv:2309.01782  [pdf, other

    cs.CV cs.AI cs.LG q-bio.NC

    3D View Prediction Models of the Dorsal Visual Stream

    Authors: Gabriel Sarch, Hsiao-Yu Fish Tung, Aria Wang, Jacob Prince, Michael Tarr

    Abstract: Deep neural network representations align well with brain activity in the ventral visual stream. However, the primate visual system has a distinct dorsal processing stream with different functional properties. To test if a model trained to perceive 3D scene geometry aligns better with neural responses in dorsal visual areas, we trained a self-supervised geometry-aware recurrent neural network (GRN… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 2023 Conference on Cognitive Computational Neuroscience

  18. arXiv:2308.02949  [pdf, other

    eess.IV cs.CV physics.med-ph

    MomentaMorph: Unsupervised Spatial-Temporal Registration with Momenta, Shooting, and Correction

    Authors: Zhangxing Bian, Shuwen Wei, Yihao Liu, Junyu Chen, Jiachen Zhuo, Fangxu Xing, Jonghye Woo, Aaron Carass, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging (tMRI) has been employed for decades to measure the motion of tissue undergoing deformation. However, registration-based motion estimation from tMRI is difficult due to the periodic patterns in these images, particularly when the motion is large. With a larger motion the registration approach gets trapped in a local optima, leading to motion estimation errors. We… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by MICCAI Workshop 2023: Time-Series Data Analytics and Learning (MTSAIL)

  19. arXiv:2307.15615  [pdf, other

    eess.IV cs.CV

    A survey on deep learning in medical image registration: new technologies, uncertainty, evaluation metrics, and beyond

    Authors: Junyu Chen, Yihao Liu, Shuwen Wei, Zhangxing Bian, Shalini Subramanian, Aaron Carass, Jerry L. Prince, Yong Du

    Abstract: Deep learning technologies have dramatically reshaped the field of medical image registration over the past decade. The initial developments, such as regression-based and U-Net-based networks, established the foundation for deep learning in image registration. Subsequent progress has been made in various aspects of deep learning-based registration, including similarity measures, deformation regula… ▽ More

    Submitted 1 November, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted to Medical Image Analysis ((c) MedIA). A list of open-sourced code from the papers reviewed has been organized and is available at https://bit.ly/3QgFJ9z

  20. arXiv:2307.05979  [pdf, other

    cs.LG cs.AI cs.CV

    Transformers in Reinforcement Learning: A Survey

    Authors: Pranav Agarwal, Aamer Abdul Rahman, Pierre-Luc St-Charles, Simon J. D. Prince, Samira Ebrahimi Kahou

    Abstract: Transformers have significantly impacted domains like natural language processing, computer vision, and robotics, where they improve performance compared to other neural networks. This survey explores how transformers are used in reinforcement learning (RL), where they are seen as a promising solution for addressing challenges such as unstable training, credit assignment, lack of interpretability,… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 35 pages, 11 figures

  21. arXiv:2305.15239  [pdf, other

    cs.AI cs.CY cs.LG

    Deep Learning and Ethics

    Authors: Travis LaCroix, Simon J. D. Prince

    Abstract: This article appears as chapter 21 of Prince (2023, Understanding Deep Learning); a complete draft of the textbook is available here: http://udlbook.com. This chapter considers potential harms arising from the design and use of AI systems. These include algorithmic bias, lack of explainability, data privacy violations, militarization, fraud, and environmental concerns. The aim is not to provide ad… ▽ More

    Submitted 20 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Copyright in this Work has been licensed exclusively to The MIT Press, https://mitpress.mit.edu, which will be releasing the final version to the public in 2023. All inquiries regarding rights should be addressed to The MIT Press, Rights and Permissions Department

  22. arXiv:2305.14589  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    Attentive Continuous Generative Self-training for Unsupervised Domain Adaptive Medical Image Translation

    Authors: Xiaofeng Liu, Jerry L. Prince, Fangxu Xing, Jiachen Zhuo, Reese Timothy, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Self-training is an important class of unsupervised domain adaptation (UDA) approaches that are used to mitigate the problem of domain shift, when applying knowledge learned from a labeled source domain to unlabeled and heterogeneous target domains. While self-training-based UDA has shown considerable promise on discriminative tasks, including classification and segmentation, through reliable pseu… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Medical Image Analysis

  23. arXiv:2302.07203  [pdf, other

    eess.IV cs.CV cs.SD eess.AS eess.SP

    Synthesizing audio from tongue motion during speech using tagged MRI via transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Investigating the relationship between internal tissue point motion of the tongue and oropharyngeal muscle deformation measured from tagged MRI and intelligible speech can aid in advancing speech motor control theories and developing novel treatment methods for speech related-disorders. However, elucidating the relationship between these two sources of information is challenging, due in part to th… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: SPIE Medical Imaging: Deep Dive Oral

  24. arXiv:2302.00528  [pdf, other

    eess.IV cs.CV

    A latent space for unsupervised MR image quality control via artifact assessment

    Authors: Lianrui Zuo, Yuan Xue, Blake E. Dewey, Yihao Liu, Jerry L. Prince, Aaron Carass

    Abstract: Image quality control (IQC) can be used in automated magnetic resonance (MR) image analysis to exclude erroneous results caused by poorly acquired or artifact-laden images. Existing IQC methods for MR imaging generally require human effort to craft meaningful features or label large datasets for supervised training. The involvement of human labor can be burdensome and biased, as labeling MR images… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted at the International Society for Optics and Photonics - Medical Imaging (SPIE-MI) 2023

  25. arXiv:2301.07234  [pdf, other

    eess.IV cs.CV

    DRIMET: Deep Registration for 3D Incompressible Motion Estimation in Tagged-MRI with Application to the Tongue

    Authors: Zhangxing Bian, Fangxu Xing, Jinglun Yu, Muhan Shao, Yihao Liu, Aaron Carass, Jiachen Zhuo, Jonghye Woo, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging~(MRI) has been used for decades to observe and quantify the detailed motion of deforming tissue. However, this technique faces several challenges such as tag fading, large motion, long computation times, and difficulties in obtaining diffeomorphic incompressible flow fields. To address these issues, this paper presents a novel unsupervised phase-based 3D motion es… ▽ More

    Submitted 30 April, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted to MIDL 2023 (oral)

  26. arXiv:2301.06114  [pdf, other

    eess.IV cs.LG

    Segmenting thalamic nuclei from manifold projections of multi-contrast MRI

    Authors: Chang Yan, Muhan Shao, Zhangxing Bian, Anqi Feng, Yuan Xue, Jiachen Zhuo, Rao P. Gullapalli, Aaron Carass, Jerry L. Prince

    Abstract: The thalamus is a subcortical gray matter structure that plays a key role in relaying sensory and motor signals within the brain. Its nuclei can atrophy or otherwise be affected by neurological disease and injuries including mild traumatic brain injury. Segmenting both the thalamus and its nuclei is challenging because of the relatively low contrast within and around the thalamus in conventional m… ▽ More

    Submitted 31 January, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 figures, 2023 SPIE-MI Image Processing

  27. arXiv:2212.06065  [pdf, other

    eess.IV cs.CV

    HACA3: A Unified Approach for Multi-site MR Image Harmonization

    Authors: Lianrui Zuo, Yihao Liu, Yuan Xue, Blake E. Dewey, Samuel W. Remedios, Savannah P. Hays, Murat Bilgel, Ellen M. Mowry, Scott D. Newsome, Peter A. Calabresi, Susan M. Resnick, Jerry L. Prince, Aaron Carass

    Abstract: The lack of standardization is a prominent issue in magnetic resonance (MR) imaging. This often causes undesired contrast variations in the acquired images due to differences in hardware and acquisition parameters. In recent years, image synthesis-based MR harmonization with disentanglement has been proposed to compensate for the undesired contrast variations. Despite the success of existing metho… ▽ More

    Submitted 25 April, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

  28. arXiv:2209.02611  [pdf, other

    eess.IV cs.CV

    Deep filter bank regression for super-resolution of anisotropic MR brain images

    Authors: Samuel W. Remedios, Shuo Han, Yuan Xue, Aaron Carass, Trac D. Tran, Dzung L. Pham, Jerry L. Prince

    Abstract: In 2D multi-slice magnetic resonance (MR) acquisition, the through-plane signals are typically of lower resolution than the in-plane signals. While contemporary super-resolution (SR) methods aim to recover the underlying high-resolution volume, the estimated high-frequency information is implicit via end-to-end data-driven training rather than being explicitly stated and sought. To address this, w… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  29. arXiv:2206.02284  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Jiachen Zhuo, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Understanding the underlying relationship between tongue and oropharyngeal muscle deformation seen in tagged-MRI and intelligible speech plays an important role in advancing speech motor control theories and treatment of speech related-disorders. Because of their heterogeneous representations, however, direct mapping between the two modalities -- i.e., two-dimensional (mid-sagittal slice) plus tim… ▽ More

    Submitted 25 September, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022 (early accept, Oral Presentation ~3%)

  30. arXiv:2205.04982  [pdf, other

    eess.IV cs.CV cs.LG

    Disentangling A Single MR Modality

    Authors: Lianrui Zuo, Yihao Liu, Yuan Xue, Shuo Han, Murat Bilgel, Susan M. Resnick, Jerry L. Prince, Aaron Carass

    Abstract: Disentangling anatomical and contrast information from medical images has gained attention recently, demonstrating benefits for various image analysis tasks. Current methods learn disentangled representations using either paired multi-modal images with the same underlying anatomy or auxiliary labels (e.g., manual delineations) to provide inductive bias for disentanglement. However, these requireme… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  31. arXiv:2203.03626  [pdf, other

    eess.IV cs.CV

    Coordinate Translator for Learning Deformable Medical Image Registration

    Authors: Yihao Liu, Lianrui Zuo, Shuo Han, Yuan Xue, Jerry L. Prince, Aaron Carass

    Abstract: The majority of deep learning (DL) based deformable image registration methods use convolutional neural networks (CNNs) to estimate displacement fields from pairs of moving and fixed images. This, however, requires the convolutional kernels in the CNN to not only extract intensity features from the inputs but also understand image coordinate systems. We argue that the latter task is challenging fo… ▽ More

    Submitted 31 July, 2022; v1 submitted 5 March, 2022; originally announced March 2022.

  32. arXiv:2202.12474  [pdf, other

    eess.IV cs.CV cs.LG

    Structure-aware Unsupervised Tagged-to-Cine MRI Synthesis with Self Disentanglement

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Cycle reconstruction regularized adversarial training -- e.g., CycleGAN, DiscoGAN, and DualGAN -- has been widely used for image style transfer with unpaired training data. Several recent works, however, have shown that local distortions are frequent, and structural consistency cannot be guaranteed. Targeting this issue, prior works usually relied on additional segmentation or consistent feature e… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: SPIE Medical Imaging: Image Processing (Oral presentation)

  33. arXiv:2106.12499  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis

    Authors: Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Reese Timothy, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: Self-training based unsupervised domain adaptation (UDA) has shown great potential to address the problem of domain shift, when applying a trained deep learning model in a source domain to unlabeled target domains. However, while the self-training UDA has demonstrated its effectiveness on discriminative tasks, such as classification and segmentation, via the reliable pseudo-label selection based o… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: MICCAI 2021 (early accept <13%)

  34. arXiv:2104.00100  [pdf, other

    cs.CV eess.IV

    MR Slice Profile Estimation by Learning to Match Internal Patch Distributions

    Authors: Shuo Han, Samuel Remedios, Aaron Carass, Michael Schär, Jerry L. Prince

    Abstract: To super-resolve the through-plane direction of a multi-slice 2D magnetic resonance (MR) image, its slice selection profile can be used as the degeneration model from high resolution (HR) to low resolution (LR) to create paired data when training a supervised algorithm. Existing super-resolution algorithms make assumptions about the slice selection profile since it is not readily known for a given… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: 12 pages, 6 figures, accepted by Information Processing in Medical Imaging (IPMI) 2021

  35. arXiv:2103.13283  [pdf, other

    eess.IV cs.CV cs.LG

    Information-based Disentangled Representation Learning for Unsupervised MR Harmonization

    Authors: Lianrui Zuo, Blake E. Dewey, Aaron Carass, Yihao Liu, Yufan He, Peter A. Calabresi, Jerry L. Prince

    Abstract: Accuracy and consistency are two key factors in computer-assisted magnetic resonance (MR) image analysis. However, contrast variation from site to site caused by lack of standardization in MR acquisition impedes consistent measurements. In recent years, image harmonization approaches have been proposed to compensate for contrast variation in MR images. Current harmonization approaches either requi… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted in the 27th International Conference on Information Processing in Medical Imaging (IPMI 2021)

  36. arXiv:2103.03158  [pdf, other

    cs.CV cs.LG eess.IV stat.AP

    A Structural Causal Model for MR Images of Multiple Sclerosis

    Authors: Jacob C. Reinhold, Aaron Carass, Jerry L. Prince

    Abstract: Precision medicine involves answering counterfactual questions such as "Would this patient respond better to treatment A or treatment B?" These types of questions are causal in nature and require the tools of causal inference to be answered, e.g., with a structural causal model (SCM). In this work, we develop an SCM that models the interaction between demographic information, disease covariates, a… ▽ More

    Submitted 13 July, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: MICCAI 2021

  37. Dual-cycle Constrained Bijective VAE-GAN For Tagged-to-Cine Magnetic Resonance Image Synthesis

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Aaron Carass, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Tagged magnetic resonance imaging (MRI) is a widely used imaging technique for measuring tissue deformation in moving organs. Due to tagged MRI's intrinsic low anatomical resolution, another matching set of cine MRI with higher resolution is sometimes acquired in the same scanning session to facilitate tissue segmentation, thus adding extra time and cost. To mitigate this, in this work, we propose… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Accepted to IEEE International Symposium on Biomedical Imaging (ISBI) 2021

    Journal ref: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)

  38. arXiv:2012.15355  [pdf, other

    cs.CL cs.LG

    Optimizing Deeper Transformers on Small Datasets

    Authors: Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie Chi Kit Cheung, Simon J. D. Prince, Yanshuai Cao

    Abstract: It is a common belief that training deep transformers from scratch requires large datasets. Consequently, for small datasets, people usually use shallow and simple additional layers on top of pre-trained models during fine-tuning. This work shows that this does not always need to be the case: with proper initialization and optimization, the benefits of very deep transformers can carry over to chal… ▽ More

    Submitted 31 May, 2021; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: Accepted at ACL 2021 main conference

  39. A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises

    Authors: S. Kevin Zhou, Hayit Greenspan, Christos Davatzikos, James S. Duncan, Bram van Ginneken, Anant Madabhushi, Jerry L. Prince, Daniel Rueckert, Ronald M. Summers

    Abstract: Since its renaissance, deep learning has been widely used in various medical imaging tasks and has achieved remarkable success in many medical imaging applications, thereby propelling us into the so-called artificial intelligence (AI) era. It is known that the success of AI is mostly attributed to the availability of big data with annotations for a single task and the advances in high performance… ▽ More

    Submitted 5 March, 2021; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: 20 pages, 7 figures

    Journal ref: Proceedings of the IEEE (2021)

  40. arXiv:2007.04865  [pdf, other

    cs.CV eess.IV

    A Deep Joint Sparse Non-negative Matrix Factorization Framework for Identifying the Common and Subject-specific Functional Units of Tongue Motion During Speech

    Authors: Jonghye Woo, Fangxu Xing, Jerry L. Prince, Maureen Stone, Arnold Gomez, Timothy G. Reese, Van J. Wedeen, Georges El Fakhri

    Abstract: Intelligible speech is produced by creating varying internal local muscle groupings -- i.e., functional units -- that are generated in a systematic and coordinated manner. There are two major challenges in characterizing and analyzing functional units.~First, due to the complex and convoluted nature of tongue structure and function, it is of great importance to develop a method that can accurately… ▽ More

    Submitted 6 June, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted by Medical Image Analysis

  41. arXiv:2007.03162  [pdf, other

    cs.CV cs.LG eess.IV

    Self domain adapted network

    Authors: Yufan He, Aaron Carass, Lianrui Zuo, Blake E. Dewey, Jerry L. Prince

    Abstract: Domain shift is a major problem for deploying deep networks in clinical practice. Network performance drops significantly with (target) images obtained differently than its (source) training data. Due to a lack of target label data, most work has focused on unsupervised domain adaptation (UDA). Current UDA methods need both source and target data to train models which perform image translation (ha… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: early accept in miccai2020

  42. arXiv:2002.04639  [pdf, other

    eess.IV cs.CV cs.LG

    Validating uncertainty in medical image translation

    Authors: Jacob C. Reinhold, Yufan He, Shizhong Han, Yunqiang Chen, Dashan Gao, Junghoon Lee, Jerry L. Prince, Aaron Carass

    Abstract: Medical images are increasingly used as input to deep neural networks to produce quantitative values that aid researchers and clinicians. However, standard deep neural networks do not provide a reliable measure of uncertainty in those quantitative values. Recent work has shown that using dropout during training and testing can provide estimates of uncertainty. In this work, we investigate using dr… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: IEEE ISBI 2020

  43. arXiv:2002.04626  [pdf, other

    eess.IV cs.CV cs.LG

    Finding novelty with uncertainty

    Authors: Jacob C. Reinhold, Yufan He, Shizhong Han, Yunqiang Chen, Dashan Gao, Junghoon Lee, Jerry L. Prince, Aaron Carass

    Abstract: Medical images are often used to detect and characterize pathology and disease; however, automatically identifying and segmenting pathology in medical images is challenging because the appearance of pathology across diseases varies widely. To address this challenge, we propose a Bayesian deep learning method that learns to translate healthy computed tomography images to magnetic resonance images a… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: SPIE Medical Imaging 2020

  44. arXiv:1912.05345  [pdf, other

    eess.SP cs.CV cs.LG

    Severity Detection Tool for Patients with Infectious Disease

    Authors: Girmaw Abebe Tadesse, Tingting Zhu, Nhan Le Nguyen Thanh, Nguyen Thanh Hung, Ha Thi Hai Duong, Truong Huu Khanh, Pham Van Quang, Duc Duong Tran, LamMinh Yen, H Rogier Van Doorn, Nguyen Van Hao, John Prince, Hamza Javed, DaniKiyasseh, Le Van Tan, Louise Thwaites, David A. Clifton

    Abstract: Hand, foot and mouth disease (HFMD) and tetanus are serious infectious diseases in low and middle income countries. Tetanus in particular has a high mortality rate and its treatment is resource-demanding. Furthermore, HFMD often affects a large number of infants and young children. As a result, its treatment consumes enormous healthcare resources, especially when outbreaks occur. Autonomic nervous… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

  45. Normalizing Flows: An Introduction and Review of Current Methods

    Authors: Ivan Kobyzev, Simon J. D. Prince, Marcus A. Brubaker

    Abstract: Normalizing Flows are generative models which produce tractable distributions where both sampling and density evaluation can be efficient and exact. The goal of this survey article is to give a coherent and comprehensive review of the literature around the construction and use of Normalizing Flows for distribution learning. We aim to provide context and explanation of the models, review current st… ▽ More

    Submitted 5 June, 2020; v1 submitted 25 August, 2019; originally announced August 2019.

    Comments: This paper appears in: IEEE Transactions on Pattern Analysis and Machine Intelligence On page(s): 1-16 Print ISSN: 0162-8828 Online ISSN: 0162-8828

  46. arXiv:1812.04652  [pdf, other

    cs.CV

    Evaluating the Impact of Intensity Normalization on MR Image Synthesis

    Authors: Jacob C. Reinhold, Blake E. Dewey, Aaron Carass, Jerry L. Prince

    Abstract: Image synthesis learns a transformation from the intensity features of an input image to yield a different tissue contrast of the output image. This process has been shown to have application in many medical image analysis tasks including imputation, registration, and segmentation. To carry out synthesis, the intensities of the input images are typically scaled--i.e., normalized--both in training… ▽ More

    Submitted 11 December, 2018; originally announced December 2018.

    Comments: SPIE Medical Imaging 2019

  47. arXiv:1809.04536  [pdf, other

    cs.CV

    Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN

    Authors: Heran Yang, Jian Sun, Aaron Carass, Can Zhao, Junghoon Lee, Zongben Xu, Jerry Prince

    Abstract: The cycleGAN is becoming an influential method in medical image synthesis. However, due to a lack of direct constraints between input and synthetic images, the cycleGAN cannot guarantee structural consistency between these two images, and such consistency is of extreme importance in medical imaging. To overcome this, we propose a structure-constrained cycleGAN for brain MR-to-CT synthesis using un… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: 8 pages, 5 figures, accepted by MICCAI 2018 Workshop: Deep Learning in Medical Image Analysis (DLMIA)

  48. A Sparse Non-negative Matrix Factorization Framework for Identifying Functional Units of Tongue Behavior from MRI

    Authors: Jonghye Woo, Jerry L. Prince, Maureen Stone, Fangxu Xing, Arnold Gomez, Jordan R. Green, Christopher J. Hartnick, Thomas J. Brady, Timothy G. Reese, Van J. Wedeen, Georges El Fakhri

    Abstract: Muscle coordination patterns of lingual behaviors are synergies generated by deforming local muscle groups in a variety of ways. Functional units are functional muscle groups of local structural elements within the tongue that compress, expand, and move in a cohesive and consistent manner. Identifying the functional units using tagged-Magnetic Resonance Imaging (MRI) sheds light on the mechanisms… ▽ More

    Submitted 29 September, 2018; v1 submitted 15 April, 2018; originally announced April 2018.

    Comments: Accepted at IEEE TMI (https://ieeexplore.ieee.org/document/8467354)

  49. arXiv:1803.06629  [pdf, other

    cs.CV

    Cross-modality image synthesis from unpaired data using CycleGAN: Effects of gradient consistency loss and training data size

    Authors: Yuta Hiasa, Yoshito Otake, Masaki Takao, Takumi Matsuoka, Kazuma Takashima, Jerry L. Prince, Nobuhiko Sugano, Yoshinobu Sato

    Abstract: CT is commonly used in orthopedic procedures. MRI is used along with CT to identify muscle structures and diagnose osteonecrosis due to its superior soft tissue contrast. However, MRI has poor contrast for bone structures. Clearly, it would be helpful if a corresponding CT were available, as bone boundaries are more clearly seen and CT has standardized (i.e., Hounsfield) units. Therefore, we aim a… ▽ More

    Submitted 31 July, 2018; v1 submitted 18 March, 2018; originally announced March 2018.

    Comments: 10 pages, 7 figures, MICCAI 2018 Workshop on Simulation and Synthesis in Medical Imaging

  50. arXiv:1803.05120  [pdf, other

    cs.CV

    Topology guaranteed segmentation of the human retina from OCT using convolutional neural networks

    Authors: Yufan He, Aaron Carass, Bruno M. Jedynak, Sharon D. Solomon, Shiv Saidha, Peter A. Calabresi, Jerry L. Prince

    Abstract: Optical coherence tomography (OCT) is a noninvasive imaging modality which can be used to obtain depth images of the retina. The changing layer thicknesses can thus be quantified by analyzing these OCT images, moreover these changes have been shown to correlate with disease progression in multiple sclerosis. Recent automated retinal layer segmentation tools use machine learning methods to perform… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.