Skip to main content

Showing 1–50 of 50 results for author: Prince, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3278 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2505.24160  [pdf, ps, other

    eess.IV cs.CV

    Beyond the LUMIR challenge: The pathway to foundational registration models

    Authors: Junyu Chen, Shuwen Wei, Joel Honkamaa, Pekka Marttinen, Hang Zhang, Min Liu, Yichao Zhou, Zuopeng Tan, Zhuoyuan Wang, Yi Wang, Hongchao Zhou, Shunbo Hu, Yi Zhang, Qian Tao, Lukas Förner, Thomas Wendler, Bailiang Jian, Benedikt Wiestler, Tim Hable, Jin Kim, Dan Ruan, Frederic Madesta, Thilo Sentker, Wiebke Heyer, Lianrui Zuo , et al. (11 additional authors not shown)

    Abstract: Medical image challenges have played a transformative role in advancing the field, catalyzing algorithmic innovation and establishing new performance standards across diverse clinical applications. Image registration, a foundational task in neuroimaging pipelines, has similarly benefited from the Learn2Reg initiative. Building on this foundation, we introduce the Large-scale Unsupervised Brain MRI… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  3. arXiv:2505.18365  [pdf, ps, other

    eess.IV cs.CV

    Brightness-Invariant Tracking Estimation in Tagged MRI

    Authors: Zhangxing Bian, Shuwen Wei, Xiao Liang, Yuan-Chiao Lu, Samuel W. Remedios, Fangxu Xing, Jonghye Woo, Dzung L. Pham, Aaron Carass, Philip V. Bayly, Jiachen Zhuo, Ahmed Alshareef, Jerry L. Prince

    Abstract: Magnetic resonance (MR) tagging is an imaging technique for noninvasively tracking tissue motion in vivo by creating a visible pattern of magnetization saturation (tags) that deforms with the tissue. Due to longitudinal relaxation and progression to steady-state, the tags and tissue brightnesses change over time, which makes tracking with optical flow methods error-prone. Although Fourier methods… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Accepted by IPMI 2025

  4. arXiv:2503.12102  [pdf, other

    cs.CV

    A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI

    Authors: Paula Andrea Pérez-Toro, Tomás Arias-Vergara, Fangxu Xing, Xiaofeng Liu, Maureen Stone, Jiachen Zhuo, Juan Rafael Orozco-Arroyave, Elmar Nöth, Jana Hutter, Jerry L. Prince, Andreas Maier, Jonghye Woo

    Abstract: Understanding the relationship between vocal tract motion during speech and the resulting acoustic signal is crucial for aided clinical assessment and developing personalized treatment and rehabilitation strategies. Toward this goal, we introduce an audio-to-video generation framework for creating Real Time/cine-Magnetic Resonance Imaging (RT-/cine-MRI) visuals of the vocal tract from speech signa… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  5. arXiv:2503.11787  [pdf, ps, other

    cs.CV eess.IV

    ECLARE: Efficient cross-planar learning for anisotropic resolution enhancement

    Authors: Samuel W. Remedios, Shuwen Wei, Shuo Han, Jinwei Zhang, Aaron Carass, Kurt G. Schilling, Dzung L. Pham, Jerry L. Prince, Blake E. Dewey

    Abstract: In clinical imaging, magnetic resonance (MR) image volumes are often acquired as stacks of 2D slices with decreased scan times, improved signal-to-noise ratio, and image contrasts unique to 2D MR pulse sequences. While this is sufficient for clinical evaluation, automated algorithms designed for 3D analysis perform poorly on multi-slice 2D MR volumes, especially those with thick slices and gaps be… ▽ More

    Submitted 21 May, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  6. arXiv:2501.18834  [pdf

    eess.IV cs.AI cs.CV

    Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential

    Authors: Chenyu Gao, Kaiwen Xu, Michael E. Kim, Lianrui Zuo, Zhiyuan Li, Derek B. Archer, Timothy J. Hohman, Ann Zenobia Moore, Luigi Ferrucci, Lori L. Beason-Held, Susan M. Resnick, Christos Davatzikos, Jerry L. Prince, Bennett A. Landman

    Abstract: Defacing is often applied to head magnetic resonance image (MRI) datasets prior to public release to address privacy concerns. The alteration of facial and nearby voxels has provoked discussions about the true capability of these techniques to ensure privacy as well as their impact on downstream tasks. With advancements in deep generative models, the extent to which defacing can protect privacy is… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  7. arXiv:2412.12119  [pdf, other

    cs.AI cs.CL cs.LG

    Mastering Board Games by External and Internal Planning with Language Models

    Authors: John Schultz, Jakub Adamek, Matej Jusup, Marc Lanctot, Michael Kaisers, Sarah Perrin, Daniel Hennes, Jeremy Shar, Cannada Lewis, Anian Ruoss, Tom Zahavy, Petar Veličković, Laurel Prince, Satinder Singh, Eric Malmi, Nenad Tomašev

    Abstract: Advancing planning and reasoning capabilities of Large Language Models (LLMs) is one of the key prerequisites towards unlocking their potential for performing reliably in complex and impactful domains. In this paper, we aim to demonstrate this across board games (Chess, Fischer Random / Chess960, Connect Four, and Hex), and we show that search-based planning can yield significant improvements in L… ▽ More

    Submitted 22 May, 2025; v1 submitted 2 December, 2024; originally announced December 2024.

    Comments: 70 pages, 10 figures

  8. arXiv:2410.09639  [pdf, other

    eess.IV cs.CV

    Unique MS Lesion Identification from MRI

    Authors: Carlos A. Rivas, Jinwei Zhang, Shuwen Wei, Samuel W. Remedios, Aaron Carass, Jerry L. Prince

    Abstract: Unique identification of multiple sclerosis (MS) white matter lesions (WMLs) is important to help characterize MS progression. WMLs are routinely identified from magnetic resonance images (MRIs) but the resultant total lesion load does not correlate well with EDSS; whereas mean unique lesion volume has been shown to correlate with EDSS. Our approach builds on prior work by incorporating Hessian ma… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: 5 pages, 5 figures, submitted to SPIE medical imaging conference

  9. Classification performance and reproducibility of GPT-4 omni for information extraction from veterinary electronic health records

    Authors: Judit M Wulcan, Kevin L Jacques, Mary Ann Lee, Samantha L Kovacs, Nicole Dausend, Lauren E Prince, Jonatan Wulcan, Sina Marsilio, Stefan M Keller

    Abstract: Large language models (LLMs) can extract information from veterinary electronic health records (EHRs), but performance differences between models, the effect of temperature settings, and the influence of text ambiguity have not been previously evaluated. This study addresses these gaps by comparing the performance of GPT-4 omni (GPT-4o) and GPT-3.5 Turbo under different conditions and investigatin… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: 24 pages, 3 figures, 8 supplementary figures

    Journal ref: Frontiers in Veterinary Science, Vol. 11, 2025

  10. arXiv:2407.17465  [pdf, other

    cs.LG

    u-$μ$P: The Unit-Scaled Maximal Update Parametrization

    Authors: Charlie Blake, Constantin Eichenberg, Josef Dean, Lukas Balles, Luke Y. Prince, Björn Deiseroth, Andres Felipe Cruz-Salinas, Carlo Luschi, Samuel Weinbach, Douglas Orr

    Abstract: The Maximal Update Parametrization ($μ$P) aims to make the optimal hyperparameters (HPs) of a model independent of its size, allowing them to be swept using a cheap proxy model rather than the full-size target model. We present a new scheme, u-$μ$P, which improves upon $μ$P by combining it with Unit Scaling, a method for designing models that makes them easy to train in low-precision. The two tech… ▽ More

    Submitted 10 January, 2025; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: 55 pages

  11. arXiv:2407.10209  [pdf, other

    cs.CV

    Vector Field Attention for Deformable Image Registration

    Authors: Yihao Liu, Junyu Chen, Lianrui Zuo, Aaron Carass, Jerry L. Prince

    Abstract: Deformable image registration establishes non-linear spatial correspondences between fixed and moving images. Deep learning-based deformable registration methods have been widely studied in recent years due to their speed advantage over traditional algorithms as well as their better accuracy. Most existing deep learning-based methods require neural networks to encode location information in their… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  12. arXiv:2402.06984  [pdf, other

    cs.SD cs.CV cs.MM eess.AS eess.IV

    Speech motion anomaly detection via cross-modal translation of 4D motion fields from tagged MRI

    Authors: Xiaofeng Liu, Fangxu Xing, Jiachen Zhuo, Maureen Stone, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: Understanding the relationship between tongue motion patterns during speech and their resulting speech acoustic outcomes -- i.e., articulatory-acoustic relation -- is of great importance in assessing speech quality and developing innovative treatment and rehabilitative strategies. This is especially important when evaluating and detecting abnormal articulatory features in patients with speech-rela… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: SPIE Medical Imaging 2024: Image Processing

  13. arXiv:2401.17571  [pdf, other

    eess.IV cs.CV

    Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?

    Authors: Zhangxing Bian, Ahmed Alshareef, Shuwen Wei, Junyu Chen, Yuli Wang, Jonghye Woo, Dzung L. Pham, Jiachen Zhuo, Aaron Carass, Jerry L. Prince

    Abstract: Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to SPIE Medical Imaging 2024 (oral)

  14. Super-resolution multi-contrast unbiased eye atlases with deep probabilistic refinement

    Authors: Ho Hin Lee, Adam M. Saunders, Michael E. Kim, Samuel W. Remedios, Lucas W. Remedios, Yucheng Tang, Qi Yang, Xin Yu, Shunxing Bao, Chloe Cho, Louise A. Mawn, Tonia S. Rex, Kevin L. Schey, Blake E. Dewey, Jeffrey M. Spraggins, Jerry L. Prince, Yuankai Huo, Bennett A. Landman

    Abstract: Purpose: Eye morphology varies significantly across the population, especially for the orbit and optic nerve. These variations limit the feasibility and robustness of generalizing population-wise features of eye organs to an unbiased spatial reference. Approach: To tackle these limitations, we propose a process for creating high-resolution unbiased eye atlases. First, to restore spatial details… ▽ More

    Submitted 14 November, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Published in SPIE Journal of Medical Imaging (https://doi.org/10.1117/1.JMI.11.6.064004). 27 pages, 6 figures

    Journal ref: J. Med. Imag. 11(6), 064004 (2024)

  15. arXiv:2312.04385  [pdf, other

    eess.IV cs.CV

    AniRes2D: Anisotropic Residual-enhanced Diffusion for 2D MR Super-Resolution

    Authors: Zejun Wu, Samuel W. Remedios, Blake E. Dewey, Aaron Carass, Jerry L. Prince

    Abstract: Anisotropic low-resolution (LR) magnetic resonance (MR) images are fast to obtain but hinder automated processing. We propose to use denoising diffusion probabilistic models (DDPMs) to super-resolve these 2D-acquired LR MR slices. This paper introduces AniRes2D, a novel approach combining DDPM with a residual prediction for 2D super-resolution (SR). Results demonstrate that AniRes2D outperforms se… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted for presentation at SPIE Medical Imaging 2024, Clinical and Biomedical Imaging

  16. arXiv:2312.01460  [pdf, other

    eess.IV cs.CV

    Towards an accurate and generalizable multiple sclerosis lesion segmentation model using self-ensembled lesion fusion

    Authors: Jinwei Zhang, Lianrui Zuo, Blake E. Dewey, Samuel W. Remedios, Dzung L. Pham, Aaron Carass, Jerry L. Prince

    Abstract: Automatic multiple sclerosis (MS) lesion segmentation using multi-contrast magnetic resonance (MR) images provides improved efficiency and reproducibility compared to manual delineation. Current state-of-the-art automatic MS lesion segmentation methods utilize modified U-Net-like architectures. However, in the literature, dedicated architecture modifications were always required to maximize their… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  17. arXiv:2310.10553  [pdf, other

    cs.LG cs.MA stat.ML

    TacticAI: an AI assistant for football tactics

    Authors: Zhe Wang, Petar Veličković, Daniel Hennes, Nenad Tomašev, Laurel Prince, Michael Kaisers, Yoram Bachrach, Romuald Elie, Li Kevin Wenliang, Federico Piccinini, William Spearman, Ian Graham, Jerome Connor, Yi Yang, Adrià Recasens, Mina Khan, Nathalie Beauguerlange, Pablo Sprechmann, Pol Moreno, Nicolas Heess, Michael Bowling, Demis Hassabis, Karl Tuyls

    Abstract: Identifying key patterns of tactics implemented by rival teams, and developing effective responses, lies at the heart of modern football. However, doing so algorithmically remains an open research challenge. To address this unmet need, we propose TacticAI, an AI football tactics assistant developed and evaluated in close collaboration with domain experts from Liverpool FC. We focus on analysing co… ▽ More

    Submitted 17 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 32 pages, 10 figures

  18. arXiv:2309.14586  [pdf, other

    cs.SD cs.AI cs.CV eess.AS eess.SP

    Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: The tongue's intricate 3D structure, comprising localized functional units, plays a crucial role in the production of speech. When measured using tagged MRI, these functional units exhibit cohesive displacements and derived quantities that facilitate the complex process of speech production. Non-negative matrix factorization-based approaches have been shown to estimate the functional units through… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: MICCAI 2023 (Oral presentation)

  19. arXiv:2308.02949  [pdf, other

    eess.IV cs.CV physics.med-ph

    MomentaMorph: Unsupervised Spatial-Temporal Registration with Momenta, Shooting, and Correction

    Authors: Zhangxing Bian, Shuwen Wei, Yihao Liu, Junyu Chen, Jiachen Zhuo, Fangxu Xing, Jonghye Woo, Aaron Carass, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging (tMRI) has been employed for decades to measure the motion of tissue undergoing deformation. However, registration-based motion estimation from tMRI is difficult due to the periodic patterns in these images, particularly when the motion is large. With a larger motion the registration approach gets trapped in a local optima, leading to motion estimation errors. We… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by MICCAI Workshop 2023: Time-Series Data Analytics and Learning (MTSAIL)

  20. arXiv:2307.15615  [pdf, other

    eess.IV cs.CV

    A survey on deep learning in medical image registration: new technologies, uncertainty, evaluation metrics, and beyond

    Authors: Junyu Chen, Yihao Liu, Shuwen Wei, Zhangxing Bian, Shalini Subramanian, Aaron Carass, Jerry L. Prince, Yong Du

    Abstract: Deep learning technologies have dramatically reshaped the field of medical image registration over the past decade. The initial developments, such as regression-based and U-Net-based networks, established the foundation for deep learning in image registration. Subsequent progress has been made in various aspects of deep learning-based registration, including similarity measures, deformation regula… ▽ More

    Submitted 1 November, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted to Medical Image Analysis ((c) MedIA). A list of open-sourced code from the papers reviewed has been organized and is available at https://bit.ly/3QgFJ9z

  21. arXiv:2305.14589  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    Attentive Continuous Generative Self-training for Unsupervised Domain Adaptive Medical Image Translation

    Authors: Xiaofeng Liu, Jerry L. Prince, Fangxu Xing, Jiachen Zhuo, Reese Timothy, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Self-training is an important class of unsupervised domain adaptation (UDA) approaches that are used to mitigate the problem of domain shift, when applying knowledge learned from a labeled source domain to unlabeled and heterogeneous target domains. While self-training-based UDA has shown considerable promise on discriminative tasks, including classification and segmentation, through reliable pseu… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Medical Image Analysis

  22. arXiv:2302.07203  [pdf, other

    eess.IV cs.CV cs.SD eess.AS eess.SP

    Synthesizing audio from tongue motion during speech using tagged MRI via transformer

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Investigating the relationship between internal tissue point motion of the tongue and oropharyngeal muscle deformation measured from tagged MRI and intelligible speech can aid in advancing speech motor control theories and developing novel treatment methods for speech related-disorders. However, elucidating the relationship between these two sources of information is challenging, due in part to th… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: SPIE Medical Imaging: Deep Dive Oral

  23. arXiv:2302.00528  [pdf, other

    eess.IV cs.CV

    A latent space for unsupervised MR image quality control via artifact assessment

    Authors: Lianrui Zuo, Yuan Xue, Blake E. Dewey, Yihao Liu, Jerry L. Prince, Aaron Carass

    Abstract: Image quality control (IQC) can be used in automated magnetic resonance (MR) image analysis to exclude erroneous results caused by poorly acquired or artifact-laden images. Existing IQC methods for MR imaging generally require human effort to craft meaningful features or label large datasets for supervised training. The involvement of human labor can be burdensome and biased, as labeling MR images… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted at the International Society for Optics and Photonics - Medical Imaging (SPIE-MI) 2023

  24. arXiv:2301.07234  [pdf, other

    eess.IV cs.CV

    DRIMET: Deep Registration for 3D Incompressible Motion Estimation in Tagged-MRI with Application to the Tongue

    Authors: Zhangxing Bian, Fangxu Xing, Jinglun Yu, Muhan Shao, Yihao Liu, Aaron Carass, Jiachen Zhuo, Jonghye Woo, Jerry L. Prince

    Abstract: Tagged magnetic resonance imaging~(MRI) has been used for decades to observe and quantify the detailed motion of deforming tissue. However, this technique faces several challenges such as tag fading, large motion, long computation times, and difficulties in obtaining diffeomorphic incompressible flow fields. To address these issues, this paper presents a novel unsupervised phase-based 3D motion es… ▽ More

    Submitted 30 April, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted to MIDL 2023 (oral)

  25. arXiv:2301.06114  [pdf, other

    eess.IV cs.LG

    Segmenting thalamic nuclei from manifold projections of multi-contrast MRI

    Authors: Chang Yan, Muhan Shao, Zhangxing Bian, Anqi Feng, Yuan Xue, Jiachen Zhuo, Rao P. Gullapalli, Aaron Carass, Jerry L. Prince

    Abstract: The thalamus is a subcortical gray matter structure that plays a key role in relaying sensory and motor signals within the brain. Its nuclei can atrophy or otherwise be affected by neurological disease and injuries including mild traumatic brain injury. Segmenting both the thalamus and its nuclei is challenging because of the relatively low contrast within and around the thalamus in conventional m… ▽ More

    Submitted 31 January, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 figures, 2023 SPIE-MI Image Processing

  26. arXiv:2212.06065  [pdf, other

    eess.IV cs.CV

    HACA3: A Unified Approach for Multi-site MR Image Harmonization

    Authors: Lianrui Zuo, Yihao Liu, Yuan Xue, Blake E. Dewey, Samuel W. Remedios, Savannah P. Hays, Murat Bilgel, Ellen M. Mowry, Scott D. Newsome, Peter A. Calabresi, Susan M. Resnick, Jerry L. Prince, Aaron Carass

    Abstract: The lack of standardization is a prominent issue in magnetic resonance (MR) imaging. This often causes undesired contrast variations in the acquired images due to differences in hardware and acquisition parameters. In recent years, image synthesis-based MR harmonization with disentanglement has been proposed to compensate for the undesired contrast variations. Despite the success of existing metho… ▽ More

    Submitted 25 April, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

  27. arXiv:2209.02611  [pdf, other

    eess.IV cs.CV

    Deep filter bank regression for super-resolution of anisotropic MR brain images

    Authors: Samuel W. Remedios, Shuo Han, Yuan Xue, Aaron Carass, Trac D. Tran, Dzung L. Pham, Jerry L. Prince

    Abstract: In 2D multi-slice magnetic resonance (MR) acquisition, the through-plane signals are typically of lower resolution than the in-plane signals. While contemporary super-resolution (SR) methods aim to recover the underlying high-resolution volume, the estimated high-frequency information is implicit via end-to-end data-driven training rather than being explicitly stated and sought. To address this, w… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  28. arXiv:2206.02284  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Jiachen Zhuo, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Understanding the underlying relationship between tongue and oropharyngeal muscle deformation seen in tagged-MRI and intelligible speech plays an important role in advancing speech motor control theories and treatment of speech related-disorders. Because of their heterogeneous representations, however, direct mapping between the two modalities -- i.e., two-dimensional (mid-sagittal slice) plus tim… ▽ More

    Submitted 25 September, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022 (early accept, Oral Presentation ~3%)

  29. arXiv:2205.04982  [pdf, other

    eess.IV cs.CV cs.LG

    Disentangling A Single MR Modality

    Authors: Lianrui Zuo, Yihao Liu, Yuan Xue, Shuo Han, Murat Bilgel, Susan M. Resnick, Jerry L. Prince, Aaron Carass

    Abstract: Disentangling anatomical and contrast information from medical images has gained attention recently, demonstrating benefits for various image analysis tasks. Current methods learn disentangled representations using either paired multi-modal images with the same underlying anatomy or auxiliary labels (e.g., manual delineations) to provide inductive bias for disentanglement. However, these requireme… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  30. arXiv:2203.03626  [pdf, other

    eess.IV cs.CV

    Coordinate Translator for Learning Deformable Medical Image Registration

    Authors: Yihao Liu, Lianrui Zuo, Shuo Han, Yuan Xue, Jerry L. Prince, Aaron Carass

    Abstract: The majority of deep learning (DL) based deformable image registration methods use convolutional neural networks (CNNs) to estimate displacement fields from pairs of moving and fixed images. This, however, requires the convolutional kernels in the CNN to not only extract intensity features from the inputs but also understand image coordinate systems. We argue that the latter task is challenging fo… ▽ More

    Submitted 31 July, 2022; v1 submitted 5 March, 2022; originally announced March 2022.

  31. arXiv:2202.12474  [pdf, other

    eess.IV cs.CV cs.LG

    Structure-aware Unsupervised Tagged-to-Cine MRI Synthesis with Self Disentanglement

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Cycle reconstruction regularized adversarial training -- e.g., CycleGAN, DiscoGAN, and DualGAN -- has been widely used for image style transfer with unpaired training data. Several recent works, however, have shown that local distortions are frequent, and structural consistency cannot be guaranteed. Targeting this issue, prior works usually relied on additional segmentation or consistent feature e… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: SPIE Medical Imaging: Image Processing (Oral presentation)

  32. arXiv:2106.12499  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis

    Authors: Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Reese Timothy, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

    Abstract: Self-training based unsupervised domain adaptation (UDA) has shown great potential to address the problem of domain shift, when applying a trained deep learning model in a source domain to unlabeled target domains. However, while the self-training UDA has demonstrated its effectiveness on discriminative tasks, such as classification and segmentation, via the reliable pseudo-label selection based o… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: MICCAI 2021 (early accept <13%)

  33. arXiv:2105.05382  [pdf

    q-bio.NC cs.AI

    Current State and Future Directions for Learning in Biological Recurrent Neural Networks: A Perspective Piece

    Authors: Luke Y. Prince, Roy Henha Eyono, Ellen Boven, Arna Ghosh, Joe Pemberton, Franz Scherr, Claudia Clopath, Rui Ponte Costa, Wolfgang Maass, Blake A. Richards, Cristina Savin, Katharina Anna Wilmes

    Abstract: We provide a brief review of the common assumptions about biological learning with findings from experimental neuroscience and contrast them with the efficiency of gradient-based learning in recurrent neural networks. The key issues discussed in this review include: synaptic plasticity, neural circuits, theory-experiment divide, and objective functions. We conclude with recommendations for both th… ▽ More

    Submitted 5 January, 2022; v1 submitted 11 May, 2021; originally announced May 2021.

  34. arXiv:2104.00100  [pdf, other

    cs.CV eess.IV

    MR Slice Profile Estimation by Learning to Match Internal Patch Distributions

    Authors: Shuo Han, Samuel Remedios, Aaron Carass, Michael Schär, Jerry L. Prince

    Abstract: To super-resolve the through-plane direction of a multi-slice 2D magnetic resonance (MR) image, its slice selection profile can be used as the degeneration model from high resolution (HR) to low resolution (LR) to create paired data when training a supervised algorithm. Existing super-resolution algorithms make assumptions about the slice selection profile since it is not readily known for a given… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

    Comments: 12 pages, 6 figures, accepted by Information Processing in Medical Imaging (IPMI) 2021

  35. arXiv:2103.13283  [pdf, other

    eess.IV cs.CV cs.LG

    Information-based Disentangled Representation Learning for Unsupervised MR Harmonization

    Authors: Lianrui Zuo, Blake E. Dewey, Aaron Carass, Yihao Liu, Yufan He, Peter A. Calabresi, Jerry L. Prince

    Abstract: Accuracy and consistency are two key factors in computer-assisted magnetic resonance (MR) image analysis. However, contrast variation from site to site caused by lack of standardization in MR acquisition impedes consistent measurements. In recent years, image harmonization approaches have been proposed to compensate for contrast variation in MR images. Current harmonization approaches either requi… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted in the 27th International Conference on Information Processing in Medical Imaging (IPMI 2021)

  36. arXiv:2103.03158  [pdf, other

    cs.CV cs.LG eess.IV stat.AP

    A Structural Causal Model for MR Images of Multiple Sclerosis

    Authors: Jacob C. Reinhold, Aaron Carass, Jerry L. Prince

    Abstract: Precision medicine involves answering counterfactual questions such as "Would this patient respond better to treatment A or treatment B?" These types of questions are causal in nature and require the tools of causal inference to be answered, e.g., with a structural causal model (SCM). In this work, we develop an SCM that models the interaction between demographic information, disease covariates, a… ▽ More

    Submitted 13 July, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: MICCAI 2021

  37. Dual-cycle Constrained Bijective VAE-GAN For Tagged-to-Cine Magnetic Resonance Image Synthesis

    Authors: Xiaofeng Liu, Fangxu Xing, Jerry L. Prince, Aaron Carass, Maureen Stone, Georges El Fakhri, Jonghye Woo

    Abstract: Tagged magnetic resonance imaging (MRI) is a widely used imaging technique for measuring tissue deformation in moving organs. Due to tagged MRI's intrinsic low anatomical resolution, another matching set of cine MRI with higher resolution is sometimes acquired in the same scanning session to facilitate tissue segmentation, thus adding extra time and cost. To mitigate this, in this work, we propose… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Accepted to IEEE International Symposium on Biomedical Imaging (ISBI) 2021

    Journal ref: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)

  38. A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises

    Authors: S. Kevin Zhou, Hayit Greenspan, Christos Davatzikos, James S. Duncan, Bram van Ginneken, Anant Madabhushi, Jerry L. Prince, Daniel Rueckert, Ronald M. Summers

    Abstract: Since its renaissance, deep learning has been widely used in various medical imaging tasks and has achieved remarkable success in many medical imaging applications, thereby propelling us into the so-called artificial intelligence (AI) era. It is known that the success of AI is mostly attributed to the availability of big data with annotations for a single task and the advances in high performance… ▽ More

    Submitted 5 March, 2021; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: 20 pages, 7 figures

    Journal ref: Proceedings of the IEEE (2021)

  39. arXiv:2007.04865  [pdf, other

    cs.CV eess.IV

    A Deep Joint Sparse Non-negative Matrix Factorization Framework for Identifying the Common and Subject-specific Functional Units of Tongue Motion During Speech

    Authors: Jonghye Woo, Fangxu Xing, Jerry L. Prince, Maureen Stone, Arnold Gomez, Timothy G. Reese, Van J. Wedeen, Georges El Fakhri

    Abstract: Intelligible speech is produced by creating varying internal local muscle groupings -- i.e., functional units -- that are generated in a systematic and coordinated manner. There are two major challenges in characterizing and analyzing functional units.~First, due to the complex and convoluted nature of tongue structure and function, it is of great importance to develop a method that can accurately… ▽ More

    Submitted 6 June, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted by Medical Image Analysis

  40. arXiv:2007.03162  [pdf, other

    cs.CV cs.LG eess.IV

    Self domain adapted network

    Authors: Yufan He, Aaron Carass, Lianrui Zuo, Blake E. Dewey, Jerry L. Prince

    Abstract: Domain shift is a major problem for deploying deep networks in clinical practice. Network performance drops significantly with (target) images obtained differently than its (source) training data. Due to a lack of target label data, most work has focused on unsupervised domain adaptation (UDA). Current UDA methods need both source and target data to train models which perform image translation (ha… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: early accept in miccai2020

  41. arXiv:2002.04639  [pdf, other

    eess.IV cs.CV cs.LG

    Validating uncertainty in medical image translation

    Authors: Jacob C. Reinhold, Yufan He, Shizhong Han, Yunqiang Chen, Dashan Gao, Junghoon Lee, Jerry L. Prince, Aaron Carass

    Abstract: Medical images are increasingly used as input to deep neural networks to produce quantitative values that aid researchers and clinicians. However, standard deep neural networks do not provide a reliable measure of uncertainty in those quantitative values. Recent work has shown that using dropout during training and testing can provide estimates of uncertainty. In this work, we investigate using dr… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: IEEE ISBI 2020

  42. arXiv:2002.04626  [pdf, other

    eess.IV cs.CV cs.LG

    Finding novelty with uncertainty

    Authors: Jacob C. Reinhold, Yufan He, Shizhong Han, Yunqiang Chen, Dashan Gao, Junghoon Lee, Jerry L. Prince, Aaron Carass

    Abstract: Medical images are often used to detect and characterize pathology and disease; however, automatically identifying and segmenting pathology in medical images is challenging because the appearance of pathology across diseases varies widely. To address this challenge, we propose a Bayesian deep learning method that learns to translate healthy computed tomography images to magnetic resonance images a… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: SPIE Medical Imaging 2020

  43. arXiv:1812.04652  [pdf, other

    cs.CV

    Evaluating the Impact of Intensity Normalization on MR Image Synthesis

    Authors: Jacob C. Reinhold, Blake E. Dewey, Aaron Carass, Jerry L. Prince

    Abstract: Image synthesis learns a transformation from the intensity features of an input image to yield a different tissue contrast of the output image. This process has been shown to have application in many medical image analysis tasks including imputation, registration, and segmentation. To carry out synthesis, the intensities of the input images are typically scaled--i.e., normalized--both in training… ▽ More

    Submitted 11 December, 2018; originally announced December 2018.

    Comments: SPIE Medical Imaging 2019

  44. A Sparse Non-negative Matrix Factorization Framework for Identifying Functional Units of Tongue Behavior from MRI

    Authors: Jonghye Woo, Jerry L. Prince, Maureen Stone, Fangxu Xing, Arnold Gomez, Jordan R. Green, Christopher J. Hartnick, Thomas J. Brady, Timothy G. Reese, Van J. Wedeen, Georges El Fakhri

    Abstract: Muscle coordination patterns of lingual behaviors are synergies generated by deforming local muscle groups in a variety of ways. Functional units are functional muscle groups of local structural elements within the tongue that compress, expand, and move in a cohesive and consistent manner. Identifying the functional units using tagged-Magnetic Resonance Imaging (MRI) sheds light on the mechanisms… ▽ More

    Submitted 29 September, 2018; v1 submitted 15 April, 2018; originally announced April 2018.

    Comments: Accepted at IEEE TMI (https://ieeexplore.ieee.org/document/8467354)

  45. arXiv:1803.06629  [pdf, other

    cs.CV

    Cross-modality image synthesis from unpaired data using CycleGAN: Effects of gradient consistency loss and training data size

    Authors: Yuta Hiasa, Yoshito Otake, Masaki Takao, Takumi Matsuoka, Kazuma Takashima, Jerry L. Prince, Nobuhiko Sugano, Yoshinobu Sato

    Abstract: CT is commonly used in orthopedic procedures. MRI is used along with CT to identify muscle structures and diagnose osteonecrosis due to its superior soft tissue contrast. However, MRI has poor contrast for bone structures. Clearly, it would be helpful if a corresponding CT were available, as bone boundaries are more clearly seen and CT has standardized (i.e., Hounsfield) units. Therefore, we aim a… ▽ More

    Submitted 31 July, 2018; v1 submitted 18 March, 2018; originally announced March 2018.

    Comments: 10 pages, 7 figures, MICCAI 2018 Workshop on Simulation and Synthesis in Medical Imaging

  46. arXiv:1803.05120  [pdf, other

    cs.CV

    Topology guaranteed segmentation of the human retina from OCT using convolutional neural networks

    Authors: Yufan He, Aaron Carass, Bruno M. Jedynak, Sharon D. Solomon, Shiv Saidha, Peter A. Calabresi, Jerry L. Prince

    Abstract: Optical coherence tomography (OCT) is a noninvasive imaging modality which can be used to obtain depth images of the retina. The changing layer thicknesses can thus be quantified by analyzing these OCT images, moreover these changes have been shown to correlate with disease progression in multiple sclerosis. Recent automated retinal layer segmentation tools use machine learning methods to perform… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.

  47. arXiv:1802.09431  [pdf, other

    eess.IV cs.CV

    Self Super-Resolution for Magnetic Resonance Images using Deep Networks

    Authors: Can Zhao, Aaron Carass, Blake E. Dewey, Jerry L. Prince

    Abstract: High resolution magnetic resonance~(MR) imaging~(MRI) is desirable in many clinical applications, however, there is a trade-off between resolution, speed of acquisition, and noise. It is common for MR images to have worse through-plane resolution~(slice thickness) than in-plane resolution. In these MRI images, high frequency information in the through-plane direction is not acquired, and cannot be… ▽ More

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: Accepted by IEEE International Symposium on Biomedical Imaging (ISBI) 2018

  48. arXiv:1705.06870  [pdf, other

    cs.CV

    Fiber Orientation Estimation Guided by a Deep Network

    Authors: Chuyang Ye, Jerry L. Prince

    Abstract: Diffusion magnetic resonance imaging (dMRI) is currently the only tool for noninvasively imaging the brain's white matter tracts. The fiber orientation (FO) is a key feature computed from dMRI for fiber tract reconstruction. Because the number of FOs in a voxel is usually small, dictionary-based sparse reconstruction has been used to estimate FOs with a relatively small number of diffusion gradien… ▽ More

    Submitted 19 May, 2017; originally announced May 2017.

    Comments: A shorter version is accepted by MICCAI 2017

  49. arXiv:1701.06708  [pdf, other

    cs.CV

    Speech Map: A Statistical Multimodal Atlas of 4D Tongue Motion During Speech from Tagged and Cine MR Images

    Authors: Jonghye Woo, Fangxu Xing, Maureen Stone, Jordan Green, Timothy G. Reese, Thomas J. Brady, Van J. Wedeen, Jerry L. Prince, Georges El Fakhri

    Abstract: Quantitative measurement of functional and anatomical traits of 4D tongue motion in the course of speech or other lingual behaviors remains a major challenge in scientific research and clinical applications. Here, we introduce a statistical multimodal atlas of 4D tongue motion using healthy subjects, which enables a combined quantitative characterization of tongue motion in a reference anatomical… ▽ More

    Submitted 14 September, 2018; v1 submitted 23 January, 2017; originally announced January 2017.

    Comments: Accepted at Journal of Computer Methods in Biomechanics and Biomedical Engineering

  50. Estimation of Fiber Orientations Using Neighborhood Information

    Authors: Chuyang Ye, Jiachen Zhuo, Rao P. Gullapalli, Jerry L. Prince

    Abstract: Data from diffusion magnetic resonance imaging (dMRI) can be used to reconstruct fiber tracts, for example, in muscle and white matter. Estimation of fiber orientations (FOs) is a crucial step in the reconstruction process and these estimates can be corrupted by noise. In this paper, a new method called Fiber Orientation Reconstruction using Neighborhood Information (FORNI) is described and shown… ▽ More

    Submitted 16 May, 2016; v1 submitted 15 January, 2016; originally announced January 2016.

    Comments: Journal paper accepted in Medical Image Analysis. 35 pages and 16 figures