Skip to main content

Showing 1–45 of 45 results for author: nath, V

.
  1. arXiv:2505.04522  [pdf, ps, other

    eess.IV cs.CV

    Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model

    Authors: Pengfei Guo, Can Zhao, Dong Yang, Yufan He, Vishwesh Nath, Ziyue Xu, Pedro R. A. S. Bassi, Zongwei Zhou, Benjamin D. Simon, Stephanie Anne Harmon, Baris Turkbey, Daguang Xu

    Abstract: Generating 3D CT volumes from descriptive free-text inputs presents a transformative opportunity in diagnostics and research. In this paper, we introduce Text2CT, a novel approach for synthesizing 3D CT volumes from textual descriptions using the diffusion model. Unlike previous methods that rely on fixed-format text input, Text2CT employs a novel prompt formulation that enables generation from di… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  2. arXiv:2504.16578  [pdf, other

    gr-qc hep-th

    Spontaneous symmetry breaking induced by curvature : Analysis via non-perturbative 2PI Hartree approximation

    Authors: Vishal Nath, Kinsuk Roy, Sourav Bhattacharya

    Abstract: In this work we investigate the spontaneous symmetry breaking (SSB) induced by a classical background spacetime's curvature, via the 2 particle irreducible (2PI) non-perturbative effective action formalism. We use the standard Schwinger-DeWitt local expansion of the Feynman propagator, appropriate to probe the effect of spacetime curvature on the local or short scale physics. Recently it was shown… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: v1, 31 pages, 27 figures

  3. arXiv:2503.07480  [pdf, ps, other

    physics.flu-dyn

    Trapping and Transport of Inertial Particles in a Taylor-Green Vortex: Effects of Added Mass and History Force

    Authors: Prabhash Kumar, Anu V. S. Nath, Mahesh Panchagnula, Anubhab Roy

    Abstract: We investigate the dynamics of small inertial particles in a two-dimensional, steady Taylor-Green vortex flow. A classic study by Taylor (2022) showed that heavy inertial point particles (having density parameter R = 1) are trapped by the flow separatrices when the particle Stokes number St, which measures the particle's inertia, is less than 1/4. Here, we consider finitely dense particles, incorp… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 21 pages, 10 figures

  4. arXiv:2501.01290  [pdf, other

    cs.CL

    ToolComp: A Multi-Tool Reasoning & Process Supervision Benchmark

    Authors: Vaskar Nath, Pranav Raja, Claire Yoon, Sean Hendryx

    Abstract: Despite recent advances in AI, the development of systems capable of executing complex, multi-step reasoning tasks involving multiple tools remains a significant challenge. Current benchmarks fall short in capturing the real-world complexity of tool-use reasoning, where verifying the correctness of not only the final answer but also the intermediate steps is important for evaluation, development,… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

  5. arXiv:2412.04468  [pdf, other

    cs.CV

    NVILA: Efficient Frontier Visual Language Models

    Authors: Zhijian Liu, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-An Huang, An-Chieh Cheng, Vishwesh Nath, Jinyi Hu, Sifei Liu, Ranjay Krishna, Daguang Xu, Xiaolong Wang, Pavlo Molchanov, Jan Kautz, Hongxu Yin , et al. (2 additional authors not shown)

    Abstract: Visual language models (VLMs) have made significant advances in accuracy in recent years. However, their efficiency has received much less attention. This paper introduces NVILA, a family of open VLMs designed to optimize both efficiency and accuracy. Building on top of VILA, we improve its model architecture by first scaling up the spatial and temporal resolutions, and then compressing visual tok… ▽ More

    Submitted 5 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  6. arXiv:2411.12915  [pdf, other

    cs.CV

    VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

    Authors: Vishwesh Nath, Wenqi Li, Dong Yang, Andriy Myronenko, Mingxin Zheng, Yao Lu, Zhijian Liu, Hongxu Yin, Yucheng Tang, Pengfei Guo, Can Zhao, Ziyue Xu, Yufan He, Greg Heinrich, Yee Man Law, Benjamin Simon, Stephanie Harmon, Stephen Aylward, Marc Edgar, Michael Zephyr, Song Han, Pavlo Molchanov, Baris Turkbey, Holger Roth, Daguang Xu

    Abstract: Generalist vision language models (VLMs) have made significant strides in computer vision, but they fall short in specialized fields like healthcare, where expert knowledge is essential. In traditional computer vision tasks, creative or approximate answers may be acceptable, but in healthcare, precision is paramount.Current large multimodal models like Gemini and GPT-4o are insufficient for medica… ▽ More

    Submitted 4 March, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

  7. arXiv:2411.09618  [pdf, other

    physics.med-ph cs.LG

    MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI

    Authors: Nancy R. Newlin, Kurt Schilling, Serge Koudoro, Bramsh Qamar Chandio, Praitayini Kanakaraj, Daniel Moyer, Claire E. Kelly, Sila Genc, Jian Chen, Joseph Yuan-Mou Yang, Ye Wu, Yifei He, Jiawei Zhang, Qingrun Zeng, Fan Zhang, Nagesh Adluru, Vishwesh Nath, Sudhir Pathak, Walter Schneider, Anurag Gade, Yogesh Rathi, Tom Hendriks, Anna Vilanova, Maxime Chamberland, Tomasz Pieciak , et al. (11 additional authors not shown)

    Abstract: White matter alterations are increasingly implicated in neurological diseases and their progression. International-scale studies use diffusion-weighted magnetic resonance imaging (DW-MRI) to qualitatively identify changes in white matter microstructure and connectivity. Yet, quantitative analysis of DW-MRI data is hindered by inconsistencies stemming from varying acquisition protocols. There is a… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2024/019

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024)

  8. arXiv:2410.06563  [pdf, other

    gr-qc hep-th

    Self interacting scalar field theory in general curved spacetimes at zero and finite temperature revisited

    Authors: Vishal Nath, Sourav Bhattacharya

    Abstract: We revisit the problem of spontaneous symmetry breaking (SSB), its restoration, and phase transition for a self interacting quantum scalar field in a general curved background, at zero and finite temperature. To the best of our knowledge, most of the earlier computations in this context have been done in the linear order in curvature, which may not be very suitable for the Ricci flat spacetimes. O… ▽ More

    Submitted 20 March, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: v2; 37pp, 16 figs; added references, discussions and many clarifications; improved presentation; accepted in PRD

  9. arXiv:2410.03717  [pdf, other

    cs.CL cs.AI cs.LG

    Revisiting the Superficial Alignment Hypothesis

    Authors: Mohit Raghavendra, Vaskar Nath, Sean Hendryx

    Abstract: The Superficial Alignment Hypothesis posits that almost all of a language model's abilities and knowledge are learned during pre-training, while post-training is about giving a model the right style and format. We re-examine these claims by empirically studying the scaling behavior of post-training with increasing finetuning examples and evaluating them using objective task-specific standardized b… ▽ More

    Submitted 27 September, 2024; originally announced October 2024.

  10. arXiv:2409.11169  [pdf, other

    eess.IV cs.AI cs.CV

    MAISI: Medical AI for Synthetic Imaging

    Authors: Pengfei Guo, Can Zhao, Dong Yang, Ziyue Xu, Vishwesh Nath, Yucheng Tang, Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, Daguang Xu

    Abstract: Medical imaging analysis faces challenges such as data scarcity, high annotation costs, and privacy concerns. This paper introduces the Medical AI for Synthetic Imaging (MAISI), an innovative approach using the diffusion model to generate synthetic 3D computed tomography (CT) images to address those challenges. MAISI leverages the foundation volume compression network and the latent diffusion mode… ▽ More

    Submitted 29 October, 2024; v1 submitted 13 September, 2024; originally announced September 2024.

    Comments: WACV25 accepted. https://monai.io/research/maisi

  11. arXiv:2409.03733  [pdf, other

    cs.LG cs.AI cs.CL

    Planning In Natural Language Improves LLM Search For Code Generation

    Authors: Evan Wang, Federico Cassano, Catherine Wu, Yunfeng Bai, Will Song, Vaskar Nath, Ziwen Han, Sean Hendryx, Summer Yue, Hugh Zhang

    Abstract: While scaling training compute has led to remarkable improvements in large language models (LLMs), scaling inference compute has not yet yielded analogous gains. We hypothesize that a core missing component is a lack of diverse LLM outputs, leading to inefficient search due to models repeatedly sampling highly similar, yet incorrect generations. We empirically demonstrate that this lack of diversi… ▽ More

    Submitted 18 October, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

  12. arXiv:2408.11210  [pdf, other

    cs.CV

    A Short Review and Evaluation of SAM2's Performance in 3D CT Image Segmentation

    Authors: Yufan He, Pengfei Guo, Yucheng Tang, Andriy Myronenko, Vishwesh Nath, Ziyue Xu, Dong Yang, Can Zhao, Daguang Xu, Wenqi Li

    Abstract: Since the release of Segment Anything 2 (SAM2), the medical imaging community has been actively evaluating its performance for 3D medical image segmentation. However, different studies have employed varying evaluation pipelines, resulting in conflicting outcomes that obscure a clear understanding of SAM2's capabilities and potential applications. We shortly review existing benchmarks and point out… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  13. arXiv:2407.13887  [pdf, other

    cs.CL

    Learning Goal-Conditioned Representations for Language Reward Models

    Authors: Vaskar Nath, Dylan Slack, Jeff Da, Yuntao Ma, Hugh Zhang, Spencer Whitehead, Sean Hendryx

    Abstract: Techniques that learn improved representations via offline data or self-supervised objectives have shown impressive results in traditional reinforcement learning (RL). Nevertheless, it is unclear how improved representation learning can benefit reinforcement learning from human feedback (RLHF) on language models (LMs). In this work, we propose training reward models (RMs) in a contrastive,… ▽ More

    Submitted 23 October, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  14. arXiv:2407.03307  [pdf, other

    eess.IV cs.CV

    HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization

    Authors: Yucheng Tang, Yufan He, Vishwesh Nath, Pengfeig Guo, Ruining Deng, Tianyuan Yao, Quan Liu, Can Cui, Mengmeng Yin, Ziyue Xu, Holger Roth, Daguang Xu, Haichun Yang, Yuankai Huo

    Abstract: In digital pathology, the traditional method for deep learning-based image segmentation typically involves a two-stage process: initially segmenting high-resolution whole slide images (WSI) into smaller patches (e.g., 256x256, 512x512, 1024x1024) and subsequently reconstructing them to their original scale. This method often struggles to capture the complex details and vast scope of WSIs. In this… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  15. arXiv:2407.02604  [pdf, other

    cs.AI cs.CL cs.LG eess.IV

    D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions

    Authors: Hareem Nisar, Syed Muhammad Anwar, Zhifan Jiang, Abhijeet Parida, Ramon Sanchez-Jacob, Vishwesh Nath, Holger R. Roth, Marius George Linguraru

    Abstract: Large vision language models (VLMs) have progressed incredibly from research to applicability for general-purpose use cases. LLaVA-Med, a pioneering large language and vision assistant for biomedicine, can perform multi-modal biomedical image and data analysis to provide a natural language interface for radiologists. While it is highly generalizable and works with multi-modal data, it is currently… ▽ More

    Submitted 2 August, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: accepted to the MICCAI 2024 Second International Workshop on Foundation Models for General Medical AI

  16. arXiv:2406.05285  [pdf, other

    cs.CV

    VISTA3D: A Unified Segmentation Foundation Model For 3D Medical Imaging

    Authors: Yufan He, Pengfei Guo, Yucheng Tang, Andriy Myronenko, Vishwesh Nath, Ziyue Xu, Dong Yang, Can Zhao, Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, Daguang Xu, Wenqi Li

    Abstract: Foundation models for interactive segmentation in 2D natural images and videos have sparked significant interest in building 3D foundation models for medical imaging. However, the domain gaps and clinical use cases for 3D medical imaging require a dedicated model that diverges from existing 2D solutions. Specifically, such foundation models should support a full workflow that can actually reduce h… ▽ More

    Submitted 21 November, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  17. arXiv:2405.17824  [pdf, other

    cs.CV

    mTREE: Multi-Level Text-Guided Representation End-to-End Learning for Whole Slide Image Analysis

    Authors: Quan Liu, Ruining Deng, Can Cui, Tianyuan Yao, Vishwesh Nath, Yucheng Tang, Yuankai Huo

    Abstract: Multi-modal learning adeptly integrates visual and textual data, but its application to histopathology image and text analysis remains challenging, particularly with large, high-resolution images like gigapixel Whole Slide Images (WSIs). Current methods typically rely on manual region labeling or multi-stage learning to assemble local representations (e.g., patch-level) into global features (e.g.,… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  18. arXiv:2405.05539  [pdf, other

    physics.flu-dyn

    Instability of a dusty shear flow

    Authors: Anu V. S. Nath, Anubhab Roy, M. Houssem Kasbaoui

    Abstract: We study the instability of a dusty simple shear flow where the dust particles are distributed non-uniformly. A simple shear flow is modally stable to infinitesimal perturbations. Also, a band of particles remains unaffected in the absence of any background flow. However, we demonstrate that the combined scenario -- comprising a simple shear flow with a localised band of particles -- can exhibit d… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 37 pages, 13 figures

  19. arXiv:2403.10011  [pdf, other

    physics.flu-dyn

    Clustering and chaotic motion of heavy inertial particles in an isolated non-axisymmetric vortex

    Authors: Anu V. S. Nath, Anubhab Roy

    Abstract: We investigate the dynamics of heavy inertial particles in a flow field due to an isolated, non-axisymmetric vortex. For our study, we consider a canonical elliptical vortex - the Kirchhoff vortex and its strained variant, the Kida vortex. Contrary to the anticipated centrifugal dispersion of inertial particles, which is typical in open vortical flows, we observe the clustering of particles around… ▽ More

    Submitted 18 September, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 45 pages, 24 figures

  20. arXiv:2307.16896  [pdf, other

    cs.CV

    Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training

    Authors: Jeya Maria Jose Valanarasu, Yucheng Tang, Dong Yang, Ziyue Xu, Can Zhao, Wenqi Li, Vishal M. Patel, Bennett Landman, Daguang Xu, Yufan He, Vishwesh Nath

    Abstract: Harnessing the power of pre-training on large-scale datasets like ImageNet forms a fundamental building block for the progress of representation learning-driven solutions in computer vision. Medical images are inherently different from natural images as they are acquired in the form of many modalities (CT, MR, PET, Ultrasound etc.) and contain granulated information like tissue, lesion, organs etc… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Preprint

  21. arXiv:2307.12004  [pdf, other

    cs.CV

    COLosSAL: A Benchmark for Cold-start Active Learning for 3D Medical Image Segmentation

    Authors: Han Liu, Hao Li, Xing Yao, Yubo Fan, Dewei Hu, Benoit Dawant, Vishwesh Nath, Zhoubing Xu, Ipek Oguz

    Abstract: Medical image segmentation is a critical task in medical image analysis. In recent years, deep learning based approaches have shown exceptional performance when trained on a fully-annotated dataset. However, data annotation is often a significant bottleneck, especially for 3D medical images. Active learning (AL) is a promising solution for efficient annotation but requires an initial set of labele… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI 2023

  22. arXiv:2306.02900  [pdf, other

    cs.CV

    Robust Fiber Orientation Distribution Function Estimation Using Deep Constrained Spherical Deconvolution for Diffusion MRI

    Authors: Tianyuan Yao, Francois Rheault, Leon Y Cai, Vishwesh nath, Zuhayr Asad, Nancy Newlin, Can Cui, Ruining Deng, Karthik Ramadass, Andrea Shafer, Susan Resnick, Kurt Schilling, Bennett A. Landman, Yuankai Huo

    Abstract: Diffusion-weighted magnetic resonance imaging (DW-MRI) is a critical imaging method for capturing and modeling tissue microarchitecture at a millimeter scale. A common practice to model the measured DW-MRI signal is via fiber orientation distribution function (fODF). This function is the essential first step for the downstream tractography and connectivity analyses. With recent advantages in data… ▽ More

    Submitted 3 December, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 33 pages, 7 figures

  23. arXiv:2305.10655  [pdf, other

    eess.IV cs.CV cs.LG

    DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images

    Authors: Andres Diaz-Pinto, Pritesh Mehta, Sachidanand Alle, Muhammad Asad, Richard Brown, Vishwesh Nath, Alvin Ihsani, Michela Antonelli, Daniel Palkovics, Csaba Pinter, Ron Alkalay, Steve Pieper, Holger R. Roth, Daguang Xu, Prerna Dogra, Tom Vercauteren, Andrew Feng, Abood Quraini, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Automatic segmentation of medical images is a key step for diagnostic and interventional tasks. However, achieving this requires large amounts of annotated volumes, which can be tedious and time-consuming task for expert annotators. In this paper, we introduce DeepEdit, a deep learning-based method for volumetric medical image annotation, that allows automatic and semi-automatic segmentation, and… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  24. arXiv:2304.09804  [pdf, ps, other

    physics.flu-dyn

    Irregular dependence on Stokes number and non-ergodic transport of heavy inertial particles in steady laminar flows

    Authors: Anu V. S. Nath, Anubhab Roy, S. Ravichandran, Rama Govindarajan

    Abstract: Small heavy particles in a fluid flow respond to the flow on a time-scale proportional to their inertia, or Stokes number St. Their behaviour is thought to be gradually modified as St increases. We show, in the steady spatially-periodic laminar Taylor-Green flow, that particle dynamics, and their effective diffusivity, actually change in an irregular, non-monotonic and sometimes discontinuous mann… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  25. arXiv:2303.16520  [pdf, other

    cs.LG cs.AI cs.CV

    Fair Federated Medical Image Segmentation via Client Contribution Estimation

    Authors: Meirui Jiang, Holger R Roth, Wenqi Li, Dong Yang, Can Zhao, Vishwesh Nath, Daguang Xu, Qi Dou, Ziyue Xu

    Abstract: How to ensure fairness is an important topic in federated learning (FL). Recent studies have investigated how to reward clients based on their contribution (collaboration fairness), and how to achieve uniformity of performance across clients (performance fairness). Despite achieving progress on either one, we argue that it is critical to consider them together, in order to engage and motivate more… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  26. arXiv:2303.16376  [pdf, other

    cs.LG

    A Unified Learning Model for Estimating Fiber Orientation Distribution Functions on Heterogeneous Multi-shell Diffusion-weighted MRI

    Authors: Tianyuan Yao, Nancy Newlin, Praitayini Kanakaraj, Vishwesh nath, Leon Y Cai, Karthik Ramadass, Kurt Schilling, Bennett A. Landman, Yuankai Huo

    Abstract: Diffusion-weighted (DW) MRI measures the direction and scale of the local diffusion process in every voxel through its spectrum in q-space, typically acquired in one or more shells. Recent developments in micro-structure imaging and multi-tissue decomposition have sparked renewed attention to the radial b-value dependence of the signal. Applications in tissue classification and micro-architecture… ▽ More

    Submitted 29 January, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

  27. arXiv:2303.16270  [pdf, other

    cs.LG

    Communication-Efficient Vertical Federated Learning with Limited Overlapping Samples

    Authors: Jingwei Sun, Ziyue Xu, Dong Yang, Vishwesh Nath, Wenqi Li, Can Zhao, Daguang Xu, Yiran Chen, Holger R. Roth

    Abstract: Federated learning is a popular collaborative learning approach that enables clients to train a global model without sharing their local data. Vertical federated learning (VFL) deals with scenarios in which the data on clients have different feature spaces but share some overlapping samples. Existing VFL approaches suffer from high communication costs and cannot deal efficiently with limited overl… ▽ More

    Submitted 29 March, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  28. arXiv:2211.02701  [pdf, other

    cs.LG cs.AI cs.CV

    MONAI: An open-source framework for deep learning in healthcare

    Authors: M. Jorge Cardoso, Wenqi Li, Richard Brown, Nic Ma, Eric Kerfoot, Yiheng Wang, Benjamin Murrey, Andriy Myronenko, Can Zhao, Dong Yang, Vishwesh Nath, Yufan He, Ziyue Xu, Ali Hatamizadeh, Andriy Myronenko, Wentao Zhu, Yun Liu, Mingxin Zheng, Yucheng Tang, Isaac Yang, Michael Zephyr, Behrooz Hashemian, Sachidanand Alle, Mohammad Zalbagi Darestani, Charlie Budd , et al. (32 additional authors not shown)

    Abstract: Artificial Intelligence (AI) is having a tremendous impact across most areas of science. Applications of AI in healthcare have the potential to improve our ability to detect, diagnose, prognose, and intervene on human disease. For AI models to be used clinically, they need to be made safe, reproducible and robust, and the underlying software framework must be aware of the particularities (e.g. geo… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: www.monai.io

  29. arXiv:2209.06285  [pdf, other

    cs.CV

    Warm Start Active Learning with Proxy Labels \& Selection via Semi-Supervised Fine-Tuning

    Authors: Vishwesh Nath, Dong Yang, Holger R. Roth, Daguang Xu

    Abstract: Which volume to annotate next is a challenging problem in building medical imaging datasets for deep learning. One of the promising methods to approach this question is active learning (AL). However, AL has been a hard nut to crack in terms of which AL algorithm and acquisition functions are most useful for which datasets. Also, the problem is exacerbated with which volumes to label first when the… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 12 pages, 5 figures

  30. arXiv:2203.12362  [pdf, other

    cs.HC cs.CV cs.LG eess.IV

    MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical Images

    Authors: Andres Diaz-Pinto, Sachidanand Alle, Vishwesh Nath, Yucheng Tang, Alvin Ihsani, Muhammad Asad, Fernando Pérez-García, Pritesh Mehta, Wenqi Li, Mona Flores, Holger R. Roth, Tom Vercauteren, Daguang Xu, Prerna Dogra, Sebastien Ourselin, Andrew Feng, M. Jorge Cardoso

    Abstract: The lack of annotated datasets is a major bottleneck for training new task-specific supervised machine learning models, considering that manual annotation is extremely expensive and time-consuming. To address this problem, we present MONAI Label, a free and open-source framework that facilitates the development of applications based on artificial intelligence (AI) models that aim at reducing the t… ▽ More

    Submitted 28 April, 2023; v1 submitted 23 March, 2022; originally announced March 2022.

  31. arXiv:2201.01266  [pdf, other

    eess.IV cs.CV cs.LG

    Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images

    Authors: Ali Hatamizadeh, Vishwesh Nath, Yucheng Tang, Dong Yang, Holger Roth, Daguang Xu

    Abstract: Semantic segmentation of brain tumors is a fundamental medical image analysis task involving multiple MRI imaging modalities that can assist clinicians in diagnosing the patient and successively studying the progression of the malignant entity. In recent years, Fully Convolutional Neural Networks (FCNNs) approaches have become the de facto standard for 3D medical image segmentation. The popular "U… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

    Comments: 13 pages, 3 figures

  32. arXiv:2112.10652  [pdf, other

    eess.IV cs.CV

    HyperSegNAS: Bridging One-Shot Neural Architecture Search with 3D Medical Image Segmentation using HyperNet

    Authors: Cheng Peng, Andriy Myronenko, Ali Hatamizadeh, Vish Nath, Md Mahfuzur Rahman Siddiquee, Yufan He, Daguang Xu, Rama Chellappa, Dong Yang

    Abstract: Semantic segmentation of 3D medical images is a challenging task due to the high variability of the shape and pattern of objects (such as organs or tumors). Given the recent success of deep learning in medical image segmentation, Neural Architecture Search (NAS) has been introduced to find high-performance 3D segmentation network architectures. However, because of the massive computational require… ▽ More

    Submitted 24 March, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

  33. arXiv:2111.14791  [pdf, other

    cs.CV cs.AI cs.LG

    Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis

    Authors: Yucheng Tang, Dong Yang, Wenqi Li, Holger Roth, Bennett Landman, Daguang Xu, Vishwesh Nath, Ali Hatamizadeh

    Abstract: Vision Transformers (ViT)s have shown great performance in self-supervised learning of global and local representations that can be transferred to downstream applications. Inspired by these results, we introduce a novel self-supervised learning framework with tailored proxy tasks for medical image analysis. Specifically, we propose: (i) a new 3D transformer-based model, dubbed Swin UNEt TRansforme… ▽ More

    Submitted 28 March, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: CVPR'22 Accepted Paper

  34. Transport of condensing droplets in Taylor-Green vortex flow in the presence of thermal noise

    Authors: Anu V. S. Nath, Anubhab Roy, Rama Govindarajan, S. Ravichandran

    Abstract: We study the role of phase change and thermal noise in particle transport in turbulent flows. We employ a toy model to extract the main physics: condensing droplets are modelled as heavy particles which grow in size, the ambient flow is modelled as a two-dimensional Taylor-Green (TG) flow consisting of an array of vortices delineated by separatrices, and thermal noise are modelled as uncorrelated… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

    Comments: 14 pages, 11 figures

  35. arXiv:2107.05471  [pdf, other

    eess.IV cs.CV

    The Power of Proxy Data and Proxy Networks for Hyper-Parameter Optimization in Medical Image Segmentation

    Authors: Vishwesh Nath, Dong Yang, Ali Hatamizadeh, Anas A. Abidin, Andriy Myronenko, Holger Roth, Daguang Xu

    Abstract: Deep learning models for medical image segmentation are primarily data-driven. Models trained with more data lead to improved performance and generalizability. However, training is a computationally expensive process because multiple hyper-parameters need to be tested to find the optimal setting for best performance. In this work, we focus on accelerating the estimation of hyper-parameters by prop… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  36. arXiv:2103.10504  [pdf, other

    eess.IV cs.CV cs.LG

    UNETR: Transformers for 3D Medical Image Segmentation

    Authors: Ali Hatamizadeh, Yucheng Tang, Vishwesh Nath, Dong Yang, Andriy Myronenko, Bennett Landman, Holger Roth, Daguang Xu

    Abstract: Fully Convolutional Neural Networks (FCNNs) with contracting and expanding paths have shown prominence for the majority of medical image segmentation applications since the past decade. In FCNNs, the encoder plays an integral role by learning both global and local features and contextual representations which can be utilized for semantic output prediction by the decoder. Despite their success, the… ▽ More

    Submitted 9 October, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: Accepted to IEEE Winter Conference on Applications of Computer Vision (WACV) 2022

  37. Diminishing Uncertainty within the Training Pool: Active Learning for Medical Image Segmentation

    Authors: Vishwesh Nath, Dong Yang, Bennett A. Landman, Daguang Xu, Holger R. Roth

    Abstract: Active learning is a unique abstraction of machine learning techniques where the model/algorithm could guide users for annotation of a set of data points that would be beneficial to the model, unlike passive machine learning. The primary advantage being that active learning frameworks select data points that can accelerate the learning process of a model and can reduce the amount of data needed to… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: 19 pages, 13 figures, Transactions of Medical Imaging

    Journal ref: IEEE Transactions on Medical Imaging, 2020

  38. arXiv:2003.07921  [pdf, other

    cs.LG stat.ML

    Semi-supervised Contrastive Learning Using Partial Label Information

    Authors: Colin B. Hansen, Vishwesh Nath, Diego A. Mesa, Yuankai Huo, Bennett A. Landman, Thomas A. Lasko

    Abstract: In semi-supervised learning, information from unlabeled examples is used to improve the model learned from labeled examples. In some learning problems, partial label information can be inferred from otherwise unlabeled examples and used to further improve the model. In particular, partial label information exists when subsets of training examples are known to have the same label, even though the l… ▽ More

    Submitted 3 June, 2024; v1 submitted 17 March, 2020; originally announced March 2020.

  39. arXiv:2002.08820  [pdf

    eess.IV cs.CV q-bio.QM

    Deep Learning Estimation of Multi-Tissue Constrained Spherical Deconvolution with Limited Single Shell DW-MRI

    Authors: Vishwesh Nath, Sudhir K. Pathak, Kurt G. Schilling, Walt Schneider, Bennett A. Landman

    Abstract: Diffusion-weighted magnetic resonance imaging (DW-MRI) is the only non-invasive approach for estimation of intra-voxel tissue microarchitecture and reconstruction of in vivo neural pathways for the human brain. With improvement in accelerated MRI acquisition technologies, DW-MRI protocols that make use of multiple levels of diffusion sensitization have gained popularity. A well-known advanced meth… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: 10 pages, 7 figures

  40. arXiv:1911.07927  [pdf

    eess.IV cs.CV

    Deep Learning Captures More Accurate Diffusion Fiber Orientations Distributions than Constrained Spherical Deconvolution

    Authors: Vishwesh Nath, Kurt G. Schilling, Colin B. Hansen, Prasanna Parvathaneni, Allison E. Hainline, Camilo Bermudez, Andrew J. Plassard, Vaibhav Janve, Yurui Gao, Justin A. Blaber, Iwona Stępniewska, Adam W. Anderson, Bennett A. Landman

    Abstract: Confocal histology provides an opportunity to establish intra-voxel fiber orientation distributions that can be used to quantitatively assess the biological relevance of diffusion weighted MRI models, e.g., constrained spherical deconvolution (CSD). Here, we apply deep learning to investigate the potential of single shell diffusion weighted MRI to explain histologically observed fiber orientation… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

    Comments: 2 pages, 4 figures. This work was accepted and published as an abstract at ISMRM 2018 held in Paris, France

  41. arXiv:1907.06319  [pdf

    eess.IV cs.CV

    Enabling Multi-Shell b-Value Generalizability of Data-Driven Diffusion Models with Deep SHORE

    Authors: Vishwesh Nath, Ilwoo Lyu, Kurt G. Schilling, Prasanna Parvathaneni, Colin B. Hansen, Yucheng Tang, Yuankai Huo, Vaibhav A. Janve, Yurui Gao, Iwona Stepniewska, Adam W. Anderson, Bennett A. Landman

    Abstract: Intra-voxel models of the diffusion signal are essential for interpreting organization of the tissue environment at micrometer level with data at millimeter resolution. Recent advances in data driven methods have enabled direct compari-son and optimization of methods for in-vivo data with externally validated histological sections with both 2-D and 3-D histology. Yet, all existing methods make lim… ▽ More

    Submitted 22 February, 2020; v1 submitted 14 July, 2019; originally announced July 2019.

  42. arXiv:1907.05395  [pdf, other

    q-bio.NC eess.IV q-bio.QM

    Cortical Surface Parcellation using Spherical Convolutional Neural Networks

    Authors: Prasanna Parvathaneni, Shunxing Bao, Vishwesh Nath, Neil D. Woodward, Daniel O. Claassen, Carissa J. Cascio, David H. Zald, Yuankai Huo, Bennett A. Landman, Ilwoo Lyu

    Abstract: We present cortical surface parcellation using spherical deep convolutional neural networks. Traditional multi-atlas cortical surface parcellation requires inter-subject surface registration using geometric features with high processing time on a single subject (2-3 hours). Moreover, even optimal surface registration does not necessarily produce optimal cortical parcellation as parcel boundaries a… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

  43. arXiv:1903.04207  [pdf, other

    cs.CV

    Distributed deep learning for robust multi-site segmentation of CT imaging after traumatic brain injury

    Authors: Samuel Remedios, Snehashis Roy, Justin Blaber, Camilo Bermudez, Vishwesh Nath, Mayur B. Patel, John A. Butman, Bennett A. Landman, Dzung L. Pham

    Abstract: Machine learning models are becoming commonplace in the domain of medical imaging, and with these methods comes an ever-increasing need for more data. However, to preserve patient anonymity it is frequently impractical or prohibited to transfer protected health information (PHI) between institutions. Additionally, due to the nature of some studies, there may not be a large public dataset available… ▽ More

    Submitted 11 March, 2019; originally announced March 2019.

  44. arXiv:1811.04289  [pdf

    cs.CV

    Coronary Calcium Detection using 3D Attention Identical Dual Deep Network Based on Weakly Supervised Learning

    Authors: Yuankai Huo, James G. Terry, Jiachen Wang, Vishwesh Nath, Camilo Bermudez, Shunxing Bao, Prasanna Parvathaneni, J. Jeffery Carr, Bennett A. Landman

    Abstract: Coronary artery calcium (CAC) is biomarker of advanced subclinical coronary artery disease and predicts myocardial infarction and death prior to age 60 years. The slice-wise manual delineation has been regarded as the gold standard of coronary calcium detection. However, manual efforts are time and resource consuming and even impracticable to be applied on large-scale cohorts. In this paper, we pr… ▽ More

    Submitted 10 November, 2018; originally announced November 2018.

    Comments: Accepted by SPIE medical imaging 2019

  45. arXiv:1810.04260  [pdf

    cs.CV

    Inter-Scanner Harmonization of High Angular Resolution DW-MRI using Null Space Deep Learning

    Authors: Vishwesh Nath, Prasanna Parvathaneni, Colin B. Hansen, Allison E. Hainline, Camilo Bermudez, Samuel Remedios, Justin A. Blaber, Kurt G. Schilling, Ilwoo Lyu, Vaibhav Janve, Yurui Gao, Iwona Stepniewska, Baxter P. Rogers, Allen T. Newton, L. Taylor Davis, Jeff Luci, Adam W. Anderson, Bennett A. Landman

    Abstract: Diffusion-weighted magnetic resonance imaging (DW-MRI) allows for non-invasive imaging of the local fiber architecture of the human brain at a millimetric scale. Multiple classical approaches have been proposed to detect both single (e.g., tensors) and multiple (e.g., constrained spherical deconvolution, CSD) fiber population orientations per voxel. However, existing techniques generally exhibit l… ▽ More

    Submitted 9 October, 2018; originally announced October 2018.

    Comments: 10 pages, 5 figures