Skip to main content

Showing 1–7 of 7 results for author: Frisch, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03082  [pdf, ps, other

    cs.CV

    SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis

    Authors: Ssharvien Kumar Sivakumar, Yannik Frisch, Ghazal Ghazaei, Anirban Mukhopadhyay

    Abstract: Surgical simulation plays a pivotal role in training novice surgeons, accelerating their learning curve and reducing intra-operative errors. However, conventional simulation tools fall short in providing the necessary photorealism and the variability of human anatomy. In response, current methods are shifting towards generative model-based simulators. Yet, these approaches primarily focus on using… ▽ More

    Submitted 13 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

  2. arXiv:2502.09653  [pdf, other

    eess.IV cs.CV

    SASVi -- Segment Any Surgical Video

    Authors: Ssharvien Kumar Sivakumar, Yannik Frisch, Amin Ranem, Anirban Mukhopadhyay

    Abstract: Purpose: Foundation models, trained on multitudes of public datasets, often require additional fine-tuning or re-prompting mechanisms to be applied to visually distinct target domains such as surgical videos. Further, without domain knowledge, they cannot model the specific semantics of the target domain. Hence, when applied to surgical video segmentation, they fail to generalise to sections where… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  3. arXiv:2502.07945  [pdf, other

    cs.CV cs.LG

    SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion

    Authors: Yannik Frisch, Ssharvien Kumar Sivakumar, Çağhan Köksal, Elsa Böhm, Felix Wagner, Adrian Gericke, Ghazal Ghazaei, Anirban Mukhopadhyay

    Abstract: Surgical simulation offers a promising addition to conventional surgical training. However, available simulation tools lack photorealism and rely on hardcoded behaviour. Denoising Diffusion Models are a promising alternative for high-fidelity image synthesis, but existing state-of-the-art conditioning methods fall short in providing precise control or interactivity over the generated scenes. We… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  4. arXiv:2501.10819  [pdf, other

    cs.CV cs.LG

    GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation

    Authors: Yannik Frisch, Christina Bornberg, Moritz Fuchs, Anirban Mukhopadhyay

    Abstract: Augmentation by generative modelling yields a promising alternative to the accumulation of surgical data, where ethical, organisational and regulatory aspects must be considered. Yet, the joint synthesis of (image, mask) pairs for segmentation, a major application in surgery, is rather unexplored. We propose to learn semantically comprehensive yet compact latent representations of the (image, mask… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

  5. arXiv:2410.17664  [pdf, other

    eess.IV cs.CV

    Deep Generative Models for 3D Medical Image Synthesis

    Authors: Paul Friedrich, Yannik Frisch, Philippe C. Cattin

    Abstract: Deep generative modeling has emerged as a powerful tool for synthesizing realistic medical images, driving advances in medical image analysis, disease diagnosis, and treatment planning. This chapter explores various deep generative models for 3D medical image synthesis, with a focus on Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), and Denoising Diffusion Models (DDMs). W… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

  6. arXiv:2401.06291  [pdf, other

    cs.CV

    Frequency-Time Diffusion with Neural Cellular Automata

    Authors: John Kalkhof, Arlene Kühn, Yannik Frisch, Anirban Mukhopadhyay

    Abstract: Despite considerable success, large Denoising Diffusion Models (DDMs) with UNet backbone pose practical challenges, particularly on limited hardware and in processing gigapixel images. To address these limitations, we introduce two Neural Cellular Automata (NCA)-based DDMs: Diff-NCA and FourierDiff-NCA. Capitalizing on the local communication capabilities of NCA, Diff-NCA significantly reduces the… ▽ More

    Submitted 13 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  7. arXiv:2308.02587  [pdf, other

    eess.IV cs.CV cs.LG

    Synthesising Rare Cataract Surgery Samples with Guided Diffusion Models

    Authors: Yannik Frisch, Moritz Fuchs, Antoine Sanner, Felix Anton Ucar, Marius Frenzel, Joana Wasielica-Poslednik, Adrian Gericke, Felix Mathias Wagner, Thomas Dratsch, Anirban Mukhopadhyay

    Abstract: Cataract surgery is a frequently performed procedure that demands automation and advanced assistance systems. However, gathering and annotating data for training such systems is resource intensive. The publicly available data also comprises severe imbalances inherent to the surgical process. Motivated by this, we analyse cataract surgery video data for the worst-performing phases of a pre-trained… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.