Showing 1–23 of 23 results for author: van der Sommen, F

  1. arXiv:2506.10634  [pdf, ps, other]

    cs.CV cs.AI

    Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models

    Authors: Francisco Caetano, Christiaan Viviers, Peter H. N. De With, Fons van der Sommen

    Abstract: Flow Matching has emerged as a powerful framework for learning continuous transformations between distributions, enabling high-fidelity generative modeling. This work introduces Symmetrical Flow Matching (SymmFlow), a new formulation that unifies semantic segmentation, classification, and image generation within a single model. Using a symmetric learning objective, SymmFlow models forward and reve…

    Submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2506.01471  [pdf, ps, other]

    cs.CV

    SemiVT-Surge: Semi-Supervised Video Transformer for Surgical Phase Recognition

    Authors: Yiping Li, Ronald de Jong, Sahar Nasirihaghighi, Tim Jaspers, Romy van Jaarsveld, Gino Kuiper, Richard van Hillegersberg, Fons van der Sommen, Jelle Ruurda, Marcel Breeuwer, Yasmina Al Khalil

    Abstract: Accurate surgical phase recognition is crucial for computer-assisted interventions and surgical video analysis. Annotating long surgical videos is labor-intensive, driving research toward leveraging unlabeled data for strong performance with minimal annotations. Although self-supervised learning has gained popularity by enabling large-scale pretraining followed by fine-tuning on small labeled subs…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted for MICCAI 2025

  3. arXiv:2502.16610  [pdf, other]

    cs.CV cs.AI

    AdverX-Ray: Ensuring X-Ray Integrity Through Frequency-Sensitive Adversarial VAEs

    Authors: Francisco Caetano, Christiaan Viviers, Lena Filatova, Peter H. N. de With, Fons van der Sommen

    Abstract: Ensuring the quality and integrity of medical images is crucial for maintaining diagnostic accuracy in deep learning-based Computer-Aided Diagnosis and Computer-Aided Detection (CAD) systems. Covariate shifts are subtle variations in the data distribution caused by different imaging devices or settings and can severely degrade model performance, similar to the effects of adversarial attacks. There…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: SPIE Medical Imaging 2025. Runner-up, 2025 Robert F. Wagner All-Conference Best Student Paper Award

  4. arXiv:2501.09436  [pdf, other]

    cs.CV

    Scaling up self-supervised learning for improved surgical foundation models

    Authors: Tim J. M. Jaspers, Ronald L. P. D. de Jong, Yiping Li, Carolus H. J. Kusters, Franciscus H. A. Bakker, Romy C. van Jaarsveld, Gino M. Kuiper, Richard van Hillegersberg, Jelle P. Ruurda, Willem M. Brinkman, Josien P. W. Pluim, Peter H. N. de With, Marcel Breeuwer, Yasmina Al Khalil, Fons van der Sommen

    Abstract: Foundation models have revolutionized computer vision by achieving vastly superior performance across diverse tasks through large-scale pretraining on extensive datasets. However, their application in surgical computer vision has been limited. This study addresses this gap by introducing SurgeNetXL, a novel surgical foundation model that sets a new benchmark in surgical computer vision. Trained on…

    Submitted 16 January, 2025; originally announced January 2025.

  5. arXiv:2501.08005  [pdf, ps, other]

    cs.CV cs.AI eess.IV

    DisCoPatch: Taming Adversarially-driven Batch Statistics for Improved Out-of-Distribution Detection

    Authors: Francisco Caetano, Christiaan Viviers, Luis A. Zavala-Mondragón, Peter H. N. de With, Fons van der Sommen

    Abstract: Out-of-distribution (OOD) detection holds significant importance across many applications. While semantic and domain-shift OOD problems are well-studied, this work focuses on covariate shifts - subtle variations in the data distribution that can degrade machine learning performance. We hypothesize that detecting these subtle shifts can improve our understanding of in-distribution boundaries, ultim…

    Submitted 30 June, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: ICCV 2025

  6. arXiv:2412.03401  [pdf, other]

    cs.CV cs.AI

    Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy

    Authors: Ronald L. P. D. de Jong, Yasmina al Khalil, Tim J. M. Jaspers, Romy C. van Jaarsveld, Gino M. Kuiper, Yiping Li, Richard van Hillegersberg, Jelle P. Ruurda, Marcel Breeuwer, Fons van der Sommen

    Abstract: Esophageal cancer is among the most common types of cancer worldwide. It is traditionally treated using open esophagectomy, but in recent years, robot-assisted minimally invasive esophagectomy (RAMIE) has emerged as a promising alternative. However, robot-assisted surgery can be challenging for novice surgeons, as they often suffer from a loss of spatial orientation. Computer-aided anatomy recogni…

    Submitted 18 December, 2024; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: Accepted for presentation at the SPIE Medical Imaging Conference, 2025

  7. arXiv:2411.16370  [pdf, ps, other]

    cs.CV cs.AI cs.LG eess.IV stat.ML

    A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation

    Authors: M. M. A. Valiuddin, R. J. G. van Sloun, C. G. A. Viviers, P. H. N. de With, F. van der Sommen

    Abstract: Advancements in image segmentation play an integral role within the broad scope of Deep Learning-based Computer Vision. Furthermore, their widespread applicability in critical real-world tasks has resulted in challenges related to the reliability of such algorithms. Hence, uncertainty quantification has been extensively studied within this context, enabling the expression of model ignorance (epist…

    Submitted 2 July, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

    Comments: 31 pages of content, revised

  8. arXiv:2409.03043  [pdf, other]

    cs.CV cs.AI cs.LG

    Can Your Generative Model Detect Out-of-Distribution Covariate Shift?

    Authors: Christiaan Viviers, Amaan Valiuddin, Francisco Caetano, Lemar Abdi, Lena Filatova, Peter de With, Fons van der Sommen

    Abstract: Detecting Out-of-Distribution (OOD) sensory data and covariate distribution shift aims to identify new test examples with different high-level image statistics to the captured, normal and In-Distribution (ID) set. Existing OOD detection literature largely focuses on semantic shift with little-to-no consensus over covariate shift. Generative models capture the ID data in an unsupervised manner, ena…

    Submitted 9 October, 2024; v1 submitted 4 September, 2024; originally announced September 2024.

    Comments: ECCV 2024, typos corrected

  9. arXiv:2408.12945  [pdf, other]

    cs.CV

    Find the Assembly Mistakes: Error Segmentation for Industrial Applications

    Authors: Dan Lehman, Tim J. Schoonbeek, Shao-Hsuan Hung, Jacek Kustra, Peter H. N. de With, Fons van der Sommen

    Abstract: Recognizing errors in assembly and maintenance procedures is valuable for industrial applications, since it can increase worker efficiency and prevent unplanned down-time. Although assembly state recognition is gaining attention, none of the current works investigate assembly error localization. Therefore, we propose StateDiffNet, which localizes assembly errors based on detecting the differences…

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: 23 pages (14 main paper, 2 references, 7 supplementary), 15 figures (8 main paper, 7 supplementary). Accepted at ECCV Vision-based InduStrial InspectiON (VISION) workshop

  10. arXiv:2408.11700  [pdf, other]

    cs.CV

    Supervised Representation Learning towards Generalizable Assembly State Recognition

    Authors: Tim J. Schoonbeek, Goutham Balachandran, Hans Onvlee, Tim Houben, Shao-Hsuan Hung, Jacek Kustra, Peter H. N. de With, Fons van der Sommen

    Abstract: Assembly state recognition facilitates the execution of assembly procedures, offering feedback to enhance efficiency and minimize errors. However, recognizing assembly states poses challenges in scalability, since parts are frequently updated, and the robustness to execution errors remains underexplored. To address these challenges, this paper proposes an approach based on representation learning…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 8 pages, 8 figures

  11. Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision

    Authors: Tim J. M. Jaspers, Ronald L. P. D. de Jong, Yasmina Al Khalil, Tijn Zeelenberg, Carolus H. J. Kusters, Yiping Li, Romy C. van Jaarsveld, Franciscus H. A. Bakker, Jelle P. Ruurda, Willem M. Brinkman, Peter H. N. De With, Fons van der Sommen

    Abstract: Over the past decade, computer vision applications in minimally invasive surgery have rapidly increased. Despite this growth, the impact of surgical computer vision remains limited compared to other medical fields like pathology and radiology, primarily due to the scarcity of representative annotated data. Whereas transfer learning from large annotated datasets such as ImageNet has been convention…

    Submitted 26 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: accepted - Data Engineering in Medical Imaging (DEMI) Workshop @ MICCAI2024

    Report number: vol 15265

    Journal ref: Data Engineering in Medical Imaging. DEMI 2024. Lecture Notes in Computer Science

  12. Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries

    Authors: Christiaan G. A. Viviers, Lena Filatova, Maurice Termeer, Peter H. N. de With, Fons van der Sommen

    Abstract: Accurate 6-DoF pose estimation of surgical instruments during minimally invasive surgeries can substantially improve treatment strategies and eventual surgical outcome. Existing deep learning methods have achieved accurate results, but they require custom approaches for each object and laborious setup and training environments often stretching to extensive simulations, whilst lacking real-time com…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Early author version of paper. Refer to the full paper at https://ieeexplore.ieee.org/document/10478293

    Journal ref: IEEE Transactions on Image Processing (2024) (Volume: 33) Page(s): 2462 - 2476

  13. arXiv:2310.17323  [pdf, other]

    cs.CV

    IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting

    Authors: Tim J. Schoonbeek, Tim Houben, Hans Onvlee, Peter H. N. de With, Fons van der Sommen

    Abstract: Although action recognition for procedural tasks has received notable attention, it has a fundamental flaw in that no measure of success for actions is provided. This limits the applicability of such systems especially within the industrial domain, since the outcome of procedural actions is often significantly more important than the mere execution. To address this limitation, we define the novel…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted for WACV 2024. 15 pages, 9 figures, including supplementary materials

  14. arXiv:2310.00639  [pdf, other]

    eess.IV cs.CV

    Segmentation-based Assessment of Tumor-Vessel Involvement for Surgical Resectability Prediction of Pancreatic Ductal Adenocarcinoma

    Authors: Christiaan Viviers, Mark Ramaekers, Amaan Valiuddin, Terese Hellström, Nick Tasios, John van der Ven, Igor Jacobs, Lotte Ewals, Joost Nederend, Peter de With, Misha Luyer, Fons van der Sommen

    Abstract: Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer with limited treatment options. This research proposes a workflow and deep learning-based segmentation models to automatically assess tumor-vessel involvement, a key factor in determining tumor resectability. Correct assessment of resectability is vital to determine treatment options. The proposed workflow involves processing CT…

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: ICCV CVAMD 2023

  15. Investigating and Improving Latent Density Segmentation Models for Aleatoric Uncertainty Quantification in Medical Imaging

    Authors: M. M. Amaan Valiuddin, Christiaan G. A. Viviers, Ruud J. G. van Sloun, Peter H. N. de With, Fons van der Sommen

    Abstract: Data uncertainties, such as sensor noise, occlusions or limitations in the acquisition method can introduce irreducible ambiguities in images, which result in varying, yet plausible, semantic hypotheses. In Machine Learning, this ambiguity is commonly referred to as aleatoric uncertainty. In image segmentation, latent density models can be utilized to address this problem. The most popular approac…

    Submitted 20 August, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

  16. arXiv:2307.13425  [pdf, other]

    cs.CV cs.LG eess.IV eess.SP

    A signal processing interpretation of noise-reduction convolutional neural networks

    Authors: Luis A. Zavala-Mondragón, Peter H. N. de With, Fons van der Sommen

    Abstract: Encoding-decoding CNNs play a central role in data-driven noise reduction and can be found within numerous deep-learning algorithms. However, the development of these CNN architectures is often done in ad-hoc fashion and theoretical underpinnings for important design choices is generally lacking. Up to this moment there are different existing relevant works that strive to explain the internal oper…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: This article is currently accepted in IEEE Signal Processing Magazine (SPM)

  17. arXiv:2305.00950  [pdf, other]

    eess.IV cs.CV cs.LG

    Probabilistic 3D segmentation for aleatoric uncertainty quantification in full 3D medical data

    Authors: Christiaan G. A. Viviers, Amaan M. M. Valiuddin, Peter H. N. de With, Fons van der Sommen

    Abstract: Uncertainty quantification in medical images has become an essential addition to segmentation models for practical application in the real world. Although there are valuable developments in accurate uncertainty quantification methods using 2D images and slices of 3D volumes, in clinical practice, the complete 3D volumes (such as CT and MRI scans) are used to evaluate and plan the medical procedure…

    Submitted 1 May, 2023; originally announced May 2023.

  18. arXiv:2211.03211  [pdf, other]

    cs.CV cs.LG

    Towards real-time 6D pose estimation of objects in single-view cone-beam X-ray

    Authors: Christiaan G. A. Viviers, Joel de Bruijn, Lena Filatova, Peter H. N. de With, Fons van der Sommen

    Abstract: Deep learning-based pose estimation algorithms can successfully estimate the pose of objects in an image, especially in the field of color images. 6D Object pose estimation based on deep learning models for X-ray images often use custom architectures that employ extensive CAD models and simulated data for training purposes. Recent RGB-based methods opt to solve pose estimation problems using small…

    Submitted 6 November, 2022; originally announced November 2022.

    Comments: Published at SPIE Medical Imaging 2022

  19. arXiv:2208.04639  [pdf, other]

    cs.CV

    Efficient Out-of-Distribution Detection of Melanoma with Wavelet-based Normalizing Flows

    Authors: M. M. Amaan Valiuddin, Christiaan G. A. Viviers, Ruud J. G. van Sloun, Peter H. N. de With, Fons van der Sommen

    Abstract: Melanoma is a serious form of skin cancer with high mortality rate at later stages. Fortunately, when detected early, the prognosis of melanoma is promising and malignant melanoma incidence rates are relatively low. As a result, datasets are heavily imbalanced which complicates training current state-of-the-art supervised classification AI models. We propose to use generative models to learn the b…

    Submitted 10 August, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Published at 1st Workshop on Cancer Prevention through early detecTion (MICCAI 2022)

  20. arXiv:2208.03581  [pdf, other]

    cs.CV cs.LG

    Improved Pancreatic Tumor Detection by Utilizing Clinically-Relevant Secondary Features

    Authors: Christiaan G. A. Viviers, Mark Ramaekers, Peter H. N. de With, Dimitrios Mavroeidis, Joost Nederend, Misha Luyer, Fons van der Sommen

    Abstract: Pancreatic cancer is one of the global leading causes of cancer-related deaths. Despite the success of Deep Learning in computer-aided diagnosis and detection (CAD) methods, little attention has been paid to the detection of Pancreatic Cancer. We propose a method for detecting pancreatic tumor that utilizes clinically-relevant features in the surrounding anatomical structures, thereby better aimin…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: Published at MICCAI 2022 CaPTion Workshop on Cancer Prevention through early detecTion

  21. arXiv:2108.02155  [pdf, other]

    cs.CV cs.LG

    Improving Aleatoric Uncertainty Quantification in Multi-Annotated Medical Image Segmentation with Normalizing Flows

    Authors: M. M. A. Valiuddin, C. G. A. Viviers, R. J. G. van Sloun, P. H. N. de With, F. van der Sommen

    Abstract: Quantifying uncertainty in medical image segmentation applications is essential, as it is often connected to vital decision-making. Compelling attempts have been made in quantifying the uncertainty in image segmentation architectures, e.g. to learn a density segmentation model conditioned on the input image. Typical work in this field restricts these learnt densities to be strictly Gaussian. In th…

    Submitted 5 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: Accepted for UNSURE at MICCAI 2021. 13 pages and 7 figures

  22. Why rankings of biomedical image analysis competitions should be interpreted with care

    Authors: Lena Maier-Hein, Matthias Eisenmann, Annika Reinke, Sinan Onogur, Marko Stankovic, Patrick Scholz, Tal Arbel, Hrvoje Bogunovic, Andrew P. Bradley, Aaron Carass, Carolin Feldmann, Alejandro F. Frangi, Peter M. Full, Bram van Ginneken, Allan Hanbury, Katrin Honauer, Michal Kozubek, Bennett A. Landman, Keno März, Oskar Maier, Klaus Maier-Hein, Bjoern H. Menze, Henning Müller, Peter F. Neher, Wiro Niessen, et al. (13 additional authors not shown)

    Abstract: International challenges have become the standard for validation of biomedical image analysis methods. Given their scientific impact, it is surprising that a critical analysis of common practices related to the organization of challenges has not yet been performed. In this paper, we present a comprehensive analysis of biomedical image analysis challenges conducted up to now. We demonstrate the imp…

    Submitted 18 September, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Article published in Nature Communications: https://rdcu.be/bRmNr

    Journal ref: Nature communications 9.1 (2018): 5217

  23. arXiv:1707.08567  [pdf]

    cs.IT

    Proceedings of Workshop AEW10: Concepts in Information Theory and Communications

    Authors: Kees A. Schouhamer Immink, Stan Baggen, Ferdaous Chaabane, Yanling Chen, Peter H. N. de With, Hela Gassara, Hamed Gharbi, Adel Ghazel, Khaled Grati, Naira M. Grigoryan, Ashot Harutyunyan, Masayuki Imanishi, Mitsugu Iwamoto, Ken-ichi Iwata, Hiroshi Kamabe, Brian M. Kurkoski, Shigeaki Kuzuoka, Patrick Langenhuizen, Jan Lewandowsky, Akiko Manada, Shigeki Miyake, Hiroyoshi Morita, Jun Muramatsu, Safa Najjar, Arnak V. Poghosyan, et al. (9 additional authors not shown)

    Abstract: The 10th Asia-Europe workshop in "Concepts in Information Theory and Communications" AEW10 was held in Boppard, Germany on June 21-23, 2017. It is based on a longstanding cooperation between Asian and European scientists. The first workshop was held in Eindhoven, the Netherlands in 1989. The idea of the workshop is threefold: 1) to improve the communication between the scientist in the different p…

    Submitted 27 July, 2017; originally announced July 2017.

    Comments: 44 pages, editors for the proceedings: Yanling Chen and A. J. Han Vinck

    MSC Class: 68P30; 94A05