Showing 1–17 of 17 results for author: Ghamsarian, N

Searching in archive cs.
  1. arXiv:2506.11356  [pdf, ps, other]

    cs.CV

    GynSurg: A Comprehensive Gynecology Laparoscopic Surgery Dataset

    Authors: Sahar Nasirihaghighi, Negin Ghamsarian, Leonie Peschek, Matteo Munari, Heinrich Husslein, Raphael Sznitman, Klaus Schoeffmann

    Abstract: Recent advances in deep learning have transformed computer-assisted intervention and surgical video analysis, driving improvements not only in surgical training, intraoperative decision support, and patient outcomes, but also in postoperative documentation and surgical discovery. Central to these developments is the availability of large, high-quality annotated datasets. In gynecologic laparoscopy…

    Submitted 12 June, 2025; originally announced June 2025.

  2. arXiv:2506.08896  [pdf, ps, other]

    cs.CV

    WetCat: Automating Skill Assessment in Wetlab Cataract Surgery Videos

    Authors: Negin Ghamsarian, Raphael Sznitman, Klaus Schoeffmann, Jens Kowal

    Abstract: To meet the growing demand for systematic surgical training, wetlab environments have become indispensable platforms for hands-on practice in ophthalmology. Yet, traditional wetlab training depends heavily on manual performance evaluations, which are labor-intensive, time-consuming, and often subject to variability. Recent advances in computer vision offer promising avenues for automated skill ass…

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: 9 pages, 6 figures

  3. arXiv:2505.07691  [pdf, ps, other]

    cs.CV

    Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation

    Authors: Negin Ghamsarian, Sahar Nasirihaghighi, Klaus Schoeffmann, Raphael Sznitman

    Abstract: Semi-supervised learning leverages unlabeled data to enhance model performance, addressing the limitations of fully supervised approaches. Among its strategies, pseudo-supervision has proven highly effective, typically relying on one or multiple teacher networks to refine pseudo-labels before training a student network. A common practice in pseudo-supervision is filtering pseudo-labels based on pr…

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 11 pages, 5 Figures

  4. arXiv:2501.17628  [pdf, other]

    eess.IV cs.CV

    Dual Invariance Self-training for Reliable Semi-supervised Surgical Phase Recognition

    Authors: Sahar Nasirihaghighi, Negin Ghamsarian, Raphael Sznitman, Klaus Schoeffmann

    Abstract: Accurate surgical phase recognition is crucial for advancing computer-assisted interventions, yet the scarcity of labeled data hinders training reliable deep learning models. Semi-supervised learning (SSL), particularly with pseudo-labeling, shows promise over fully supervised methods but often lacks reliable pseudo-label assessment mechanisms. To address this gap, we propose a novel SSL framework…

    Submitted 29 January, 2025; originally announced January 2025.

  5. arXiv:2407.11906  [pdf, other]

    cs.CV cs.RO

    SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge

    Authors: Hao Ding, Yuqian Zhang, Tuxun Lu, Ruixing Liang, Hongchao Shu, Lalithkumar Seenivasan, Yonghao Long, Qi Dou, Cong Gao, Yicheng Leng, Seok Bong Yoo, Eung-Joo Lee, Negin Ghamsarian, Klaus Schoeffmann, Raphael Sznitman, Zijian Wu, Yuxin Chen, Septimiu E. Salcudean, Samra Irshad, Shadi Albarqouni, Seong Tae Kim, Yueyi Sun, An Wang, Long Bai, Hongliang Ren, et al. (17 additional authors not shown)

    Abstract: Surgical data science has seen rapid advancement due to the excellent performance of end-to-end deep neural networks (DNNs) for surgical video analysis. Despite their successes, end-to-end DNNs have been proven susceptible to even minor corruptions, substantially impairing the model's performance. This vulnerability has become a major concern for the translation of cutting-edge technology, especia…

    Submitted 7 April, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

  6. arXiv:2312.06295  [pdf, other]

    cs.CV

    Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection

    Authors: Negin Ghamsarian, Yosuf El-Shabrawi, Sahar Nasirihaghighi, Doris Putzgruber-Adamitsch, Martin Zinkernagel, Sebastian Wolf, Klaus Schoeffmann, Raphael Sznitman

    Abstract: In recent years, the landscape of computer-assisted interventions and post-operative surgical video analysis has been dramatically reshaped by deep-learning techniques, resulting in significant advancements in surgeons' skills, operation room management, and overall surgical outcomes. However, the progression of deep-learning-powered surgical technologies is profoundly reliant on large-scale datas…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 12 pages, 5 figures, 7 tables

  7. arXiv:2312.05900  [pdf, other]

    cs.CV

    Deep-Learning-Assisted Analysis of Cataract Surgery Videos

    Authors: Negin Ghamsarian

    Abstract: Following the technological advancements in medicine, the operation rooms are evolving into intelligent environments. The context-aware systems (CAS) can comprehensively interpret the surgical state, enable real-time warning, and support decision-making, especially for novice surgeons. These systems can automatically analyze surgical videos and perform indexing, documentation, and post-operative r…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 201 pages, Dissertation

  8. arXiv:2312.03409  [pdf, other]

    cs.CV

    DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception

    Authors: Negin Ghamsarian, Sebastian Wolf, Martin Zinkernagel, Klaus Schoeffmann, Raphael Sznitman

    Abstract: Semantic Segmentation plays a pivotal role in many applications related to medical image and video analysis. However, designing a neural network architecture for medical image and surgical video segmentation is challenging due to the diverse features of relevant classes, including heterogeneity, deformability, transparency, blunt boundaries, and various distortions. We propose a network architectu…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 13 pages, 3 figures

  9. arXiv:2312.03401  [pdf, other]

    eess.IV cs.CV

    Predicting Postoperative Intraocular Lens Dislocation in Cataract Surgery via Deep Learning

    Authors: Negin Ghamsarian, Doris Putzgruber-Adamitsch, Stephanie Sarny, Raphael Sznitman, Klaus Schoeffmann, Yosuf El-Shabrawi

    Abstract: A critical yet unpredictable complication following cataract surgery is intraocular lens dislocation. Postoperative stability is imperative, as even a tiny decentration of multifocal lenses or inadequate alignment of the torus in toric lenses due to postoperative rotation can lead to a significant drop in visual acuity. Investigating possible intraoperative indicators that can predict post-surgica…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 12 pages, 5 figures

  10. arXiv:2312.00593  [pdf, other]

    cs.CV

    Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers

    Authors: Sahar Nasirihaghighi, Negin Ghamsarian, Heinrich Husslein, Klaus Schoeffmann

    Abstract: Analyzing laparoscopic surgery videos presents a complex and multifaceted challenge, with applications including surgical training, intra-operative surgical complication prediction, and post-operative surgical assessment. Identifying crucial events within these videos is a significant prerequisite in a majority of these applications. In this paper, we introduce a comprehensive dataset tailored for…

    Submitted 1 December, 2023; originally announced December 2023.

  11. Action Recognition in Video Recordings from Gynecologic Laparoscopy

    Authors: Sahar Nasirihaghighi, Negin Ghamsarian, Daniela Stefanics, Klaus Schoeffmann, Heinrich Husslein

    Abstract: Action recognition is a prerequisite for many applications in laparoscopic video analysis including but not limited to surgical training, operation room planning, follow-up surgery preparation, post-operative surgical assessment, and surgical outcome estimation. However, automatic action recognition in laparoscopic surgeries involves numerous challenges such as (I) cross-action and intra-action du…

    Submitted 30 November, 2023; originally announced November 2023.

  12. arXiv:2307.16660  [pdf, other]

    cs.CV

    Domain Adaptation for Medical Image Segmentation using Transformation-Invariant Self-Training

    Authors: Negin Ghamsarian, Javier Gamazo Tejero, Pablo Márquez Neila, Sebastian Wolf, Martin Zinkernagel, Klaus Schoeffmann, Raphael Sznitman

    Abstract: Models capable of leveraging unlabelled data are crucial in overcoming large distribution gaps between the acquired datasets across different imaging devices and configurations. In this regard, self-training techniques based on pseudo-labeling have been shown to be highly effective for semi-supervised domain adaptation. However, the unreliability of pseudo labels can hinder the capability of self-…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 11 pages, 5 figures, accepted at 26th international conference on Medical Image Computing & Computer Assisted Intervention (MICCAI 2023)

  13. arXiv:2207.01453  [pdf, other]

    cs.CV

    DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

    Authors: Negin Ghamsarian, Mario Taschwer, Raphael Sznitman, Klaus Schoeffmann

    Abstract: Semantic segmentation in cataract surgery has a wide range of applications contributing to surgical outcome enhancement and clinical risk reduction. However, the varying issues in segmenting the different relevant structures in these surgeries make the designation of a unique network quite challenging. This paper proposes a semantic segmentation network, termed DeepPyramid, that can deal with thes…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 11 pages, 4 figures, accepted at 25th international conference on Medical Image Computing & Computer Assisted Intervention (MICCAI 2022). arXiv admin note: substantial text overlap with arXiv:2109.05352

  14. arXiv:2109.12448  [pdf, other]

    eess.IV cs.CV cs.LG

    ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos

    Authors: Negin Ghamsarian, Mario Taschwer, Doris Putzgruber-Adamitsch, Stephanie Sarny, Yosuf El-Shabrawi, Klaus Schoeffmann

    Abstract: Semantic segmentation in surgical videos is a prerequisite for a broad range of applications towards improving surgical outcomes and surgical video analysis. However, semantic segmentation in surgical videos involves many challenges. In particular, in cataract surgery, various features of the relevant objects such as blunt edges, color and context variation, reflection, transparency, and motion bl…

    Submitted 25 September, 2021; originally announced September 2021.

    Comments: 12 pages, 5 figures, accepted at the 28th International Conference on Neural Information Processing (ICONIP), 2021

  15. arXiv:2109.05352  [pdf, other]

    cs.CV cs.LG

    DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

    Authors: Negin Ghamsarian, Mario Taschwer, Klaus Schoeffmann

    Abstract: Semantic segmentation in cataract surgery has a wide range of applications contributing to surgical outcome enhancement and clinical risk reduction. However, the varying issues in segmenting the different relevant instances make the designation of a unique network quite challenging. This paper proposes a semantic segmentation network termed as DeepPyram that can achieve superior performance in seg…

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: 12 pages, 10 figures

  16. arXiv:2107.00875  [pdf, other]

    eess.IV cs.CV

    LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in Cataract Surgery Videos

    Authors: Negin Ghamsarian, Mario Taschwer, Doris Putzgruber-Adamitsch, Stephanie Sarny, Yosuf El-Shabrawi, Klaus Schoeffmann

    Abstract: A critical complication after cataract surgery is the dislocation of the lens implant leading to vision deterioration and eye trauma. In order to reduce the risk of this complication, it is vital to discover the risk factors during the surgery. However, studying the relationship between lens dislocation and its suspicious risk factors using numerous videos is a time-extensive procedure. Hence, the…

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 13 pages, 5 figures, accepted at 24th international conference on Medical Image Computing & Computer Assisted Intervention (MICCAI 2021)

  17. Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization

    Authors: Negin Ghamsarian, Mario Taschwer, Doris Putzgruber-Adamitsch, Stephanie Sarny, Klaus Schoeffmann

    Abstract: In cataract surgery, the operation is performed with the help of a microscope. Since the microscope enables watching real-time surgery by up to two people only, a major part of surgical training is conducted using the recorded videos. To optimize the training procedure with the video content, the surgeons require an automatic relevance detection approach. In addition to relevance-based retrieval,…

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: 8 pages, 4 figures, accepted at 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, 2020