Skip to main content

Showing 1–21 of 21 results for author: Mazomenos, E

.
  1. arXiv:2504.17401  [pdf, other

    cs.CV cs.AI

    StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies

    Authors: Xu Wang, Jialang Xu, Shuai Zhang, Baoru Huang, Danail Stoyanov, Evangelos B. Mazomenos

    Abstract: Stereo disparity estimation is crucial for obtaining depth information in robot-assisted minimally invasive surgery (RAMIS). While current deep learning methods have made significant advancements, challenges remain in achieving an optimal balance between accuracy, robustness, and inference speed. To address these challenges, we propose the StereoMamba architecture, which is specifically designed f… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  2. arXiv:2503.24306  [pdf, other

    cs.CV

    Point Tracking in Surgery--The 2024 Surgical Tattoos in Infrared (STIR) Challenge

    Authors: Adam Schmidt, Mert Asim Karaoglu, Soham Sinha, Mingang Jang, Ho-Gun Ha, Kyungmin Jung, Kyeongmo Gu, Ihsan Ullah, Hyunki Lee, Jonáš Šerých, Michal Neoral, Jiří Matas, Rulin Zhou, Wenlong He, An Wang, Hongliang Ren, Bruno Silva, Sandro Queirós, Estêvão Lima, João L. Vilaça, Shunsuke Kikuchi, Atsushi Kouno, Hiroki Matsuzaki, Tongtong Li, Yulu Chen , et al. (15 additional authors not shown)

    Abstract: Understanding tissue motion in surgery is crucial to enable applications in downstream tasks such as segmentation, 3D reconstruction, virtual tissue landmarking, autonomous probe-based scanning, and subtask autonomy. Labeled data are essential to enabling algorithms in these downstream tasks since they allow us to quantify and train algorithms. This paper introduces a point tracking challenge to a… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

  3. arXiv:2503.22592  [pdf, other

    eess.IV cs.AI cs.CV

    KEVS: Enhancing Segmentation of Visceral Adipose Tissue in Pre-Cystectomy CT with Gaussian Kernel Density Estimation

    Authors: Thomas Boucher, Nicholas Tetlow, Annie Fung, Amy Dewar, Pietro Arina, Sven Kerneis, John Whittle, Evangelos B. Mazomenos

    Abstract: Purpose: The distribution of visceral adipose tissue (VAT) in cystectomy patients is indicative of the incidence of post-operative complications. Existing VAT segmentation methods for computed tomography (CT) employing intensity thresholding have limitations relating to inter-observer variability. Moreover, the difficulty in creating ground-truth masks limits the development of deep learning (DL)… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

    Comments: Preprint for submission to IPCAI special edition of IJCARS 2025, version prior to any peer review

  4. arXiv:2503.22437  [pdf, other

    cs.CV

    EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting

    Authors: Xu Wang, Shuai Zhang, Baoru Huang, Danail Stoyanov, Evangelos B. Mazomenos

    Abstract: Complete reconstruction of surgical scenes is crucial for robot-assisted surgery (RAS). Deep depth estimation is promising but existing works struggle with depth discontinuities, resulting in noisy predictions at object boundaries and do not achieve complete reconstruction omitting occluded surfaces. To address these issues we propose EndoLRMGS, that combines Large Reconstruction Modelling (LRM) a… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  5. arXiv:2503.22177  [pdf, other

    cs.RO cs.CV eess.IV

    3D Acetabular Surface Reconstruction from 2D Pre-operative X-ray Images using SRVF Elastic Registration and Deformation Graph

    Authors: Shuai Zhang, Jinliang Wang, Sujith Konandetails, Xu Wang, Danail Stoyanov, Evangelos B. Mazomenos

    Abstract: Accurate and reliable selection of the appropriate acetabular cup size is crucial for restoring joint biomechanics in total hip arthroplasty (THA). This paper proposes a novel framework that integrates square-root velocity function (SRVF)-based elastic shape registration technique with an embedded deformation (ED) graph approach to reconstruct the 3D articular surface of the acetabulum by fusing m… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

    Comments: 10 pages, 3 figures, conference

  6. arXiv:2503.10265  [pdf, other

    cs.AI cs.RO

    SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence

    Authors: Chang Han Low, Ziyue Wang, Tianyi Zhang, Zhitao Zeng, Zhu Zhuo, Evangelos B. Mazomenos, Yueming Jin

    Abstract: Integration of Vision-Language Models (VLMs) in surgical intelligence is hindered by hallucinations, domain knowledge gaps, and limited understanding of task interdependencies within surgical scenes, undermining clinical reliability. While recent VLMs demonstrate strong general reasoning and thinking capabilities, they still lack the domain expertise and task-awareness required for precise surgica… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  7. arXiv:2503.09474  [pdf, other

    cs.CV

    SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery

    Authors: Jiayuan Huang, Runlong He, Danyal Z. Khan, Evangelos Mazomenos, Danail Stoyanov, Hani J. Marcus, Matthew J. Clarkson, Mobarakol Islam

    Abstract: Image-guided surgery demands adaptive, real-time decision support, yet static AI models struggle with structured task planning and providing interactive guidance. Large vision-language models (VLMs) offer a promising solution by enabling dynamic task planning and predictive decision support. We introduce SurgicalVLM-Agent, an AI co-pilot for image-guided pituitary surgery, capable of conversation,… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: 11 pages

  8. arXiv:2503.07294  [pdf, other

    cs.CV cs.AI cs.LG

    Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification

    Authors: Thomas Boucher, Evangelos B. Mazomenos

    Abstract: Quantum vision transformers (QViTs) build on vision transformers (ViTs) by replacing linear layers within the self-attention mechanism with parameterised quantum neural networks (QNNs), harnessing quantum mechanical properties to improve feature representation. This hybrid approach aims to achieve superior performance, with significantly reduced model complexity as a result of the enriched feature… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Submitted for MICCAI 2025

  9. arXiv:2503.07204  [pdf, other

    cs.CV

    Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion

    Authors: Mona Sheikh Zeinoddin, Mobarakol Islam, Zafer Tandogdu, Greg Shaw, Mathew J. Clarkson, Evangelos Mazomenos, Danail Stoyanov

    Abstract: Accurate depth and camera pose estimation is essential for achieving high-quality 3D visualisations in robotic-assisted surgery. Despite recent advancements in foundation model adaptation to monocular depth estimation of endoscopic scenes via self-supervised learning (SSL), no prior work has explored their use for pose estimation. These methods rely on low rank-based adaptation approaches, which c… ▽ More

    Submitted 18 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

  10. arXiv:2502.14149  [pdf, other

    cs.CV cs.AI

    PitVQA++: Vector Matrix-Low-Rank Adaptation for Open-Ended Visual Question Answering in Pituitary Surgery

    Authors: Runlong He, Danyal Z. Khan, Evangelos B. Mazomenos, Hani J. Marcus, Danail Stoyanov, Matthew J. Clarkson, Mobarakol Islam

    Abstract: Vision-Language Models (VLMs) in visual question answering (VQA) offer a unique opportunity to enhance intra-operative decision-making, promote intuitive interactions, and significantly advancing surgical education. However, the development of VLMs for surgical VQA is challenging due to limited datasets and the risk of overfitting and catastrophic forgetting during full fine-tuning of pretrained w… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 9 pages

  11. arXiv:2408.17433  [pdf, other

    cs.CV

    DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model

    Authors: Mona Sheikh Zeinoddin, Chiara Lena, Jiongqi Qu, Luca Carlini, Mattia Magro, Seunghoi Kim, Elena De Momi, Sophia Bano, Matthew Grech-Sollars, Evangelos Mazomenos, Daniel C. Alexander, Danail Stoyanov, Matthew J. Clarkson, Mobarakol Islam

    Abstract: Robotic-assisted surgery (RAS) relies on accurate depth estimation for 3D reconstruction and visualization. While foundation models like Depth Anything Models (DAM) show promise, directly applying them to surgery often yields suboptimal results. Fully fine-tuning on limited surgical data can cause overfitting and catastrophic forgetting, compromising model robustness and generalization. Although L… ▽ More

    Submitted 21 October, 2024; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: 11 pages

  12. arXiv:2408.03208  [pdf, other

    cs.CV cs.AI cs.RO physics.med-ph

    Personalizing Federated Instrument Segmentation with Visual Trait Priors in Robotic Surgery

    Authors: Jialang Xu, Jiacheng Wang, Lequan Yu, Danail Stoyanov, Yueming Jin, Evangelos B. Mazomenos

    Abstract: Personalized federated learning (PFL) for surgical instrument segmentation (SIS) is a promising approach. It enables multiple clinical sites to collaboratively train a series of models in privacy, with each model tailored to the individual distribution of each site. Existing PFL methods rarely consider the personalization of multi-headed self-attention, and do not account for appearance diversity… ▽ More

    Submitted 15 August, 2024; v1 submitted 6 August, 2024; originally announced August 2024.

    Comments: 9 pages, 3 figures, under review

  13. arXiv:2406.19217  [pdf, other

    cs.CV cs.AI cs.RO

    Think Step by Step: Chain-of-Gesture Prompting for Error Detection in Robotic Surgical Videos

    Authors: Zhimin Shao, Jialang Xu, Danail Stoyanov, Evangelos B. Mazomenos, Yueming Jin

    Abstract: Despite significant advancements in robotic systems and surgical data science, ensuring safe and optimal execution in robot-assisted minimally invasive surgery (RMIS) remains a complex challenge. Current surgical error detection methods involve two parts: identifying surgical gestures and then detecting errors within each gesture clip. These methods seldom consider the rich contextual and semantic… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 12, pp. 11513-11520, Dec. 2024

  14. SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery

    Authors: Jialang Xu, Nazir Sirajudeen, Matthew Boal, Nader Francis, Danail Stoyanov, Evangelos Mazomenos

    Abstract: Automated detection of surgical errors can improve robotic-assisted surgery. Despite promising progress, existing methods still face challenges in capturing rich temporal context to establish long-term dependencies while maintaining computational efficiency. In this paper, we propose a novel hierarchical model named SEDMamba, which incorporates the selective state space model (SSM) into surgical e… ▽ More

    Submitted 29 November, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE RA-L

    Journal ref: IEEE Robotics and Automation Letters, vol. 10, no. 1, pp. 232-239, Jan. 2025

  15. arXiv:2303.09648  [pdf, other

    cs.CV

    Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery

    Authors: Jinfan Zhou, William Muirhead, Simon C. Williams, Danail Stoyanov, Hani J. Marcus, Evangelos B. Mazomenos

    Abstract: Purpose: Microsurgical Aneurysm Clipping Surgery (MACS) carries a high risk for intraoperative aneurysm rupture. Automated recognition of instances when the aneurysm is exposed in the surgical video would be a valuable reference point for neuronavigation, indicating phase transitioning and more importantly designating moments of high risk for rupture. This article introduces the MACS dataset conta… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  16. arXiv:2212.04448  [pdf, other

    cs.CV

    Objective Surgical Skills Assessment and Tool Localization: Results from the MICCAI 2021 SimSurgSkill Challenge

    Authors: Aneeq Zia, Kiran Bhattacharyya, Xi Liu, Ziheng Wang, Max Berniker, Satoshi Kondo, Emanuele Colleoni, Dimitris Psychogyios, Yueming Jin, Jinfan Zhou, Evangelos Mazomenos, Lena Maier-Hein, Danail Stoyanov, Stefanie Speidel, Anthony Jarc

    Abstract: Timely and effective feedback within surgical training plays a critical role in developing the skills required to perform safe and efficient surgery. Feedback from expert surgeons, while especially valuable in this regard, is challenging to acquire due to their typically busy schedules, and may be subject to biases. Formal assessment procedures like OSATS and GEARS attempt to provide objective mea… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.04071

  17. Towards a Computed-Aided Diagnosis System in Colonoscopy: Automatic Polyp Segmentation Using Convolution Neural Networks

    Authors: Patrick Brandao, Odysseas Zisimopoulos, Evangelos Mazomenos, Gastone Ciuti, Jorge Bernal, Marco Visentini-Scarzanella, Arianna Menciassi, Paolo Dario, Anastasios Koulaouzidis, Alberto Arezzo, David J Hawkes, Danail Stoyanov

    Abstract: Early diagnosis is essential for the successful treatment of bowel cancers including colorectal cancer (CRC) and capsule endoscopic imaging with robotic actuation can be a valuable diagnostic tool when combined with automated image analysis. We present a deep learning rooted detection and segmentation framework for recognizing lesions in colonoscopy and capsule endoscopy images. We restructure est… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

    Comments: 10 pages, 6 figures

    Journal ref: Journal of Medical Robotics Research, Volume 03, No. 02, 1840002 (2018) G

  18. arXiv:1806.05154  [pdf, other

    cs.CV

    Automated Performance Assessment in Transoesophageal Echocardiography with Convolutional Neural Networks

    Authors: Evangelos B. Mazomenos, Kamakshi Bansal, Bruce Martin, Andrew Smith, Susan Wright, Danail Stoyanov

    Abstract: Transoesophageal echocardiography (TEE) is a valuable diagnostic and monitoring imaging modality. Proper image acquisition is essential for diagnosis, yet current assessment techniques are solely based on manual expert review. This paper presents a supervised deep learn ing framework for automatically evaluating and grading the quality of TEE images. To obtain the necessary dataset, 38 participant… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: to be presented in MICCAI 2018, Granada, Spain, 16-20 Sep 2018

  19. arXiv:1711.00499  [pdf, other

    cs.CV

    Widening siamese architectures for stereo matching

    Authors: Patrick Brandao, Evangelos Mazomenos, Danail Stoyanov

    Abstract: Computational stereo is one of the classical problems in computer vision. Numerous algorithms and solutions have been reported in recent years focusing on developing methods for computing similarity, aggregating it to obtain spatial support and finally optimizing an energy function to find the final disparity. In this paper, we focus on the feature extraction component of stereo matching architect… ▽ More

    Submitted 1 November, 2017; originally announced November 2017.

    Comments: 7 pages, 4 figures

  20. arXiv:1611.09829  [pdf

    physics.med-ph nlin.CD stat.AP

    A Statistical Index for Early Diagnosis of Ventricular Arrhythmia from the Trend Analysis of ECG Phase-portraits

    Authors: Grazia Cappiello, Saptarshi Das, Evangelos B. Mazomenos, Koushik Maharatna, George Koulaouzidis, John Morgan, Paolo Emilio Puddu

    Abstract: In this paper, we propose a novel statistical index for the early diagnosis of ventricular arrhythmia (VA) using the time delay phase-space reconstruction (PSR) technique, from the electrocardiogram (ECG) signal. Patients with two classes of fatal VA - with preceding ventricular premature beats (VPBs) and with no VPBs have been analysed using extensive simulations. Three subclasses of VA with VPBs… ▽ More

    Submitted 29 November, 2016; originally announced November 2016.

    Comments: 25 pages, 16 figures, 2 tables

    Journal ref: Physiological Measurement, vol. 36, no. 1, pp. 107-131, January 2015

  21. A novel approach for the diagnosis of ventricular tachycardia based on phase space reconstruction of ECG

    Authors: George Koulaouzidis, Saptarshi Das, Grazia Cappiello, Evangelos B. Mazomenos, Koushik Maharatna, John Morgan

    Abstract: Ventricular arrhythmias comprise a group of disorders which manifest clinically in a variety of ways from ventricular premature beats (VPB) and no sustained ventricular tachycardia (in healthy subjects) to sudden cardiac death due to ventricular tachyarrhythmia in patients with and/or without structural heart disease. Ventricular fibrillation (VF) and ventricular tachycardia (VT) are the most comm… ▽ More

    Submitted 20 October, 2014; originally announced October 2014.

    Comments: 7 pages, 2 figures

    Journal ref: International Journal of Cardiology, Volume 172, Issue 1, March 2014, Pages e31-e33