Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin

Killeen, Benjamin D.; Suresh, Anushri; Gomez, Catalina; Inigo, Blanca; Bailey, Christopher; Unberath, Mathias

arXiv:2412.08020 (cs)

[Submitted on 11 Dec 2024]

Abstract: Natural language offers a convenient, flexible interface for controlling robotic C-arm X-ray systems, making advanced functionality and controls accessible. However, enabling language interfaces requires specialized AI models that interpret X-ray images to create a semantic representation for reasoning. The fixed outputs of such AI models limit the functionality of language controls. Incorporating flexible, language-aligned AI models prompted through language enables more versatile interfaces for diverse tasks and procedures. Using a language-aligned foundation model for X-ray image segmentation, our system continually updates a patient digital twin based on sparse reconstructions of desired anatomical structures. This supports autonomous capabilities such as visualization, patient-specific viewfinding, and automatic collimation from novel viewpoints, enabling commands such as 'Focus in on the lower lumbar vertebrae.' In a cadaver study, users visualized, localized, and collimated structures across the torso using verbal commands, achieving 84% end-to-end success. Post hoc analysis of randomly oriented images showed our patient digital twin could localize 35 commonly requested structures to within 51.68 mm, enabling localization and isolation from arbitrary orientations. Our results demonstrate how intelligent robotic X-ray systems can incorporate physicians' expressed intent directly. While existing foundation models for intra-operative X-ray analysis exhibit failure modes, as they improve, they can facilitate highly flexible, intelligent robotic C-arms.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2412.08020 [cs.RO]
	(or arXiv:2412.08020v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2412.08020

Submission history

From: Benjamin Killeen
[v1] Wed, 11 Dec 2024 02:00:25 UTC (18,958 KB)
