Skip to main content

Showing 1–16 of 16 results for author: Kapoor, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.07851  [pdf, other

    eess.IV cs.AI cs.CV cs.RO

    Pose Estimation for Intra-cardiac Echocardiography Catheter via AI-Based Anatomical Understanding

    Authors: Jaeyoung Huh, Ankur Kapoor, Young-Ho Kim

    Abstract: Intra-cardiac Echocardiography (ICE) plays a crucial role in Electrophysiology (EP) and Structural Heart Disease (SHD) interventions by providing high-resolution, real-time imaging of cardiac structures. However, existing navigation methods rely on electromagnetic (EM) tracking, which is susceptible to interference and position drift, or require manual adjustments based on operator expertise. To o… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  2. arXiv:2505.05518  [pdf, other

    eess.IV cs.CV cs.RO

    Guidance for Intra-cardiac Echocardiography Manipulation to Maintain Continuous Therapy Device Tip Visibility

    Authors: Jaeyoung Huh, Ankur Kapoor, Young-Ho Kim

    Abstract: Intra-cardiac Echocardiography (ICE) plays a critical role in Electrophysiology (EP) and Structural Heart Disease (SHD) interventions by providing real-time visualization of intracardiac structures. However, maintaining continuous visibility of the therapy device tip remains a challenge due to frequent adjustments required during manual ICE catheter manipulation. To address this, we propose an AI-… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  3. arXiv:2505.00059  [pdf, other

    cs.CL cs.SD eess.AS

    BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition

    Authors: Paige Tuttösí, Mantaj Dhillon, Luna Sang, Shane Eastwood, Poorvi Bhatia, Quang Minh Dinh, Avni Kapoor, Yewon Jin, Angelica Lim

    Abstract: Some speech recognition tasks, such as automatic speech recognition (ASR), are approaching or have reached human performance in many reported metrics. Yet, they continue to struggle in complex, real-world, situations, such as with distanced speech. Previous challenges have released datasets to address the issue of distanced ASR, however, the focus remains primarily on distance, specifically relyin… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: Accepted to Computer Speech and Language, Special issue: Multi-Speaker, Multi-Microphone, and Multi-Modal Distant Speech Recognition (September 2025)

  4. arXiv:2407.16302  [pdf, other

    cs.CV eess.IV

    DeepClean: Integrated Distortion Identification and Algorithm Selection for Rectifying Image Corruptions

    Authors: Aditya Kapoor, Harshad Khadilkar, Jayvardhana Gubbi

    Abstract: Distortion identification and rectification in images and videos is vital for achieving good performance in downstream vision applications. Instead of relying on fixed trial-and-error based image processing pipelines, we propose a two-level sequential planning approach for automated image distortion classification and rectification. At the higher level it detects the class of corruptions present i… ▽ More

    Submitted 19 December, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures

  5. arXiv:2309.15750  [pdf, other

    eess.IV cs.CV

    Automated CT Lung Cancer Screening Workflow using 3D Camera

    Authors: Brian Teixeira, Vivek Singh, Birgi Tamersoy, Andreas Prokein, Ankur Kapoor

    Abstract: Despite recent developments in CT planning that enabled automation in patient positioning, time-consuming scout scans are still needed to compute dose profile and ensure the patient is properly positioned. In this paper, we present a novel method which eliminates the need for scout scans in CT lung cancer screening by estimating patient scan range, isocenter, and Water Equivalent Diameter (WED) fr… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted at MICCAI 2023

  6. arXiv:2209.11282  [pdf

    eess.IV cs.CV cs.LG

    Automated detection of Alzheimer disease using MRI images and deep neural networks- A review

    Authors: Narotam Singh, Patteshwari. D, Neha Soni, Amita Kapoor

    Abstract: Early detection of Alzheimer disease is crucial for deploying interventions and slowing the disease progression. A lot of machine learning and deep learning algorithms have been explored in the past decade with the aim of building an automated detection for Alzheimer. Advancements in data augmentation techniques and advanced deep learning architectures have opened up new frontiers in this field, a… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 22 Pages, 5 Figures, 7 Tables

  7. arXiv:2207.14419  [pdf, other

    cs.RO cs.LG eess.SY

    Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions

    Authors: Wenhao Luo, Wen Sun, Ashish Kapoor

    Abstract: Reinforcement Learning (RL) and continuous nonlinear control have been successfully deployed in multiple domains of complicated sequential decision-making tasks. However, given the exploration nature of the learning process and the presence of model uncertainty, it is challenging to apply them to safety-critical control tasks due to the lack of safety guarantee. On the other hand, while combining… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: The 15th International Workshop on the Algorithmic Foundations of Robotics (WAFR 2022)

  8. arXiv:2203.13411  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers

    Authors: Arthur Bucker, Luis Figueredo, Sami Haddadin, Ashish Kapoor, Shuang Ma, Rogerio Bonatti

    Abstract: Natural language is the most intuitive medium for us to interact with other people when expressing commands and instructions. However, using language is seldom an easy task when humans need to express their intent towards robots, since most of the current language interfaces require rigid templates with a static set of action targets and commands. In this work, we provide a flexible language-based… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  9. arXiv:2109.07428  [pdf, other

    cs.RO cs.CV eess.SP eess.SY

    A Wide-area, Low-latency, and Power-efficient 6-DoF Pose Tracking System for Rigid Objects

    Authors: Young-Ho Kim, Ankur Kapoor, Tommaso Mansi, Ali Kamen

    Abstract: Position sensitive detectors (PSDs) offer possibility to track single active marker's two (or three) degrees of freedom (DoF) position with a high accuracy, while having a fast response time with high update frequency and low latency, all using a very simple signal processing circuit. However they are not particularly suitable for 6-DoF object pose tracking system due to lack of orientation measur… ▽ More

    Submitted 10 January, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

  10. arXiv:2010.12983  [pdf

    eess.SY

    Sensor-Based Spreader Automation for Reducing Salt Use and Improving Safety

    Authors: Ayushmaan Aggarwal, Niharika Bhattacharjee, Aadi Bhattacharya, Raka Bose, Anshul Gupta, Deepta Gupta, Anuj Kapoor, Elina Rani

    Abstract: Over 30 million tons of deicing salt is applied on U.S. roads annually at a cost of roughly $1.2 billion and with significant negative environmental impact. Therefore, it is desirable to reduce salt use while maintaining winter road safety. Automatic adjustment of application rate in response to road, weather, traffic, and other conditions has the potential to achieve this goal. In the US, salt… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 18 pages, 6 figures. Presented at 2019 Transportation Research Board Annual Meeting

  11. arXiv:2005.04258  [pdf, other

    cs.CV cs.LG eess.IV

    View Invariant Human Body Detection and Pose Estimation from Multiple Depth Sensors

    Authors: Walid Bekhtaoui, Ruhan Sa, Brian Teixeira, Vivek Singh, Klaus Kirchberg, Yao-jen Chang, Ankur Kapoor

    Abstract: Point cloud based methods have produced promising results in areas such as 3D object detection in autonomous driving. However, most of the recent point cloud work focuses on single depth sensor data, whereas less work has been done on indoor monitoring applications, such as operation room monitoring in hospitals or indoor surveillance. In these scenarios multiple cameras are often used to tackle o… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

  12. arXiv:1912.09957  [pdf, other

    cs.RO eess.SY

    Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates

    Authors: Wenhao Luo, Wen Sun, Ashish Kapoor

    Abstract: Safety in terms of collision avoidance for multi-robot systems is a difficult challenge under uncertainty, non-determinism and lack of complete information. This paper aims to propose a collision avoidance method that accounts for both measurement uncertainty and motion uncertainty. In particular, we propose Probabilistic Safety Barrier Certificates (PrSBC) using Control Barrier Functions to defin… ▽ More

    Submitted 7 December, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: NeurIPS 2020

  13. arXiv:1811.03343  [pdf, other

    cs.CV eess.IV

    Repetitive Motion Estimation Network: Recover cardiac and respiratory signal from thoracic imaging

    Authors: Xiaoxiao Li, Vivek Singh, Yifan Wu, Klaus Kirchberg, James Duncan, Ankur Kapoor

    Abstract: Tracking organ motion is important in image-guided interventions, but motion annotations are not always easily available. Thus, we propose Repetitive Motion Estimation Network (RMEN) to recover cardiac and respiratory signals. It learns the spatio-temporal repetition patterns, embedding high dimensional motion manifolds to 1D vectors with partial motion phase boundary annotations. Compared with th… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted by NIPS workshop MED-NIPS 2018

  14. arXiv:1802.08678  [pdf, other

    eess.SY cs.LG cs.RO stat.ML

    Verifying Controllers Against Adversarial Examples with Bayesian Optimization

    Authors: Shromona Ghosh, Felix Berkenkamp, Gireeja Ranade, Shaz Qadeer, Ashish Kapoor

    Abstract: Recent successes in reinforcement learning have lead to the development of complex controllers for real-world robots. As these robots are deployed in safety-critical applications and interact with humans, it becomes critical to ensure safety in order to avoid causing harm. A first step in this direction is to test the controllers in simulation. To be able to do this, we need to capture what we mea… ▽ More

    Submitted 26 February, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: Proc. of the IEEE International Conference on Robotics and Automation, 2018

  15. arXiv:1705.05065  [pdf, other

    cs.RO cs.AI cs.CV eess.SY

    AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles

    Authors: Shital Shah, Debadeepta Dey, Chris Lovett, Ashish Kapoor

    Abstract: Developing and testing algorithms for autonomous vehicles in real world is an expensive and time consuming process. Also, in order to utilize recent advances in machine intelligence and deep learning we need to collect a large amount of annotated training data in a variety of conditions and environments. We present a new simulator built on Unreal Engine that offers physically and visually realisti… ▽ More

    Submitted 18 July, 2017; v1 submitted 15 May, 2017; originally announced May 2017.

    Comments: Accepted for Field and Service Robotics conference 2017 (FSR 2017)

    Report number: MSR-TR-2017-9

  16. arXiv:1510.07313  [pdf, other

    eess.SY cs.AI cs.LO cs.RO

    Safe Control under Uncertainty

    Authors: Dorsa Sadigh, Ashish Kapoor

    Abstract: Controller synthesis for hybrid systems that satisfy temporal specifications expressing various system properties is a challenging problem that has drawn the attention of many researchers. However, making the assumption that such temporal properties are deterministic is far from the reality. For example, many of the properties the controller has to satisfy are learned through machine learning tech… ▽ More

    Submitted 25 October, 2015; originally announced October 2015.

    Comments: 10 pages, 6 figures, Submitted to HSCC 2016