Skip to main content

Showing 1–50 of 167 results for author: Jha, D

.
  1. arXiv:2509.26459  [pdf, ps, other

    cs.RO cs.CG

    Analytic Conditions for Differentiable Collision Detection in Trajectory Optimization

    Authors: Akshay Jaitly, Devesh K. Jha, Kei Ota, Yuki Shirai

    Abstract: Optimization-based methods are widely used for computing fast, diverse solutions for complex tasks such as collision-free movement or planning in the presence of contacts. However, most of these methods require enforcing non-penetration constraints between objects, resulting in a non-trivial and computationally expensive problem. This makes the use of optimization-based methods for planning and co… ▽ More

    Submitted 30 September, 2025; originally announced September 2025.

    Comments: 8 pages, 8 figures. Accepted to the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2025

  2. arXiv:2509.19350  [pdf, ps, other

    cs.NI

    TinyAC: Bringing Autonomic Computing Principles to Resource-Constrained Systems

    Authors: Wojciech Kalka, Ruitao Xue, Kamil Faber, Aleksander Slominski, Devki Jha, Rajiv Ranjan, Tomasz Szydlo

    Abstract: Autonomic Computing (AC) is a promising approach for developing intelligent and adaptive self-management systems at the deep network edge. In this paper, we present the problems and challenges related to the use of AC for IoT devices. Our proposed hybrid approach bridges bottom-up intelligence (TinyML and on-device learning) and top-down guidance (LLMs) to achieve a scalable and explainable approa… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

  3. arXiv:2508.15021  [pdf, ps, other

    cs.RO

    In-Context Iterative Policy Improvement for Dynamic Manipulation

    Authors: Mark Van der Merwe, Devesh Jha

    Abstract: Attention-based architectures trained on internet-scale language data have demonstrated state of the art reasoning ability for various language-based tasks, such as logic problems and textual reasoning. Additionally, these Large Language Models (LLMs) have exhibited the ability to perform few-shot prediction via in-context learning, in which input-output examples provided in the prompt are general… ▽ More

    Submitted 20 August, 2025; originally announced August 2025.

    Comments: 14 pages. Accepted at CoRL 2025

  4. arXiv:2508.12410  [pdf, ps, other

    cs.CV cs.AI

    SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes

    Authors: Jun Zeng, Yannan Huang, Elif Keles, Halil Ertugrul Aktas, Gorkem Durak, Nikhil Kumar Tomar, Quoc-Huy Trinh, Deepak Ranjan Nayak, Ulas Bagci, Debesh Jha

    Abstract: Liver Cirrhosis plays a critical role in the prognosis of chronic liver disease. Early detection and timely intervention are critical in significantly reducing mortality rates. However, the intricate anatomical architecture and diverse pathological changes of liver tissue complicate the accurate detection and characterization of lesions in clinical settings. Existing methods underutilize the spati… ▽ More

    Submitted 19 August, 2025; v1 submitted 17 August, 2025; originally announced August 2025.

    Comments: 9 pages, 4 figures

  5. arXiv:2508.07028  [pdf, ps, other

    cs.CV

    Large Language Model Evaluated Stand-alone Attention-Assisted Graph Neural Network with Spatial and Structural Information Interaction for Precise Endoscopic Image Segmentation

    Authors: Juntong Fan, Shuyi Fan, Debesh Jha, Changsheng Fang, Tieyong Zeng, Hengyong Yu, Dayang Wang

    Abstract: Accurate endoscopic image segmentation on the polyps is critical for early colorectal cancer detection. However, this task remains challenging due to low contrast with surrounding mucosa, specular highlights, and indistinct boundaries. To address these challenges, we propose FOCUS-Med, which stands for Fusion of spatial and structural graph with attentional context-aware polyp segmentation in endo… ▽ More

    Submitted 9 August, 2025; originally announced August 2025.

    Comments: Manuscript under review

  6. arXiv:2508.01082  [pdf, ps, other

    cs.RO cs.AI cs.LG eess.SY

    Learning Pivoting Manipulation with Force and Vision Feedback Using Optimization-based Demonstrations

    Authors: Yuki Shirai, Kei Ota, Devesh K. Jha, Diego Romeres

    Abstract: Non-prehensile manipulation is challenging due to complex contact interactions between objects, the environment, and robots. Model-based approaches can efficiently generate complex trajectories of robots and objects under contact constraints. However, they tend to be sensitive to model inaccuracies and require access to privileged information (e.g., object mass, size, pose), making them less suita… ▽ More

    Submitted 5 August, 2025; v1 submitted 1 August, 2025; originally announced August 2025.

  7. arXiv:2507.01509  [pdf, ps, other

    cs.CV cs.LG

    Mamba Guided Boundary Prior Matters: A New Perspective for Generalized Polyp Segmentation

    Authors: Tapas K. Dutta, Snehashis Majhi, Deepak Ranjan Nayak, Debesh Jha

    Abstract: Polyp segmentation in colonoscopy images is crucial for early detection and diagnosis of colorectal cancer. However, this task remains a significant challenge due to the substantial variations in polyp shape, size, and color, as well as the high similarity between polyps and surrounding tissues, often compounded by indistinct boundaries. While existing encoder-decoder CNN and transformer-based app… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 11 pages, 2 figures, MICCAI-2025

  8. arXiv:2506.22221  [pdf, ps, other

    math.OC math.AP

    Memory-Type Null Controllability of Heat Equations with Delay Effects

    Authors: Dev Prakash Jha, Raju K. George

    Abstract: This article is devoted to the study of null controllability for evolution equations that incorporate both memory and delay effects. The problem is particularly challenging due to the presence of memory integrals and delayed states, which necessitate strengthening the classical controllability requirement to ensure complete rest at the final time. To address this, we adopt the notion of Delay and… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 26 pages,4 figures

  9. arXiv:2505.16547  [pdf, ps, other

    cs.RO cs.AI

    Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation

    Authors: Nitesh Subedi, Hsin-Jung Yang, Devesh K. Jha, Soumik Sarkar

    Abstract: Autonomous harvesting in the open presents a complex manipulation problem. In most scenarios, an autonomous system has to deal with significant occlusion and require interaction in the presence of large structural uncertainties (every plant is different). Perceptual and modeling uncertainty make design of reliable manipulation controllers for harvesting challenging, resulting in poor performance d… ▽ More

    Submitted 30 September, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: 9 Pages, 3 Figures, 1 Table

  10. arXiv:2505.11872  [pdf, ps, other

    cs.CV

    PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging

    Authors: Quoc-Huy Trinh, Minh-Van Nguyen, Jung Zeng, Ulas Bagci, Debesh Jha

    Abstract: Recent advancements in prompt-based medical image segmentation have enabled clinicians to identify tumors using simple input like bounding boxes or text prompts. However, existing methods face challenges when doctors need to interact through natural language or when position reasoning is required - understanding spatial relationships between anatomical structures and pathologies. We present PRS-Me… ▽ More

    Submitted 14 August, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

  11. arXiv:2505.07755  [pdf, ps, other

    cs.DC cs.AI

    Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems

    Authors: Tomasz Szydlo, Viacheslaw Horbanow, Dev Nandan Jha, Shashikant Ilager, Aleksander Slominski, Rajiv Ranjan

    Abstract: Edge computing has emerged as a pivotal technology, offering significant advantages such as low latency, enhanced data security, and reduced reliance on centralized cloud infrastructure. These benefits are crucial for applications requiring real-time data processing or strict security measures. Despite these advantages, edge devices operating within edge clusters are often underutilized. This inef… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  12. arXiv:2504.14573  [pdf, other

    cs.RO cs.AI

    Modality Selection and Skill Segmentation via Cross-Modality Attention

    Authors: Jiawei Jiang, Kei Ota, Devesh K. Jha, Asako Kanezaki

    Abstract: Incorporating additional sensory modalities such as tactile and audio into foundational robotic models poses significant challenges due to the curse of dimensionality. This work addresses this issue through modality selection. We propose a cross-modality attention (CMA) mechanism to identify and selectively utilize the modalities that are most informative for action generation at each timestep. Fu… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

  13. arXiv:2504.13597  [pdf, other

    eess.IV cs.AI cs.CV

    FocusNet: Transformer-enhanced Polyp Segmentation with Local and Pooling Attention

    Authors: Jun Zeng, KC Santosh, Deepak Rajan Nayak, Thomas de Lange, Jonas Varkey, Tyler Berzin, Debesh Jha

    Abstract: Colonoscopy is vital in the early diagnosis of colorectal polyps. Regular screenings can effectively prevent benign polyps from progressing to CRC. While deep learning has made impressive strides in polyp segmentation, most existing models are trained on single-modality and single-center data, making them less effective in real-world clinical environments. To overcome these limitations, we propose… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: 9 pages, 6 figures

  14. arXiv:2504.09611  [pdf, ps, other

    math.OC

    An Operator-Theoretic Framework for the Optimal Control Problem of Nonlinear Caputo Fractional Systems

    Authors: Dev Prakash Jha, Raju K. George

    Abstract: This paper addresses the optimal control problem for a class of nonlinear fractional systems involving Caputo derivatives and nonlocal initial conditions. The system is reformulated as an abstract Hammerstein-type operator equation, enabling the application of operator-theoretic techniques. Sufficient conditions are established to guarantee the existence of mild solutions and optimal control-state… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: 29 pages,2 figures

  15. Hierarchical Contact-Rich Trajectory Optimization for Multi-Modal Manipulation using Tight Convex Relaxations

    Authors: Yuki Shirai, Arvind Raghunathan, Devesh K. Jha

    Abstract: Designing trajectories for manipulation through contact is challenging as it requires reasoning of object \& robot trajectories as well as complex contact sequences simultaneously. In this paper, we present a novel framework for simultaneously designing trajectories of robots, objects, and contacts efficiently for contact-rich manipulation. We propose a hierarchical optimization framework where Mi… ▽ More

    Submitted 11 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

    Comments: 2025 IEEE International Conference on Robotics and Automation (2025 ICRA)

  16. arXiv:2502.18232  [pdf, other

    eess.IV cs.AI cs.CV

    A Reverse Mamba Attention Network for Pathological Liver Segmentation

    Authors: Jun Zeng, Debesh Jha, Ertugrul Aktas, Elif Keles, Alpay Medetalibeyoglu, Matthew Antalek, Robert Lewandowski, Daniela Ladner, Amir A. Borhani, Gorkem Durak, Ulas Bagci

    Abstract: We present RMA-Mamba, a novel architecture that advances the capabilities of vision state space models through a specialized reverse mamba attention module (RMA). The key innovation lies in RMA-Mamba's ability to capture long-range dependencies while maintaining precise local feature representation through its hierarchical processing pipeline. By integrating Vision Mamba (VMamba)'s efficient seque… ▽ More

    Submitted 5 March, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

    Comments: 8 pages, 3 figures

  17. arXiv:2502.18225  [pdf, other

    eess.IV cs.AI cs.CV

    Liver Cirrhosis Stage Estimation from MRI with Deep Learning

    Authors: Jun Zeng, Debesh Jha, Ertugrul Aktas, Elif Keles, Alpay Medetalibeyoglu, Matthew Antalek, Federica Proietto Salanitri, Amir A. Borhani, Daniela P. Ladner, Gorkem Durak, Ulas Bagci

    Abstract: We present an end-to-end deep learning framework for automated liver cirrhosis stage estimation from multi-sequence MRI. Cirrhosis is the severe scarring (fibrosis) of the liver and a common endpoint of various chronic liver diseases. Early diagnosis is vital to prevent complications such as decompensation and cancer, which significantly decreases life expectancy. However, diagnosing cirrhosis in… ▽ More

    Submitted 22 May, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

    Comments: 7 pages, 1 figure

  18. arXiv:2502.05444  [pdf, other

    eess.IV cs.CV

    Diverse Image Generation with Diffusion Models and Cross Class Label Learning for Polyp Classification

    Authors: Vanshali Sharma, Debesh Jha, M. K. Bhuyan, Pradip K. Das, Ulas Bagci

    Abstract: Pathologic diagnosis is a critical phase in deciding the optimal treatment procedure for dealing with colorectal cancer (CRC). Colonic polyps, precursors to CRC, can pathologically be classified into two major types: adenomatous and hyperplastic. For precise classification and early diagnosis of such polyps, the medical procedure of colonoscopy has been widely adopted paired with various imaging t… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  19. arXiv:2502.05229  [pdf, other

    cs.CV

    L2GNet: Optimal Local-to-Global Representation of Anatomical Structures for Generalized Medical Image Segmentation

    Authors: Vandan Gorade, Sparsh Mittal, Neethi Dasu, Rekha Singhal, KC Santosh, Debesh Jha

    Abstract: Continuous Latent Space (CLS) and Discrete Latent Space (DLS) models, like AttnUNet and VQUNet, have excelled in medical image segmentation. In contrast, Synergistic Continuous and Discrete Latent Space (CDLS) models show promise in handling fine and coarse-grained information. However, they struggle with modeling long-range dependencies. CLS or CDLS-based models, such as TransUNet or SynergyNet a… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  20. arXiv:2501.17652  [pdf, ps, other

    math.OC math.AP

    Approximate Controllability of Fractional Evolution Equations with Nonlocal Conditions via Operator Theory

    Authors: Dev Prakash Jha, Raju K George

    Abstract: This paper investigates the existence and uniqueness of mild solutions, as well as the approximate controllability, of a class of fractional evolution equations with nonlocal conditions in Hilbert spaces. Sufficient conditions for approximate controllability are established through a novel approach to the approximate solvability of semilinear operator equations. The methodology utilizes Green's fu… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Comments: 22 pages

  21. arXiv:2501.08259  [pdf, other

    cs.RO cs.LG

    FDPP: Fine-tune Diffusion Policy with Human Preference

    Authors: Yuxin Chen, Devesh K. Jha, Masayoshi Tomizuka, Diego Romeres

    Abstract: Imitation learning from human demonstrations enables robots to perform complex manipulation tasks and has recently witnessed huge success. However, these techniques often struggle to adapt behavior to new preferences or changes in the environment. To address these limitations, we propose Fine-tuning Diffusion Policy with Human Preference (FDPP). FDPP learns a reward function through preference-bas… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

  22. arXiv:2501.04104  [pdf, other

    eess.SY cs.CR

    Security by Design Issues in Autonomous Vehicles

    Authors: Martin Higgins, Devki Jha, David Blundell, David Wallom

    Abstract: As autonomous vehicle (AV) technology advances towards maturity, it becomes imperative to examine the security vulnerabilities within these cyber-physical systems. While conventional cyber-security concerns are often at the forefront of discussions, it is essential to get deeper into the various layers of vulnerability that are often overlooked within mainstream frameworks. Our goal is to spotligh… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  23. arXiv:2412.08482  [pdf, other

    cs.CV

    SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation

    Authors: Tapas Kumar Dutta, Snehashis Majhi, Deepak Ranjan Nayak, Debesh Jha

    Abstract: Polyp segmentation in colonoscopy is crucial for detecting colorectal cancer. However, it is challenging due to variations in the structure, color, and size of polyps, as well as the lack of clear boundaries with surrounding tissues. Traditional segmentation models based on Convolutional Neural Networks (CNNs) struggle to capture detailed patterns and global context, limiting their performance. Vi… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  24. arXiv:2410.16296  [pdf, other

    eess.IV cs.CV

    Large Scale MRI Collection and Segmentation of Cirrhotic Liver

    Authors: Debesh Jha, Onkar Kishor Susladkar, Vandan Gorade, Elif Keles, Matthew Antalek, Deniz Seyithanoglu, Timurhan Cebeci, Halil Ertugrul Aktas, Gulbiz Dagoglu Kartal, Sabahattin Kaymakoglu, Sukru Mehmet Erturk, Yuri Velichko, Daniela Ladner, Amir A. Borhani, Alpay Medetalibeyoglu, Gorkem Durak, Ulas Bagci

    Abstract: Liver cirrhosis represents the end stage of chronic liver disease, characterized by extensive fibrosis and nodular regeneration that significantly increases mortality risk. While magnetic resonance imaging (MRI) offers a non-invasive assessment, accurately segmenting cirrhotic livers presents substantial challenges due to morphological alterations and heterogeneous signal characteristics. Deep lea… ▽ More

    Submitted 7 May, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

  25. arXiv:2410.13979  [pdf, other

    cs.RO cs.AI

    RecoveryChaining: Learning Local Recovery Policies for Robust Manipulation

    Authors: Shivam Vats, Devesh K. Jha, Maxim Likhachev, Oliver Kroemer, Diego Romeres

    Abstract: Model-based planners and controllers are commonly used to solve complex manipulation problems as they can efficiently optimize diverse objectives and generalize to long horizon tasks. However, they often fail during deployment due to noisy actuation, partial observability and imperfect models. To enable a robot to recover from such failures, we propose to use hierarchical reinforcement learning to… ▽ More

    Submitted 7 March, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: Added Lazy RecoveryChaining algorithm. 8 pages, 9 figures

  26. arXiv:2410.02044  [pdf, other

    eess.IV

    Frequency-Based Federated Domain Generalization for Polyp Segmentation

    Authors: Hongyi Pan, Debesh Jha, Koushik Biswas, Ulas Bagci

    Abstract: Federated Learning (FL) offers a powerful strategy for training machine learning models across decentralized datasets while maintaining data privacy, yet domain shifts among clients can degrade performance, particularly in medical imaging tasks like polyp segmentation. This paper introduces a novel Frequency-Based Domain Generalization (FDG) framework, utilizing soft-thresholding and hard-threshol… ▽ More

    Submitted 27 December, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: This paper has been accepted to ICASSP 2025

  27. arXiv:2409.16087  [pdf, ps, other

    math.OC

    Exact Null Controllability of Non-Autonomous Conformable Fractional Semi-Linear Systems with Nonlocal Conditions

    Authors: Dev Prakash Jha, Raju K. George

    Abstract: We study the exact null controllability of a class of non-autonomous conformable fractional semi-linear evolution systems with nonlocal initial conditions in Hilbert spaces. The analysis is carried out within the framework of conformable fractional calculus and linear evolution operator theory. Under suitable assumptions, we establish the existence of mild solutions and provide sufficient conditio… ▽ More

    Submitted 20 April, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 24 pages, 0 figure. arXiv admin note: text overlap with arXiv:2408.13814

  28. arXiv:2409.05875  [pdf, other

    cs.CV

    Transformer-Enhanced Iterative Feedback Mechanism for Polyp Segmentation

    Authors: Nikhil Kumar Tomar, Debesh Jha, Koushik Biswas, Tyler M. Berzin, Rajesh Keswani, Michael Wallace, Ulas Bagci

    Abstract: Colorectal cancer (CRC) is the third most common cause of cancer diagnosed in the United States and the second leading cause of cancer-related death among both genders. Notably, CRC is the leading cause of cancer in younger men less than 50 years old. Colonoscopy is considered the gold standard for the early diagnosis of CRC. Skills vary significantly among endoscopists, and a high miss rate is re… ▽ More

    Submitted 24 August, 2024; originally announced September 2024.

  29. arXiv:2409.00045  [pdf, other

    cs.CV

    PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in Colonoscopy

    Authors: Debesh Jha, Nikhil Kumar Tomar, Vanshali Sharma, Quoc-Huy Trinh, Koushik Biswas, Hongyi Pan, Ritika K. Jha, Gorkem Durak, Alexander Hann, Jonas Varkey, Hang Viet Dao, Long Van Dao, Binh Phuc Nguyen, Nikolaos Papachrysos, Brandon Rieders, Peter Thelin Schmidt, Enrik Geissler, Tyler Berzin, Pål Halvorsen, Michael A. Riegler, Thomas de Lange, Ulas Bagci

    Abstract: Colonoscopy is the primary method for examination, detection, and removal of polyps. However, challenges such as variations among the endoscopists' skills, bowel quality preparation, and the complex nature of the large intestine contribute to high polyp miss-rate. These missed polyps can develop into cancer later, underscoring the importance of improving the detection methods. To address this gap… ▽ More

    Submitted 3 January, 2025; v1 submitted 19 August, 2024; originally announced September 2024.

    Comments: 3 Figures, 6 tables

  30. arXiv:2408.13814  [pdf, ps, other

    math.OC math.AP

    Existence and uniqueness of mild solutions and evolution operators for a class of non-autonomous conformable fractional semi-linear systems and Their Exact Null Controllability

    Authors: Dev Prakash Jha, Raju K George

    Abstract: This paper investigates the controllability of systems governed by conformable fractional order derivatives. It first establishes the existence and uniqueness of evolution operators for non-autonomous fractional-order homogeneous systems, using a suitable initial time defined as the intersection of two specific time intervals. Using the theory of linear evolution operators, Schauder's fixed-point… ▽ More

    Submitted 8 February, 2025; v1 submitted 25 August, 2024; originally announced August 2024.

    Comments: 29 Pages, 0 figures

  31. arXiv:2408.10733  [pdf, other

    eess.IV cs.CV

    Classification of Endoscopy and Video Capsule Images using CNN-Transformer Model

    Authors: Aliza Subedi, Smriti Regmi, Nisha Regmi, Bhumi Bhusal, Ulas Bagci, Debesh Jha

    Abstract: Gastrointestinal cancer is a leading cause of cancer-related incidence and death, making it crucial to develop novel computer-aided diagnosis systems for early detection and enhanced treatment. Traditional approaches rely on the expertise of gastroenterologists to identify diseases; however, this process is subjective, and interpretation can vary even among expert clinicians. Considering recent ad… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  32. arXiv:2408.05692  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation

    Authors: Koushik Biswas, Ridal Pal, Shaswat Patel, Debesh Jha, Meghana Karri, Amit Reza, Gorkem Durak, Alpay Medetalibeyoglu, Matthew Antalek, Yury Velichko, Daniela Ladner, Amir Borhani, Ulas Bagci

    Abstract: Accurately segmenting different organs from medical images is a critical prerequisite for computer-assisted diagnosis and intervention planning. This study proposes a deep learning-based approach for segmenting various organs from CT and MRI scans and classifying diseases. Our study introduces a novel technique integrating momentum within residual blocks for enhanced training dynamics in medical i… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: 8 pages

  33. arXiv:2408.04491  [pdf, other

    cs.CV cs.AI

    Towards Synergistic Deep Learning Models for Volumetric Cirrhotic Liver Segmentation in MRIs

    Authors: Vandan Gorade, Onkar Susladkar, Gorkem Durak, Elif Keles, Ertugrul Aktas, Timurhan Cebeci, Alpay Medetalibeyoglu, Daniela Ladner, Debesh Jha, Ulas Bagci

    Abstract: Liver cirrhosis, a leading cause of global mortality, requires precise segmentation of ROIs for effective disease monitoring and treatment planning. Existing segmentation models often fail to capture complex feature interactions and generalize across diverse datasets. To address these limitations, we propose a novel synergistic theory that leverages complementary latent spaces for enhanced feature… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  34. arXiv:2407.16976  [pdf, other

    cs.RO

    Simultaneous Trajectory Optimization and Contact Selection for Contact-rich Manipulation with High-Fidelity Geometry

    Authors: Mengchao Zhang, Devesh K. Jha, Arvind U. Raghunathan, Kris Hauser

    Abstract: Contact-implicit trajectory optimization (CITO) is an effective method to plan complex trajectories for various contact-rich systems including manipulation and locomotion. CITO formulates a mathematical program with complementarity constraints (MPCC) that enforces that contact forces must be zero when points are not in contact. However, MPCC solve times increase steeply with the number of allowabl… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.06465

  35. arXiv:2406.14819  [pdf, other

    cs.CV

    SAM-EG: Segment Anything Model with Egde Guidance framework for efficient Polyp Segmentation

    Authors: Quoc-Huy Trinh, Hai-Dang Nguyen, Bao-Tram Nguyen Ngoc, Debesh Jha, Ulas Bagci, Minh-Triet Tran

    Abstract: Polyp segmentation, a critical concern in medical imaging, has prompted numerous proposed methods aimed at enhancing the quality of segmented masks. While current state-of-the-art techniques produce impressive results, the size and computational cost of these models pose challenges for practical industry applications. Recently, the Segment Anything Model (SAM) has been proposed as a robust foundat… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  36. arXiv:2406.11868  [pdf, other

    cs.CY cs.AI

    Ethical Framework for Responsible Foundational Models in Medical Imaging

    Authors: Abhijit Das, Debesh Jha, Jasmer Sanjotra, Onkar Susladkar, Suramyaa Sarkar, Ashish Rauniyar, Nikhil Tomar, Vanshali Sharma, Ulas Bagci

    Abstract: Foundational models (FMs) have tremendous potential to revolutionize medical imaging. However, their deployment in real-world clinical settings demands extensive ethical considerations. This paper aims to highlight the ethical concerns related to FMs and propose a framework to guide their responsible development and implementation within medicine. We meticulously examine ethical issues such as pri… ▽ More

    Submitted 13 April, 2024; originally announced June 2024.

  37. arXiv:2406.05331  [pdf, other

    cs.RO

    Autonomous Robotic Assembly: From Part Singulation to Precise Assembly

    Authors: Kei Ota, Devesh K. Jha, Siddarth Jain, Bill Yerazunis, Radu Corcodel, Yash Shukla, Antonia Bronars, Diego Romeres

    Abstract: Imagine a robot that can assemble a functional product from the individual parts presented in any configuration to the robot. Designing such a robotic system is a complex problem which presents several open challenges. To bypass these challenges, the current generation of assembly systems is built with a lot of system integration effort to provide the structure and precision necessary for assembly… ▽ More

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Under submission

  38. arXiv:2405.16740  [pdf, other

    cs.CV

    PP-SAM: Perturbed Prompts for Robust Adaptation of Segment Anything Model for Polyp Segmentation

    Authors: Md Mostafijur Rahman, Mustafa Munir, Debesh Jha, Ulas Bagci, Radu Marculescu

    Abstract: The Segment Anything Model (SAM), originally designed for general-purpose segmentation tasks, has been used recently for polyp segmentation. Nonetheless, fine-tuning SAM with data from new imaging centers or clinics poses significant challenges. This is because this necessitates the creation of an expensive and time-intensive annotated dataset, along with the potential for variability in user prom… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 7 pages, 9 figures, Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops

  39. arXiv:2405.12367  [pdf, other

    eess.IV cs.CV

    Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning

    Authors: Zheyuan Zhang, Elif Keles, Gorkem Durak, Yavuz Taktak, Onkar Susladkar, Vandan Gorade, Debesh Jha, Asli C. Ormeci, Alpay Medetalibeyoglu, Lanhong Yao, Bin Wang, Ilkin Sevgi Isler, Linkai Peng, Hongyi Pan, Camila Lopes Vendrami, Amir Bourhani, Yury Velichko, Boqing Gong, Concetto Spampinato, Ayis Pyrros, Pallavi Tiwari, Derk C. F. Klatte, Megan Engels, Sanne Hoogenboom, Candice W. Bolan , et al. (13 additional authors not shown)

    Abstract: Automated volumetric segmentation of the pancreas on cross-sectional imaging is needed for diagnosis and follow-up of pancreatic diseases. While CT-based pancreatic segmentation is more established, MRI-based segmentation methods are understudied, largely due to a lack of publicly available datasets, benchmarking research efforts, and domain-specific deep learning methods. In this retrospective st… ▽ More

    Submitted 24 October, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Peer-reviewer version

  40. arXiv:2405.06166  [pdf, other

    eess.IV cs.CV

    MDNet: Multi-Decoder Network for Abdominal CT Organs Segmentation

    Authors: Debesh Jha, Nikhil Kumar Tomar, Koushik Biswas, Gorkem Durak, Matthew Antalek, Zheyuan Zhang, Bin Wang, Md Mostafijur Rahman, Hongyi Pan, Alpay Medetalibeyoglu, Yury Velichko, Daniela Ladner, Amir Borhani, Ulas Bagci

    Abstract: Accurate segmentation of organs from abdominal CT scans is essential for clinical applications such as diagnosis, treatment planning, and patient monitoring. To handle challenges of heterogeneity in organ shapes, sizes, and complex anatomical relationships, we propose a \textbf{\textit{\ac{MDNet}}}, an encoder-decoder network that uses the pre-trained \textit{MiT-B2} as the encoder and multiple di… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  41. arXiv:2405.01503  [pdf, other

    eess.IV cs.CV

    PAM-UNet: Shifting Attention on Region of Interest in Medical Images

    Authors: Abhijit Das, Debesh Jha, Vandan Gorade, Koushik Biswas, Hongyi Pan, Zheyuan Zhang, Daniela P. Ladner, Yury Velichko, Amir Borhani, Ulas Bagci

    Abstract: Computer-aided segmentation methods can assist medical personnel in improving diagnostic outcomes. While recent advancements like UNet and its variants have shown promise, they face a critical challenge: balancing accuracy with computational efficiency. Shallow encoder architectures in UNets often struggle to capture crucial spatial features, leading in inaccurate and sparse segmentation. To addre… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted at 2024 IEEE EMBC

  42. arXiv:2404.17064  [pdf, other

    eess.IV cs.CV

    Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics Techniques

    Authors: Ziliang Hong, Debesh Jha, Koushik Biswas, Zheyuan Zhang, Yury Velichko, Cemal Yazici, Temel Tirkes, Amir Borhani, Baris Turkbey, Alpay Medetalibeyoglu, Gorkem Durak, Ulas Bagci

    Abstract: Identifying peri-pancreatic edema is a pivotal indicator for identifying disease progression and prognosis, emphasizing the critical need for accurate detection and assessment in pancreatitis diagnosis and management. This study \textit{introduces a novel CT dataset sourced from 255 patients with pancreatic diseases, featuring annotated pancreas segmentation masks and corresponding diagnostic labe… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  43. arXiv:2404.07533  [pdf, other

    cs.LG cs.AI cs.ET

    Exploring the Decentraland Economy: Multifaceted Parcel Attributes, Key Insights, and Benchmarking

    Authors: Dipika Jha, Ankit K. Bhagat, Raju Halder, Rajendra N. Paramanik, Chandra M. Kumar

    Abstract: This paper presents a comprehensive Decentraland parcels dataset, called IITP-VDLand, sourced from diverse platforms such as Decentraland, OpenSea, Etherscan, Google BigQuery, and various Social Media Platforms. Unlike existing datasets which have limited attributes and records, IITP-VDLand offers a rich array of attributes, encompassing parcel characteristics, trading history, past activities, tr… ▽ More

    Submitted 2 March, 2025; v1 submitted 11 April, 2024; originally announced April 2024.

  44. arXiv:2403.18960  [pdf, other

    cs.RO

    Robust In-Hand Manipulation with Extrinsic Contacts

    Authors: Boyuan Liang, Kei Ota, Masayoshi Tomizuka, Devesh Jha

    Abstract: We present in-hand manipulation tasks where a robot moves an object in grasp, maintains its external contact mode with the environment, and adjusts its in-hand pose simultaneously. The proposed manipulation task leads to complex contact interactions which can be very susceptible to uncertainties in kinematic and physical parameters. Therefore, we propose a robust in-hand manipulation method, which… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at ICRA 24

  45. arXiv:2403.06961  [pdf, other

    cs.CV

    Explainable Transformer Prototypes for Medical Diagnoses

    Authors: Ugur Demir, Debesh Jha, Zheyuan Zhang, Elif Keles, Bradley Allen, Aggelos K. Katsaggelos, Ulas Bagci

    Abstract: Deployments of artificial intelligence in medical diagnostics mandate not just accuracy and efficacy but also trust, emphasizing the need for explainability in machine decisions. The recent trend in automated medical image diagnostics leans towards the deployment of Transformer-based architectures, credited to their impressive capabilities. Since the self-attention feature of transformers contribu… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  46. arXiv:2402.02453  [pdf, other

    cs.CV

    AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art

    Authors: Faizan Farooq Khan, Diana Kim, Divyansh Jha, Youssef Mohamed, Hanna H Chang, Ahmed Elgammal, Luba Elliott, Mohamed Elhoseiny

    Abstract: Discovering the creative potentials of a random signal to various artistic expressions in aesthetic and conceptual richness is a ground for the recent success of generative machine learning as a way of art creation. To understand the new artistic medium better, we conduct a comprehensive analysis to position AI-generated art within the context of human art heritage. Our comparative analysis is bas… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  47. arXiv:2401.10373  [pdf, other

    eess.IV cs.CV cs.LG

    Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation

    Authors: Vandan Gorade, Sparsh Mittal, Debesh Jha, Rekha Singhal, Ulas Bagci

    Abstract: Deep learning has demonstrated remarkable achievements in medical image segmentation. However, prevailing deep learning models struggle with poor generalization due to (i) intra-class variations, where the same class appears differently in different samples, and (ii) inter-class independence, resulting in difficulties capturing intricate relationships between distinct objects, leading to higher fa… ▽ More

    Submitted 8 August, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Early Accepted at ICPR-2024 for Oral Presentation

  48. arXiv:2401.09630  [pdf, other

    eess.IV cs.CV

    CT Liver Segmentation via PVT-based Encoding and Refined Decoding

    Authors: Debesh Jha, Nikhil Kumar Tomar, Koushik Biswas, Gorkem Durak, Alpay Medetalibeyoglu, Matthew Antalek, Yury Velichko, Daniela Ladner, Amir Borhani, Ulas Bagci

    Abstract: Accurate liver segmentation from CT scans is essential for effective diagnosis and treatment planning. Computer-aided diagnosis systems promise to improve the precision of liver disease diagnosis, disease progression, and treatment planning. In response to the need, we propose a novel deep learning approach, \textit{\textbf{PVTFormer}}, that is built upon a pretrained pyramid vision transformer (P… ▽ More

    Submitted 20 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  49. arXiv:2401.02883  [pdf, other

    cs.RO eess.SY

    iPolicy: Incremental Policy Algorithms for Feedback Motion Planning

    Authors: Guoxiang Zhao, Devesh K. Jha, Yebin Wang, Minghui Zhu

    Abstract: This paper presents policy-based motion planning for robotic systems. The motion planning literature has been mostly focused on open-loop trajectory planning which is followed by tracking online. In contrast, we solve the problem of path planning and controller synthesis simultaneously by solving the related feedback control problem. We present a novel incremental policy (iPolicy) algorithm for mo… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  50. arXiv:2312.11480  [pdf, other

    cs.NE cs.CV cs.LG eess.IV

    Adaptive Smooth Activation for Improved Disease Diagnosis and Organ Segmentation from Radiology Scans

    Authors: Koushik Biswas, Debesh Jha, Nikhil Kumar Tomar, Gorkem Durak, Alpay Medetalibeyoglu, Matthew Antalek, Yury Velichko, Daniela Ladner, Amir Bohrani, Ulas Bagci

    Abstract: In this study, we propose a new activation function, called Adaptive Smooth Activation Unit (ASAU), tailored for optimized gradient propagation, thereby enhancing the proficiency of convolutional networks in medical image analysis. We apply this new activation function to two important and commonly used general tasks in medical image analysis: automatic disease diagnosis and organ segmentation in… ▽ More

    Submitted 29 November, 2023; originally announced December 2023.