
Showing 1–13 of 13 results for author: R., D

Searching in archive cs.
  1. arXiv:2503.22363  [pdf, ps, other]

    cs.CV cs.AI

    ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection

    Authors: Nandakishor M, Vrinda Govind V, Anuradha Puthalath, Anzy L, Swathi P S, Aswathi R, Devaprabha A R, Varsha Raj, Midhuna Krishnan K, Akhila Anilkumar T V, Yamuna P V

    Abstract: Force estimation in human-object interactions is crucial for various fields like ergonomics, physical therapy, and sports science. Traditional methods depend on specialized equipment such as force plates and sensors, which makes accurate assessments both expensive and restricted to laboratory settings. In this paper, we introduce ForcePose, a novel deep learning framework that estimates applied fo…

    Submitted 28 March, 2025; originally announced March 2025.

  2. arXiv:2502.05634  [pdf]

    cs.SE

    ML DevOps Adoption in Practice: A Mixed-Method Study of Implementation Patterns and Organizational Benefits

    Authors: Dileepkumar S R, Juby Mathew

    Abstract: Machine Learning (ML) DevOps, also known as MLOps, has emerged as a critical framework for efficiently operationalizing ML models in various industries. This study investigates the adoption trends, implementation efforts, and benefits of ML DevOps through a combination of literature review and empirical analysis. By surveying 150 professionals across industries and conducting in-depth interviews w…

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 10 Pages, 2 Tables

  3. arXiv:2411.09420  [pdf, other]

    cs.CV cs.AI cs.LG

    SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers

    Authors: Shravan Venkatraman, Jaskaran Singh Walia, Joe Dhanith P R

    Abstract: Vision Transformers (ViTs) have redefined image classification by leveraging self-attention to capture complex patterns and long-range dependencies between image patches. However, a key challenge for ViTs is efficiently incorporating multi-scale feature representations, which is inherent in convolutional neural networks (CNNs) through their hierarchical structure. Graph transformers have made stri…

    Submitted 7 January, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: 14 pages, 8 figures, 9 tables

    MSC Class: 68T07 ACM Class: I.2.10

  4. arXiv:2411.01251  [pdf]

    eess.IV cs.CV cs.LG

    Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures

    Authors: Ameya Uppina, S Navaneetha Krishnan, Talluri Krishna Sai Teja, Nikhil N Iyer, Joe Dhanith P R

    Abstract: Diabetic Retinopathy (DR) is a severe complication of diabetes. Damaged or abnormal blood vessels can cause loss of vision. The need for massive screening of a large population of diabetic patients has generated an interest in a computer-aided fully automatic diagnosis of DR. Deep learning frameworks, particularly convolutional neural networks (CNNs), have shown great interest and prom…

    Submitted 20 January, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

  5. arXiv:2407.18552  [pdf]

    cs.MM cs.CL cs.CV cs.LG cs.SD eess.AS

    Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention

    Authors: Joe Dhanith P R, Shravan Venkatraman, Vigya Sharma, Santhosh Malarvannan, Modigari Narendra

    Abstract: Understanding emotions is a fundamental aspect of human communication. Integrating audio and video signals offers a more comprehensive understanding of emotional states compared to traditional methods that rely on a single data source, such as speech or facial expressions. Despite its potential, multimodal emotion recognition faces significant challenges, particularly in synchronization, feature e…

    Submitted 19 February, 2025; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: 38 Pages, 9 Tables, 12 Figures

    ACM Class: F.2.2; I.2.7

  6. arXiv:2407.08349  [pdf]

    cs.CV

    Spine Vision X-Ray Image based GUI Planning of Pedicle Screws Using Enhanced YOLOv5 for Vertebrae Segmentation

    Authors: Yashwanth Rao, Gaurisankar S, Durga R, Aparna Purayath, Vivek Maik, Manojkumar Lakshmanan, Mohanasankar Sivaprakasm

    Abstract: In this paper, we propose an innovative Graphical User Interface (GUI) aimed at improving preoperative planning and intra-operative guidance for precise spinal screw placement through vertebrae segmentation. The methodology encompasses both front-end and back-end computations. The front end comprises a GUI that allows surgeons to precisely adjust the placement of screws on X-Ray images, thereby im…

    Submitted 11 July, 2024; originally announced July 2024.

  7. arXiv:2407.08347  [pdf]

    eess.IV cs.CV

    GUI-based Pedicle Screw Planning on Fluoroscopic Images Utilizing Vertebral Segmentation

    Authors: Vivek Maik, Aparna Purayath, Durga R, Manojkumar Lakshmanan, Mohanasankar Sivaprakasm

    Abstract: The proposed work establishes a novel Graphical User Interface (GUI) framework, primarily designed for intraoperative pedicle screw planning. Current planning workflow in Image Guided Surgeries primarily relies on pre-operative CT planning. Intraoperative CT planning can be time-consuming and expensive and thus is not a common practice. In situations where efficiency and cost-effectiveness are par…

    Submitted 11 July, 2024; originally announced July 2024.

  8. arXiv:2405.20959  [pdf, other]

    cs.AI cs.DB

    Navigating Tabular Data Synthesis Research: Understanding User Needs and Tool Capabilities

    Authors: Maria F. Davila R., Sven Groen, Fabian Panse, Wolfram Wingerath

    Abstract: In an era of rapidly advancing data-driven applications, there is a growing demand for data in both research and practice. Synthetic data have emerged as an alternative when no real data is available (e.g., due to privacy regulations). Synthesizing tabular data presents unique and complex challenges, especially handling (i) missing values, (ii) dataset imbalance, (iii) diverse column types, and (i…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 14 pages, 3 figures

  9. arXiv:2402.00004  [pdf]

    cs.RO cs.CV

    Revolutionizing Underwater Exploration of Autonomous Underwater Vehicles (AUVs) and Seabed Image Processing Techniques

    Authors: Rajesh Sharma R, Akey Sungheetha, Dr Chinnaiyan R

    Abstract: The Earth's oceans are one of the last frontiers in the world, with only a fraction of their depths having been explored. Advancements in technology have led to the development of Autonomous Underwater Vehicles (AUVs) that can operate independently and perform complex tasks underwater. These vehicles have revolutionized underwater exploration, allowing us to study and understand our ocean…

    Submitted 22 November, 2023; originally announced February 2024.

    Comments: 7 pages

  10. arXiv:2009.02942  [pdf]

    cs.CR

    Detection of Colluded Black-hole and Grey-hole attacks in Cloud Computing

    Authors: Divyasree I R, Selvamani K, Riasudheen H

    Abstract: The availability of high-capacity networks, massive storage, hardware virtualization, utility computing, and service-oriented architecture leads to high accessibility of cloud computing. The extensive usage of cloud resources raises many security concerns. Black-hole and Gray-hole attacks are notable attacks on cloud networks because they are easy to launch but difficult to detect. T…

    Submitted 7 September, 2020; originally announced September 2020.

  11. arXiv:1810.05502  [pdf]

    eess.SP cs.DC

    Asynchronous Wi-Fi Control Interface (AWCI) Using Socket IO Technology

    Authors: Devipriya T K, Jovita Franci A, Deepa R, Godwin Sam Josh

    Abstract: The Internet of Things (IoT) is a system of interrelated computing devices connected to the Internet that are provided with unique identifiers and have the ability to transfer data over a network without requiring human-to-human or human-to-computer interaction. The Raspberry Pi 3, a popular, cheap, small, and powerful computer with built-in Wi-Fi, can be used to make any device smart by connecting to that par…

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: 5 pages, 5 figures, published with Global Research and Development Journal for Engineering

    Journal ref: Global Research and Development Journal for Engineering, 1(3), pp.66-70, 2017

  12. arXiv:1706.09585  [pdf, other]

    cs.LG

    Online Reweighted Least Squares Algorithm for Sparse Recovery and Application to Short-Wave Infrared Imaging

    Authors: Subhadip Mukherjee, Deepak R., Huaijin Chen, Ashok Veeraraghavan, Chandra Sekhar Seelamantula

    Abstract: We address the problem of sparse recovery in an online setting, where random linear measurements of a sparse signal are revealed sequentially and the objective is to recover the underlying signal. We propose a reweighted least squares (RLS) algorithm to solve the problem of online sparse reconstruction, wherein a system of linear equations is solved using conjugate gradient with the arrival of eve…

    Submitted 29 June, 2017; originally announced June 2017.

  13. arXiv:1106.3517  [pdf]

    cs.CV

    DWT Based Fingerprint Recognition using Non Minutiae Features

    Authors: Shashi Kumar D. R., K. B. Raja, R. K. Chhootaray, Sabyasachi Pattanaik

    Abstract: Forensic applications like criminal investigations, terrorist identification, and national security issues require a strong fingerprint database and an efficient identification system. In this paper we propose the DWT based Fingerprint Recognition using Non Minutiae (DWTFR) algorithm. The fingerprint image is decomposed into multi-resolution sub-bands of LL, LH, HL, and HH by applying 3-level DWT. The Dominan…

    Submitted 17 June, 2011; originally announced June 2011.

    Comments: 9 pages

    Journal ref: IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 2, March 2011, 257-265