Showing 1–24 of 24 results for author: M, V

Searching in archive cs.
  1. SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction

    Authors: Saurabh Agrawal, Raj Gohil, Gopal Kumar Agrawal, Vikram C M, Kushal Verma

    Abstract: Speech quality assessment is a critical process in selecting text-to-speech synthesis (TTS) or voice conversion models. Evaluation of voice synthesis can be done using objective metrics or subjective metrics. Although there are many objective metrics like the Perceptual Evaluation of Speech Quality (PESQ), Perceptual Objective Listening Quality Assessment (POLQA) or Short-Time Objective Intelligib…

    Submitted 2 June, 2025; originally announced June 2025.

    Journal ref: 2024 International Conference on Signal Processing and Communications (SPCOM), 2024, pages 1-5, 10631576

  2. arXiv:2504.19716  [pdf, other]

    cs.RO

    QuickGrasp: Lightweight Antipodal Grasp Planning with Point Clouds

    Authors: Navin Sriram Ravie, Keerthi Vasan M, Asokan Thondiyath, Bijo Sebastian

    Abstract: Grasping has been a long-standing challenge in facilitating the final interface between a robot and the environment. As environments and tasks become complicated, the need to embed higher intelligence to infer from the surroundings and act on them has become necessary. Although most methods utilize techniques to estimate grasp pose by treating the problem via pure sampling-based approaches in the…

    Submitted 9 May, 2025; v1 submitted 28 April, 2025; originally announced April 2025.

  3. arXiv:2503.14538  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Vision-Language Models for Acute Tuberculosis Diagnosis: A Multimodal Approach Combining Imaging and Clinical Data

    Authors: Ananya Ganapthy, Praveen Shastry, Naveen Kumarasami, Anandakumar D, Keerthana R, Mounigasri M, Varshinipriya M, Kishore Prasath Venkatesh, Bargava Subramanian, Kalyan Sivasailam

    Abstract: Background: This study introduces a Vision-Language Model (VLM) leveraging SIGLIP and Gemma-3b architectures for automated acute tuberculosis (TB) screening. By integrating chest X-ray images and clinical notes, the model aims to enhance diagnostic accuracy and efficiency, particularly in resource-limited settings. Methods: The VLM combines visual data from chest X-rays with clinical context to…

    Submitted 1 April, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: 11 pages, 3 figures

    MSC Class: 68T07; 68T45; 92C55; 92C50; 68U10

  4. arXiv:2412.19467  [pdf]

    cs.CV cs.AI cs.LG

    Optimizing Helmet Detection with Hybrid YOLO Pipelines: A Detailed Analysis

    Authors: Vaikunth M, Dejey D, Vishaal C, Balamurali S

    Abstract: Helmet detection is crucial for advancing protection levels in public road traffic dynamics. This problem statement translates to an object detection task. Therefore, this paper compares recent You Only Look Once (YOLO) models in the context of helmet detection in terms of reliability and computational load. Specifically, YOLOv8, YOLOv9, and the newly released YOLOv11 have been used. Besides, a mo…

    Submitted 27 December, 2024; originally announced December 2024.

  5. arXiv:2408.01746  [pdf, other]

    cs.CV

    Domain penalisation for improved Out-of-Distribution Generalisation

    Authors: Shuvam Jena, Sushmetha Sumathi Rajendran, Karthik Seemakurthy, Sasithradevi A, Vijayalakshmi M, Prakash Poornachari

    Abstract: In the field of object detection, domain generalisation (DG) aims to ensure robust performance across diverse and unseen target domains by learning the robust domain-invariant features corresponding to the objects of interest across multiple source domains. While there are many approaches established for performing DG for the task of classification, there has been a very little focus on object det…

    Submitted 3 August, 2024; originally announced August 2024.

  6. arXiv:2407.11985  [pdf]

    cs.HC cs.AI

    A Novel Implementation of Marksheet Parser Using PaddleOCR

    Authors: Sankalp Bagaria, S Irene, Harikrishnan, Elakia V M

    Abstract: When an applicant files an online application, there is usually a requirement to fill the marks in the online form and also upload the marksheet in the portal for the verification. A system was built for reading the uploaded marksheet using OCR and automatically filling the rows/ columns in the online form. Though there are partial solutions to this problem - implemented using PyTesseract - the ac…

    Submitted 4 June, 2024; originally announced July 2024.

    Comments: 5 pages, 1 figure, 1 table

  7. arXiv:2403.12044  [pdf, other]

    cs.CV cs.LG

    Mobile Application for Oral Disease Detection using Federated Learning

    Authors: Shankara Narayanan V, Sneha Varsha M, Syed Ashfaq Ahmed, Guruprakash J

    Abstract: The mouth, often regarded as a window to the internal state of the body, plays an important role in reflecting one's overall health. Poor oral hygiene has far-reaching consequences, contributing to severe conditions like heart disease, cancer, and diabetes, while inadequate care leads to discomfort, pain, and costly treatments. Federated Learning (FL) for object detection can be utilized for this…

    Submitted 27 October, 2023; originally announced March 2024.

  8. arXiv:2306.13456  [pdf]

    cs.LG

    Enhanced Dengue Outbreak Prediction in Tamilnadu using Meteorological and Entomological data

    Authors: Varalakshmi M, Daphne Lopez

    Abstract: This paper focuses on studying the impact of climate data and vector larval indices on dengue outbreak. After a comparative study of the various LSTM models, Bidirectional Stacked LSTM network is selected to analyze the time series climate data and health data collected for the state of Tamil Nadu (India), for the period 2014 to 2020. Prediction accuracy of the model is significantly improved by i…

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 6 Pages and submitted to ICAI'22 - The 24th International Conference on Artificial Intelligence

  9. Cyber-Resilient Privacy Preservation and Secure Billing Approach for Smart Energy Metering Devices

    Authors: Venkatesh Kumar M

    Abstract: Most of the smart applications, such as smart energy metering devices, demand strong privacy preservation to strengthen data privacy. However, it is difficult to protect the privacy of the smart device data, especially on the client side. It is mainly because payment for billing is computed by the server deployed at the client's side, and it is highly challenging to prevent the leakage of client's…

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Journal article

    ACM Class: F.2.2; I.2.7

    Journal ref: Volume 70 Issue 9, 337-345, September 2022

  10. ASTROMER: A transformer-based embedding for the representation of light curves

    Authors: C. Donoso-Oliva, I. Becker, P. Protopapas, G. Cabrera-Vives, Vishnu M., Harsh Vardhan

    Abstract: Taking inspiration from natural language embeddings, we present ASTROMER, a transformer-based model to create representations of light curves. ASTROMER was pre-trained in a self-supervised manner, requiring no human-labeled data. We used millions of R-band light sequences to adjust the ASTROMER weights. The learned representation can be easily adapted to other surveys by re-training ASTROMER on ne…

    Submitted 9 November, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Journal ref: A&A 670, A54 (2023)

  11. arXiv:2111.04003  [pdf]

    cs.LG

    Predictive Model for Gross Community Production Rate of Coral Reefs using Ensemble Learning Methodologies

    Authors: Umanandini S, Rishivardhan M, Aouthithiye Barathwaj SR Y, Jasline Augusta J, Shrirang Sapate, Reenasree S, Vigneash M

    Abstract: Coral reefs play a vital role in maintaining the ecological balance of the marine ecosystem. Various marine organisms depend on coral reefs for their existence and their natural processes. Coral reefs provide the necessary habitat for reproduction and growth for various exotic species of the marine ecosystem. In this article, we discuss the most important parameters which influence the lifecycle o…

    Submitted 23 January, 2023; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: 8 pages, 18 figures

    MSC Class: 68T20

    ACM Class: I.2.8

  12. arXiv:2110.01467  [pdf, other]

    cs.LG cs.IR

    HyperTeNet: Hypergraph and Transformer-based Neural Network for Personalized List Continuation

    Authors: Vijaikumar M, Deepesh Hada, Shirish Shevade

    Abstract: The personalized list continuation (PLC) task is to curate the next items to user-generated lists (ordered sequence of items) in a personalized way. The main challenge in this task is understanding the ternary relationships among the interacting entities (users, items, and lists) that the existing works do not consider. Further, they do not take into account the multi-hop relationships among entit…

    Submitted 7 October, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: 11 pages, 5 figures, The IEEE International Conference on Data Mining (ICDM) 2021

  13. arXiv:2011.05895  [pdf, other]

    cs.CV cs.LG

    Transferred Fusion Learning using Skipped Networks

    Authors: Vinayaka R Kamath, Vishal S, Varun M

    Abstract: Identification of an entity that is of interest is prominent in any intelligent system. The visual intelligence of the model is enhanced when the capability of recognition is added. Several methods such as transfer learning and zero shot learning help to reuse the existing models or augment the existing model to achieve improved performance at the task of object recognition. Transferred fusion lea…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 9 Pages, 7 figures, Conference

  14. Multiclass Model for Agriculture development using Multivariate Statistical method

    Authors: N Deepa, Mohammad Zubair Khan, Prabadevi B, Durai Raj Vincent P M, Praveen Kumar Reddy Maddikunta, Thippa Reddy Gadekallu

    Abstract: Mahalanobis taguchi system (MTS) is a multi-variate statistical method extensively used for feature selection and binary classification problems. The calculation of orthogonal array and signal-to-noise ratio in MTS makes the algorithm complicated when more number of factors are involved in the classification problem. Also the decision is based on the accuracy of normal and abnormal observations of…

    Submitted 7 October, 2020; v1 submitted 12 September, 2020; originally announced September 2020.

    Comments: in IEEE Access

  15. arXiv:2008.00106  [pdf, other]

    cs.CV

    Utilising Visual Attention Cues for Vehicle Detection and Tracking

    Authors: Feiyan Hu, Venkatesh G M, Noel E. O'Connor, Alan F. Smeaton, Suzanne Little

    Abstract: Advanced Driver-Assistance Systems (ADAS) have been attracting attention from many researchers. Vision-based sensors are the closest way to emulate human driver visual behavior while driving. In this paper, we explore possible ways to use visual attention (saliency) for object detection and tracking. We investigate: 1) How a visual attention map such as a subjectness attention or saliency m…

    Submitted 31 July, 2020; originally announced August 2020.

    Comments: Accepted in ICPR2020

  16. arXiv:1907.08440  [pdf, other]

    cs.IR cs.LG stat.ML

    Neural Cross-Domain Collaborative Filtering with Shared Entities

    Authors: Vijaikumar M, Shirish Shevade, M N Murty

    Abstract: Cross-Domain Collaborative Filtering (CDCF) provides a way to alleviate data sparsity and cold-start problems present in recommendation systems by exploiting the knowledge from related domains. Existing CDCF models are either based on matrix factorization or deep neural networks. Either of the techniques in isolation may result in suboptimal performance for the prediction task. Also, most of the e…

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: 10 pages, 5 figures

  17. arXiv:1411.7482  [pdf, other]

    cs.NI

    SmartConnect: A System for the Design and Deployment of Wireless Sensor Networks

    Authors: Abhijit Bhattacharya, Sanjay Motilal Ladwa, Rachit Srivastava, Aniruddha Mallya, Akhila Rao, Easwar Vivek. M, Deeksha G. Rao Sahib, S. V. R. Anand, Anurag Kumar

    Abstract: We have developed SmartConnect, a tool that addresses the growing need for the design and deployment of multihop wireless relay networks for connecting sensors to a control center. Given the locations of the sensors, the traffic that each sensor generates, the quality of service (QoS) requirements, and the potential locations at which relays can be placed, SmartConnect helps design and deploy a lo…

    Submitted 27 November, 2014; originally announced November 2014.

  18. arXiv:1204.2613  [pdf]

    cs.OH

    Cloud Computing For Microfinances

    Authors: Suma. V, Bhagavant Deshpande, Vaidehi. M, T. R. Gopalakrishnan Nair

    Abstract: Evolution of Science and Engineering has led to the growth of several commercial applications. The wide spread implementation of commercial based applications has in turn directed the emergence of advanced technologies such as cloud computing. India has well proven itself as a potential hub for advanced technologies including cloud based industrial market. Microfinance system has emerged out as a…

    Submitted 12 April, 2012; originally announced April 2012.

    Comments: 3 Pages, 2 Figures, International Conference On Systemics, Cybernetics and Informatics

  19. arXiv:1111.4898  [pdf, other]

    cs.SI physics.soc-ph

    A Navigation Algorithm Inspired by Human Navigation

    Authors: Vijesh M., Sudarshan Iyengar, Vijay Mahantesh, Amitash Ramesh, Veni Madhavan

    Abstract: Human navigation has been a topic of interest in spatial cognition from the past few decades. It has been experimentally observed that humans accomplish the task of way-finding a destination in an unknown environment by recognizing landmarks. Investigations using network analytic techniques reveal that humans, when asked to way-find their destination, learn the top ranked nodes of a network. In th…

    Submitted 21 November, 2011; originally announced November 2011.

    Comments: Human Navigation, Path Concatenation, Hotspots, Center Strategic Paths, Approximation Algorithm

  20. arXiv:1107.3674   

    cs.MA

    Autonomous Traffic Control System Using Agent Based Technology

    Authors: Venkatesh. M, K. Kumar, Srinivas. V

    Abstract: The way of analyzing, designing and building of real-time projects has been changed due to the rapid growth of internet, mobile technologies and intelligent applications. Most of these applications are intelligent, tiny and distributed components called as agent. Agent works like it takes the input from numerous real-time sources and gives back the real-time response. In this paper how these agent…

    Submitted 2 August, 2011; v1 submitted 19 July, 2011; originally announced July 2011.

    Comments: This paper has been withdrawn by the authors. Total Pages 8 and 3 Figures, Author wishes to withdraw for some major changes and corrections

    Journal ref: International Journal of Advancements in Technology, Vol.2, No. 3, July 2011

  21. arXiv:1107.1954  [pdf]

    cs.NI

    A Novel Agent Based Approach for Controlling Network Storms

    Authors: Dr. T. R. Gopalakrishnan Nair, B. R. Shubhamangala, Vaidehi. M

    Abstract: One of the fundamental data transmission mechanisms in Ethernet LAN is broadcasting. Flooding is a direct broadcasting technique used in these networks. A significant drawback of this method is that it can lead to broadcast storms. This phenomenon is more common in multivendor switch environment. Broadcast storms usually results in dissension, collision and redundancy leading to degradation of the…

    Submitted 11 July, 2011; originally announced July 2011.

    Comments: 7 pages, 12 figures IEEE Third International Conference on Communications and Electronics (ICCE 2010). Nha Trang, Vietnam, Proceedings, 11-13 August 2010

  22. Delay Optimal Event Detection on Ad Hoc Wireless Sensor Networks

    Authors: Premkumar Karumbu, Venkata K. Prasanthi M., Anurag Kumar

    Abstract: We consider a small extent sensor network for event detection, in which nodes take samples periodically and then contend over a random access network to transmit their measurement packets to the fusion center. We consider two procedures at the fusion center to process the measurements. The Bayesian setting is assumed; i.e., the fusion center has a prior distribution on the change time. In th…

    Submitted 30 May, 2011; originally announced May 2011.

    Comments: To appear in ACM Transactions on Sensor Networks. A part of this work was presented in IEEE SECON 2006, and Allerton 2010

  23. arXiv:1001.3716  [pdf, other]

    cs.AR

    A Multicore Processor based Real-Time System for Automobile management application

    Authors: Vaidehi. M., T. R. Gopalakrishnan Nair

    Abstract: In this paper we propose an Intelligent Management System which is capable of managing the automobile functions using the rigorous real-time principles and a multicore processor in order to realize higher efficiency and safety for the vehicle. It depicts how various automobile functionalities can be fine grained and treated to fit in real time concepts. It also shows how the modern multicore pro…

    Submitted 20 January, 2010; originally announced January 2010.

    Comments: 9 pages, 4 figures

  24. arXiv:0907.4881  [pdf]

    cs.NI cs.PF

    Approximate mechanism for measuring stability of Internet link in aggregated Internet pipe

    Authors: Vipin M, Mohamed Imran K R

    Abstract: In this article we propose a method for measuring internet connection stability which is fast and has negligible overhead for the process of its complexity. This method finds a relative value for representing the stability of internet connections and can also be extended for aggregated internet connections. The method is documented with help of a real time implementation and results are shared.…

    Submitted 28 July, 2009; originally announced July 2009.

    Comments: 8 pages, 5 figures

    ACM Class: C.2.1; C.2.3; C.2.6