Showing 1–8 of 8 results for author: Kumar, C

Searching in archive eess.
  1. arXiv:2506.01365  [pdf, ps, other]

    cs.SD cs.CL eess.AS

    Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion

    Authors: Kumud Tripathi, Chowdam Venkata Kumar, Pankaj Wasnik

    Abstract: Voice Activity Detection (VAD) plays a key role in speech processing, often utilizing hand-crafted or neural features. This study examines the effectiveness of Mel-Frequency Cepstral Coefficients (MFCCs) and pre-trained model (PTM) features, including wav2vec 2.0, HuBERT, WavLM, UniSpeech, MMS, and Whisper. We propose FusionVAD, a unified framework that combines both feature types using three fusi…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted at INTERSPEECH 2025, 5 pages, 4 figures, 2 tables

  2. arXiv:2412.17823  [pdf]

    eess.SP cs.AI cs.LG eess.SY

    RUL forecasting for wind turbine predictive maintenance based on deep learning

    Authors: Syed Shazaib Shah, Tan Daoliang, Sah Chandan Kumar

    Abstract: Predictive maintenance (PdM) is increasingly pursued to reduce wind farm operation and maintenance costs by accurately predicting the remaining useful life (RUL) and strategically scheduling maintenance. However, the remoteness of wind farms often renders current methodologies ineffective, as they fail to provide a sufficiently reliable advance time window for maintenance planning, limiting PdM's…

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: 19 pages, 16 figures, Journal Paper

    Report number: Volume 10, Issue 20, e39268, October 30, 2024

    MSC Class: 14J60 (Primary)

    Journal ref: Heliyon (Journal); Volume 10, Issue 20, e39268, October 30, 2024

  3. arXiv:2407.08655  [pdf, other]

    eess.IV cs.AI cs.LG physics.med-ph

    SPOCKMIP: Segmentation of Vessels in MRAs with Enhanced Continuity using Maximum Intensity Projection as Loss

    Authors: Chethan Radhakrishna, Karthikesh Varma Chintalapati, Sri Chandana Hudukula Ram Kumar, Raviteja Sutrave, Hendrik Mattern, Oliver Speck, Andreas Nürnberger, Soumick Chatterjee

    Abstract: Identification of vessel structures of different sizes in biomedical images is crucial in the diagnosis of many neurodegenerative diseases. However, the sparsity of good-quality annotations of such images makes the task of vessel segmentation challenging. Deep learning offers an efficient way to segment vessels of different sizes by learning their high-level feature representations and the spatial…

    Submitted 11 July, 2024; originally announced July 2024.

  4. arXiv:2312.09842  [pdf, ps, other]

    cs.SD eess.AS

    On the compression of shallow non-causal ASR models using knowledge distillation and tied-and-reduced decoder for low-latency on-device speech recognition

    Authors: Nagaraj Adiga, Jinhwan Park, Chintigari Shiva Kumar, Shatrughan Singh, Kyungmin Lee, Chanwoo Kim, Dhananjaya Gowda

    Abstract: Recently, the cascaded two-pass architecture has emerged as a strong contender for on-device automatic speech recognition (ASR). A cascade of causal and shallow non-causal encoders coupled with a shared decoder enables operation in both streaming and look-ahead modes. In this paper, we propose a shallow cascaded model by combining various model compression techniques such as knowledge distillation,…

    Submitted 15 December, 2023; originally announced December 2023.

  5. arXiv:2311.12758  [pdf, other]

    eess.SY

    Estimating time of arrival of vehicle fleets with GCN based traffic prediction

    Authors: Shivika Sharma, Nandini Mawane, Dhruthick Gowda M, Mayur Taware, Chetan Kumar, Yash Chandrashekhar Dixit, Rakshit Ramesh

    Abstract: This paper presents an effective framework for estimating time of arrival of vehicles (buses) in an Intelligent Transit Management System (ITMS) having sparse position updates. Our contributions towards this are, firstly, implementing a constrained optimization based road linestring segmenting framework ensuring ideal segment lengths and segments with sufficient density of vehicle position measure…

    Submitted 21 November, 2023; originally announced November 2023.

  6. arXiv:2202.13541  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Pattern Based Multivariable Regression using Deep Learning (PBMR-DP)

    Authors: Jiztom Kavalakkatt Francis, Chandan Kumar, Jansel Herrera-Gerena, Kundan Kumar, Matthew J Darr

    Abstract: We propose a deep learning methodology for multivariate regression that is based on pattern recognition that triggers fast learning over sensor data. We used a conversion of sensors-to-image which enables us to take advantage of Computer Vision architectures and training processes. In addition to this data preparation methodology, we explore the use of state-of-the-art architectures to generate re…

    Submitted 9 March, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: 7 pages, 5 figures, 3 tables

  7. arXiv:2110.11795  [pdf, other]

    eess.IV cs.CV

    HDRVideo-GAN: Deep Generative HDR Video Reconstruction

    Authors: Mrinal Anand, Nidhin Harilal, Chandan Kumar, Shanmuganathan Raman

    Abstract: High dynamic range (HDR) videos provide a more visually realistic experience than the standard low dynamic range (LDR) videos. Despite significant progress in HDR imaging, it is still a challenging task to capture high-quality HDR video with a conventional off-the-shelf camera. Existing approaches rely entirely on using dense optical flow between the neighboring LDR sequences to reconstruct…

    Submitted 3 November, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: In Proceedings of 12th Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP-21)

  8. arXiv:2002.00336  [pdf, other]

    cs.CV cs.LG cs.RO eess.IV

    3D Object Detection on Point Clouds using Local Ground-aware and Adaptive Representation of scenes' surface

    Authors: Arun CS Kumar, Disha Ahuja, Ashwath Aithal

    Abstract: A novel, adaptive ground-aware, and cost-effective 3D Object Detection pipeline is proposed. The ground surface representation introduced in this paper, in comparison to its uni-planar counterparts (methods that model the surface of a whole 3D scene using a single plane), is far more accurate while being ~10x faster. The novelty of the ground representation lies both in the way in which the ground s…

    Submitted 26 June, 2020; v1 submitted 2 February, 2020; originally announced February 2020.