Showing 1–11 of 11 results for author: Hegde, R M

  1. arXiv:2504.00417  [pdf, other]

    cs.NI

    Performance Evaluation of Scheduling Scheme in O-RAN 5G Network using NS-3

    Authors: A. K. Subudhi, A. Piccioni, V. Gudepu, A. Marotta, F. Graziosi, R. M. Hegde, K. Kondepu

    Abstract: The integration of Open Radio Access Network (O-RAN) principles into 5G networks introduces a paradigm shift in how radio resources are managed and optimized. O-RAN's open architecture enables the deployment of intelligent applications (xApps) that can dynamically adapt to varying network conditions and user demands. In this paper, we present radio resource scheduling schemes -- a possible O-RAN-c…

    Submitted 1 April, 2025; originally announced April 2025.

  2. arXiv:2206.02050  [pdf, other]

    cs.CV cs.SD eess.AS

    Learning Speaker-specific Lip-to-Speech Generation

    Authors: Munender Varshney, Ravindra Yadav, Vinay P. Namboodiri, Rajesh M Hegde

    Abstract: Understanding the lip movement and inferring the speech from it is notoriously difficult for the common person. The task of accurate lip-reading gets help from various cues of the speaker and its contextual or environmental setting. Every speaker has a different accent and speaking style, which can be inferred from their visual and speech features. This work aims to understand the correlation/mapp…

    Submitted 20 August, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: Accepted at ICPR 2022

  3. arXiv:2011.10727  [pdf, other]

    cs.CV

    Stochastic Talking Face Generation Using Latent Distribution Matching

    Authors: Ravindra Yadav, Ashish Sardana, Vinay P Namboodiri, Rajesh M Hegde

    Abstract: The ability to envisage the visual of a talking face based just on hearing a voice is a unique human capability. There have been a number of works that have solved for this ability recently. We differ from these approaches by enabling a variety of talking face generations based on single audio input. Indeed, just having the ability to generate a single talking face would make a system almost robot…

    Submitted 21 November, 2020; originally announced November 2020.

    Comments: InterSpeech 2020

  4. arXiv:2011.07340  [pdf, other]

    cs.CV

    Speech Prediction in Silent Videos using Variational Autoencoders

    Authors: Ravindra Yadav, Ashish Sardana, Vinay P Namboodiri, Rajesh M Hegde

    Abstract: Understanding the relationship between the auditory and visual signals is crucial for many different applications ranging from computer-generated imagery (CGI) and video editing automation to assisting people with hearing or visual impairments. However, this is challenging since the distribution of both audio and visual modality is inherently multimodal. Therefore, most of the existing methods ign…

    Submitted 14 November, 2020; originally announced November 2020.

  5. arXiv:2001.01555  [pdf, other]

    cs.RO

    A Generalized Framework for Autonomous Calibration of Wheeled Mobile Robots

    Authors: Mohan Krishna Nutalapati, Lavish Arora, Anway Bose, Ketan Rajawat, Rajesh M Hegde

    Abstract: Robotic calibration allows for the fusion of data from multiple sensors such as odometers, cameras, etc., by providing appropriate transformational relationships between the corresponding reference frames. For wheeled robots equipped with exteroceptive sensors, calibration entails learning the motion model of the sensor or the robot in terms of the odometric data, and must generally be performed p…

    Submitted 6 January, 2020; originally announced January 2020.

    Comments: This manuscript has been submitted to 'Elsevier Journal of Robotics and Autonomous Systems' and is under review for possible publication. Based on IROS 2019 conference submission [arXiv:1910.11917]

  6. arXiv:1910.11917  [pdf, other]

    cs.RO

    Model Free Calibration of Wheeled Robots Using Gaussian Process

    Authors: Mohan Krishna Nutalapati, Lavish Arora, Anway Bose, Ketan Rajawat, Rajesh M Hegde

    Abstract: Robotic calibration allows for the fusion of data from multiple sensors such as odometers, cameras, etc., by providing appropriate relationships between the corresponding reference frames. For wheeled robots equipped with camera/lidar along with wheel encoders, calibration entails learning the motion model of the sensor or the robot in terms of the data from the encoders and generally carried out…

    Submitted 25 October, 2019; originally announced October 2019.

    Comments: To be published in International Conference on Intelligent Robots and Systems (IROS), 2019

  7. arXiv:1711.01872  [pdf, other]

    eess.AS cs.SD

    Minimum-Phase HRTF Modeling of Pinna Spectral Notches using Group Delay Decomposition

    Authors: Sandeep Reddy C, Rajesh M Hegde

    Abstract: Accurate reconstruction of HRTFs is important in the development of high quality binaural sound synthesis systems. Conventionally, minimum phase HRTF model development for reconstruction of HRTFs has been limited to minimum phase-pure delay models which ignore the all pass component of the HRTF. In this paper, a novel method for minimum phase HRTF modelling of Pinna Spectral Notches (PSNs) using g…

    Submitted 3 April, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 11 pages; This paper is a preprint of a paper submitted to IET Signal Processing Journal. If accepted, the copy of record will be available at the IET Digital Library

  8. arXiv:1701.02080  [pdf]

    cs.NI

    A Review of Localization and Tracking Algorithms in Wireless Sensor Networks

    Authors: Sudhir Kumar, Rajesh M. Hegde

    Abstract: In this paper, a comprehensive survey of the pioneer as well as the state-of-the-art localization and tracking methods in the wireless sensor networks is presented. Localization is mostly applicable for the static sensor nodes, whereas, tracking for the mobile sensor nodes. The localization algorithms are broadly classified as range-based and range-free methods. The estimated range (distance) betw…

    Submitted 9 January, 2017; originally announced January 2017.

  9. arXiv:1610.05948  [pdf, ps, other]

    cs.SD cs.CL stat.AP

    A Bayesian Approach to Estimation of Speaker Normalization Parameters

    Authors: Dhananjay Ram, Debasis Kundu, Rajesh M. Hegde

    Abstract: In this work, a Bayesian approach to speaker normalization is proposed to compensate for the degradation in performance of a speaker independent speech recognition system. The speaker normalization method proposed herein uses the technique of vocal tract length normalization (VTLN). The VTLN parameters are estimated using a novel Bayesian approach which utilizes the Gibbs sampler, a special type o…

    Submitted 19 October, 2016; originally announced October 2016.

    Comments: 23 Pages, 9 Figures

  10. arXiv:1508.02834  [pdf, ps, other]

    cs.NI

    Second Order Cone Programming for Sensor Node Localization in Mixed LOS/NLOS Conditions

    Authors: Sudhir Kumar, Rishabh Dixit, Rajesh M. Hegde

    Abstract: In this paper, a novel method for sensor node localization under mixed line-of-sight/non-line-of-sight (LOS/NLOS) conditions based on second order cone programming (SOCP) is presented. SOCP methods have, hitherto, not been utilized in the node localization under mixed LOS/NLOS conditions. Unlike semidefinite programming (SDP) formulation, SOCP is computationally efficient for resource constrained…

    Submitted 12 August, 2015; originally announced August 2015.

  11. arXiv:1411.6741  [pdf, other]

    cs.SD

    A Complex Matrix Factorization approach to Joint Modeling of Magnitude and Phase for Source Separation

    Authors: Chaitanya Ahuja, Karan Nathwani, Rajesh M. Hegde

    Abstract: Conventional NMF methods for source separation factorize the matrix of spectral magnitudes. Spectral Phase is not included in the decomposition process of these methods. However, phase of the speech mixture is generally used in reconstructing the target speech signal. This results in undesired traces of interfering sources in the target signal. In this paper the spectral phase is incorporated in t…

    Submitted 25 November, 2014; originally announced November 2014.

    Comments: 5 pages, 3 figures