Skip to main content

Showing 1–2 of 2 results for author: Shetty, V M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.10511  [pdf, other

    eess.AS cs.SD

    Enhancing Age-Related Robustness in Children Speaker Verification

    Authors: Vishwas M. Shetty, Jiusi Zheng, Steven M. Lulich, Abeer Alwan

    Abstract: One of the main challenges in children's speaker verification (C-SV) is the significant change in children's voices as they grow. In this paper, we propose two approaches to improve age-related robustness in C-SV. We first introduce a Feature Transform Adapter (FTA) module that integrates local patterns into higher-level global representations, reducing overfitting to specific local features and i… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: Accepted to ICASSP 2025

  2. arXiv:2008.03247  [pdf, other

    eess.AS cs.CV cs.SD

    Investigation of Speaker-adaptation methods in Transformer based ASR

    Authors: Vishwas M. Shetty, Metilda Sagaya Mary N J, S. Umesh

    Abstract: End-to-end models are fast replacing the conventional hybrid models in automatic speech recognition. Transformer, a sequence-to-sequence model, based on self-attention popularly used in machine translation tasks, has given promising results when used for automatic speech recognition. This paper explores different ways of incorporating speaker information at the encoder input while training a trans… ▽ More

    Submitted 17 November, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 5 pages, 6 figures