Showing 1–6 of 6 results for author: Sreeram, A

  1. arXiv:2203.15283  [pdf, other]

    eess.AS cs.LG

    Mel Frequency Spectral Domain Defenses against Adversarial Attacks on Speech Recognition Systems

    Authors: Nicholas Mehlman, Anirudh Sreeram, Raghuveer Peri, Shrikanth Narayanan

    Abstract: A variety of recent works have looked into defenses for deep neural networks against adversarial attacks, particularly within the image processing domain. Speech processing applications such as automatic speech recognition (ASR) are increasingly relying on deep learning models, and so are also prone to adversarial attacks. However, many of the defenses explored for ASR simply adapt the image-domain…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: This paper is 5 pages long and was submitted to Interspeech 2022

  2. arXiv:2108.05520  [pdf, other]

    eess.AS cs.SD eess.SP

    Dereverberation of Autoregressive Envelopes for Far-field Speech Recognition

    Authors: Anurenjan Purushothaman, Anirudh Sreeram, Rohit Kumar, Sriram Ganapathy

    Abstract: The task of speech recognition in far-field environments is adversely affected by the reverberant artifacts that manifest as the temporal smearing of the sub-band envelopes. In this paper, we develop a neural model for speech dereverberation using the long-term sub-band envelopes of speech. The sub-band envelopes are derived using frequency domain linear prediction (FDLP) which performs an autoregre…

    Submitted 13 August, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2008.03339

  3. arXiv:2107.05222  [pdf, other]

    eess.AS cs.LG eess.SP

    Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems

    Authors: Anirudh Sreeram, Nicholas Mehlman, Raghuveer Peri, Dillon Knox, Shrikanth Narayanan

    Abstract: In this paper we investigate speech denoising as a defense against adversarial attacks on automatic speech recognition (ASR) systems. Adversarial attacks attempt to force misclassification by adding small perturbations to the original speech signal. We propose to counteract this by employing a neural-network based denoiser as a pre-processor in the ASR pipeline. The denoiser is independent of the…

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: 5 pages, 4 figures submitted to ASRU 2021

  4. arXiv:2008.03339  [pdf, other]

    eess.AS cs.SD eess.SP

    Deep Learning Based Dereverberation of Temporal Envelopes for Robust Speech Recognition

    Authors: Anurenjan Purushothaman, Anirudh Sreeram, Rohit Kumar, Sriram Ganapathy

    Abstract: Automatic speech recognition in reverberant conditions is a challenging task as the long-term envelopes of the reverberant speech are temporally smeared. In this paper, we propose a neural model for enhancement of sub-band temporal envelopes for dereverberation of speech. The temporal envelopes are derived using the autoregressive modeling framework of frequency domain linear prediction (FDLP). Th…

    Submitted 7 August, 2020; originally announced August 2020.

  5. arXiv:1911.12617  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Unsupervised Neural Mask Estimator For Generalized Eigen-Value Beamforming Based ASR

    Authors: Rohit Kumar, Anirudh Sreeram, Anurenjan Purushothaman, Sriram Ganapathy

    Abstract: The state-of-the-art methods for acoustic beamforming in multi-channel ASR are based on a neural mask estimator that predicts the presence of speech and noise. These models are trained using a paired corpus of clean and noisy recordings (teacher model). In this paper, we attempt to move away from the requirements of having supervised clean recordings for training the mask estimator. The models based o…

    Submitted 28 November, 2019; originally announced November 2019.

  6. arXiv:1911.05504  [pdf, other]

    eess.AS cs.LG cs.SD

    3-D Feature and Acoustic Modeling for Far-Field Speech Recognition

    Authors: Anurenjan Purushothaman, Anirudh Sreeram, Sriram Ganapathy

    Abstract: Automatic speech recognition in multi-channel reverberant conditions is a challenging task. The conventional way of suppressing the reverberation artifacts involves a beamforming based enhancement of the multi-channel speech signal, which is used to extract spectrogram based features for a neural network acoustic model. In this paper, we propose to extract features directly from the multi-channel…

    Submitted 26 January, 2020; v1 submitted 13 November, 2019; originally announced November 2019.