Showing 1–2 of 2 results for author: Abeysinghe, B

  1. arXiv:2407.17416

    eess.AS cs.AI cs.CL

    Explaining Spectrograms in Machine Learning: A Study on Neural Networks for Speech Classification

    Authors: Jesin James, Balamurali B. T., Binu Abeysinghe, Junchen Liu

    Abstract: This study investigates discriminative patterns learned by neural networks for accurate speech classification, with a specific focus on vowel classification tasks. By examining the activations and features of neural networks for vowel classification, we gain insights into what the networks "see" in spectrograms. Through the use of class activation mapping, we identify the frequencies that contribu…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 5th International Conference on Artificial Intelligence and Speech Technology (AIST-2023), New Delhi, India

  2. arXiv:2208.09775

    eess.AS cs.SD

    Visualising Model Training via Vowel Space for Text-To-Speech Systems

    Authors: Binu Abeysinghe, Jesin James, Catherine I. Watson, Felix Marattukalam

    Abstract: With the recent developments in speech synthesis via machine learning, this study explores incorporating linguistics knowledge to visualise and evaluate synthetic speech model training. If changes to the first and second formant (in turn, the vowel space) can be seen and heard in synthetic speech, this knowledge can inform speech synthesis technology developers. A speech synthesis model trained on…

    Submitted 20 August, 2022; originally announced August 2022.

    Comments: Accepted to Interspeech 2022