Skip to main content

Showing 1–1 of 1 results for author: Maesawa, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12279  [pdf, other

    cs.SD cs.AI eess.AS

    Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features

    Authors: Shaoxiang Dang, Tetsuya Matsumoto, Yoshinori Takeuchi, Takashi Tsuboi, Yasuhiro Tanaka, Daisuke Nakatsubo, Satoshi Maesawa, Ryuta Saito, Masahisa Katsuno, Hiroaki Kudo

    Abstract: The potential of deep learning in clinical speech processing is immense, yet the hurdles of limited and imbalanced clinical data samples loom large. This article addresses these challenges by showcasing the utilization of automatic speech recognition and self-supervised learning representations, pre-trained on extensive datasets of normal speech. This innovative approach aims to estimate voice qua… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by Interspeech 2024