Skip to main content

Showing 1–2 of 2 results for author: Akuzawa, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2112.02796  [pdf, other

    cs.SD cs.LG eess.AS

    Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion

    Authors: Kei Akuzawa, Kotaro Onishi, Keisuke Takiguchi, Kohki Mametani, Koichiro Mori

    Abstract: Variational autoencoder-based voice conversion (VAE-VC) has the advantage of requiring only pairs of speeches and speaker labels for training. Unlike the majority of the research in VAE-VC which focuses on utilizing auxiliary losses or discretizing latent variables, this paper investigates how an increasing model expressiveness has benefits and impacts on the VAE-VC. Specifically, we first analyze… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

  2. arXiv:1804.02135  [pdf, other

    cs.CL cs.SD eess.AS

    Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder

    Authors: Kei Akuzawa, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Recent advances in neural autoregressive models have improve the performance of speech synthesis (SS). However, as they lack the ability to model global characteristics of speech (such as speaker individualities or speaking styles), particularly when these characteristics have not been labeled, making neural autoregressive SS systems more expressive is still an open issue. In this paper, we propos… ▽ More

    Submitted 11 February, 2019; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: Accepted by Interspeech 2018