Skip to main content

Showing 1–2 of 2 results for author: Ogata, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:1811.02735  [pdf, other

    eess.AS cs.CL cs.SD

    CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments

    Authors: Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata

    Abstract: Casual conversations involving multiple speakers and noises from surrounding devices are common in everyday environments, which degrades the performances of automatic speech recognition systems. These challenging characteristics of environments are the target of the CHiME-5 challenge. By employing a convolutional neural network (CNN)-based multichannel end-to-end speech recognition system, this st… ▽ More

    Submitted 20 June, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: 5 pages, 1 figure, EUSIPCO 2019

  2. arXiv:1807.01126  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Weakly Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation

    Authors: Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai, Tetsuya Ogata

    Abstract: Synthesizing human's movements such as dancing is a flourishing research field which has several applications in computer graphics. Recent studies have demonstrated the advantages of deep neural networks (DNNs) for achieving remarkable performance in motion and music tasks with little effort for feature pre-processing. However, applying DNNs for generating dance to a piece of music is nevertheless… ▽ More

    Submitted 20 June, 2019; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 8 pages, 7 figures. Proc. IJCNN 2019