Showing 1–4 of 4 results for author: Kudo, H

Searching in archive cs.
  1. arXiv:2412.18217  [pdf, other]

    cs.SD cs.LG eess.AS

    U-Mamba-Net: A highly efficient Mamba-based U-net style network for noisy and reverberant speech separation

    Authors: Shaoxiang Dang, Tetsuya Matsumoto, Yoshinori Takeuchi, Hiroaki Kudo

    Abstract: The topic of speech separation involves separating mixed speech with multiple overlapping speakers into several streams, with each stream containing speech from only one speaker. Many highly effective models have emerged and proliferated rapidly over time. However, the size and computational load of these models have also increased accordingly. This is a disaster for the community, as researchers…

    Submitted 24 December, 2024; originally announced December 2024.

    Journal ref: 2024 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

  2. arXiv:2408.12279  [pdf, other]

    cs.SD cs.AI eess.AS

    Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features

    Authors: Shaoxiang Dang, Tetsuya Matsumoto, Yoshinori Takeuchi, Takashi Tsuboi, Yasuhiro Tanaka, Daisuke Nakatsubo, Satoshi Maesawa, Ryuta Saito, Masahisa Katsuno, Hiroaki Kudo

    Abstract: The potential of deep learning in clinical speech processing is immense, yet the hurdles of limited and imbalanced clinical data samples loom large. This article addresses these challenges by showcasing the utilization of automatic speech recognition and self-supervised learning representations, pre-trained on extensive datasets of normal speech. This innovative approach aims to estimate voice qua…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by Interspeech 2024

  3. arXiv:1609.06041  [pdf, ps, other]

    physics.med-ph cs.CV math.NA

    A very fast iterative algorithm for TV-regularized image reconstruction with applications to low-dose and few-view CT

    Authors: Hiroyuki Kudo, Fukashi Yamazaki, Takuya Nemoto, Keita Takaki

    Abstract: This paper concerns iterative reconstruction for low-dose and few-view CT by minimizing a data-fidelity term regularized with the Total Variation (TV) penalty. We propose a very fast iterative algorithm to solve this problem. The algorithm derivation is outlined as follows. First, the original minimization problem is reformulated into the saddle point (primal-dual) problem by using the Lagrangian…

    Submitted 20 September, 2016; originally announced September 2016.

    Comments: 16 pages, 8 figures, SPIE Optics + Photonics 2016 Conference (Developments in X-Ray Tomography X) Paper No. 9967-37

  4. arXiv:1609.06020  [pdf, ps, other]

    physics.med-ph cs.CV math.NA

    Proposal of fault-tolerant tomographic image reconstruction

    Authors: Hiroyuki Kudo, Keita Takaki, Fukashi Yamazaki, Takuya Nemoto

    Abstract: This paper deals with tomographic image reconstruction under the situation where some of projection data bins are contaminated with abnormal data. Such situations occur in various instances of tomography. We propose a new reconstruction algorithm called the Fault-Tolerant reconstruction outlined as follows. The least-squares (L2-norm) error function ||Ax-b||_2^2 used in ordinary iterative reconstr…

    Submitted 20 September, 2016; originally announced September 2016.

    Comments: 12 pages, 5 figures, SPIE Optics + Photonics 2016 Conference (Developments in X-Ray Tomography X) Paper No. 9967-55