
Showing 1–8 of 8 results for author: Fukumoto, Y

Searching in archive cs.
  1. arXiv:2506.00422  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition

    Authors: Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, Chyi-Jiunn Lin, Shinji Watanabe

    Abstract: Contextual biasing (CB) improves automatic speech recognition for rare and unseen phrases. Recent studies have introduced dynamic vocabulary, which represents context phrases as expandable tokens in autoregressive (AR) models. This method improves CB accuracy but with slow inference speed. While dynamic vocabulary can be applied to non-autoregressive (NAR) models, such as connectionist temporal cl…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: Accepted to Interspeech 2025

  2. arXiv:2406.02950  [pdf, other]

    eess.AS cs.CL cs.SD

    Joint Beam Search Integrating CTC, Attention, and Transducer Decoders

    Authors: Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Brian Yan, Jiatong Shi, Yifan Peng, Shinji Watanabe

    Abstract: End-to-end automatic speech recognition (E2E-ASR) can be classified by its decoder architectures, such as connectionist temporal classification (CTC), recurrent neural network transducer (RNN-T), attention-based encoder-decoder, and Mask-CTC models. Each decoder architecture has advantages and disadvantages, leading practitioners to switch between these different models depending on application re…

    Submitted 14 January, 2025; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE/ACM Transactions on Audio, Speech, and Language Processing

  3. arXiv:2405.13344  [pdf, other]

    eess.AS cs.CL cs.SD

    Contextualized Automatic Speech Recognition with Dynamic Vocabulary

    Authors: Yui Sudo, Yosuke Fukumoto, Muhammad Shakeel, Yifan Peng, Shinji Watanabe

    Abstract: Deep biasing (DB) enhances the performance of end-to-end automatic speech recognition (E2E-ASR) models for rare words or contextual phrases using a bias list. However, most existing methods treat bias phrases as sequences of subwords in a predefined static vocabulary. This naive sequence decomposition produces unnatural token patterns, significantly lowering their occurrence probability. More adva…

    Submitted 30 August, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2401.10449  [pdf, other]

    eess.AS cs.CL cs.SD

    Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search

    Authors: Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Yifan Peng, Shinji Watanabe

    Abstract: End-to-end (E2E) automatic speech recognition (ASR) methods exhibit remarkable performance. However, since the performance of such methods is intrinsically linked to the context present in the training data, E2E-ASR methods do not perform as desired for unseen user contexts (e.g., technical terms, personal names, and playlists). Thus, E2E-ASR methods must be easily contextualized by the user or de…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  5. Security Camera Movie and ERP Data Matching System to Prevent Theft

    Authors: Yoji Yamato, Yoshifumi Fukumoto, Hiroki Kumazaki

    Abstract: "(c) 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works." In this pape…

    Submitted 21 September, 2024; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: 2 pages, 2 figures, IEEE Consumer Communications and Networking Conference (CCNC2017), pp.1021-1022, Jan. 2017

    Journal ref: IEEE Consumer Communications and Networking Conference (CCNC2017), pp.1021-1022, Jan. 2017

  6. arXiv:1612.02640  [pdf]

    cs.DC

    Realtime Predictive Maintenance with Lambda Architecture

    Authors: Yoji Yamato, Hiroki Kumazaki, Yoshifumi Fukumoto

    Abstract: Recently, IoT technologies have been progressed and applications of maintenance area are expected. However, IoT maintenance applications are not spread in Japan yet because of insufficient analysis of real time situation, high cost to collect sensing data and to configure failure detection rules. In this paper, using lambda architecture concept, we propose a maintenance platform in which edge node…

    Submitted 8 December, 2016; originally announced December 2016.

    Comments: 4 pages, in Japanese, 3 figures, IEICE Technical Report, SC2016-28, Nov. 2016

    Journal ref: IEICE Technical Report, SC2016-28, Nov. 2016. (c) 2016 IEICE

  7. arXiv:1612.01603  [pdf]

    cs.DC

    Study of shoplifting prevention using image analysis and ERP check

    Authors: Yoji Yamato, Yoshifumi Fukumoto, Hiroki Kumazaki

    Abstract: In this paper, we propose a SaaS service which prevents shoplifting using image analysis and ERP. In Japan, total damage of shoplifting reaches 450 billion yen and more than 1000 small shops gave up their businesses because of shoplifting. Based on recent cloud technology and data analysis technology, we propose a shoplifting prevention service with image analysis of security camera and ERP data c…

    Submitted 5 December, 2016; originally announced December 2016.

    Comments: 4 pages, in Japanese, 2 figures, IEICE Technical Report, SC2016-14, Aug. 2016

    Journal ref: IEICE Technical Report, SC2016-14, Aug. 2016. (c) 2016 IEICE

  8. arXiv:1611.09944

    cs.DC cs.CY

    Proposal of Real Time Predictive Maintenance Platform with 3D Printer for Business Vehicles

    Authors: Yoji Yamato, Yoshifumi Fukumoto, Hiroki Kumazaki

    Abstract: This paper proposes a maintenance platform for business vehicles which detects failure sign using IoT data on the move, orders to create repair parts by 3D printers and to deliver them to the destination. Recently, IoT and 3D printer technologies have been progressed and application cases to manufacturing and maintenance have been increased. Especially in air flight industry, various sensing data…

    Submitted 30 January, 2023; v1 submitted 29 November, 2016; originally announced November 2016.

    Comments: The description of evaluation was insufficient

    Journal ref: 5th International Conference on Software and Information Engineering (ICSIE 2016), pp.6-10, May 2016. (c) 2016 ICSIE2016