Skip to main content

Showing 1–10 of 10 results for author: Chau, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2502.14685  [pdf, other

    cs.SD eess.AS

    SegAug: CTC-Aligned Segmented Augmentation For Robust RNN-Transducer Based Speech Recognition

    Authors: Khanh Le, Tuan Vu Ho, Dung Tran, Duc Thanh Chau

    Abstract: RNN-Transducer (RNN-T) is a widely adopted architecture in speech recognition, integrating acoustic and language modeling in an end-to-end framework. However, the RNN-T predictor tends to over-rely on consecutive word dependencies in training data, leading to high deletion error rates, particularly with less common or out-of-domain phrases. Existing solutions, such as regularization and data augme… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Accepted to ICASSP 2025

  2. arXiv:2502.14673  [pdf, other

    cs.SD eess.AS

    ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription

    Authors: Khanh Le, Tuan Vu Ho, Dung Tran, Duc Thanh Chau

    Abstract: Deploying ASR models at an industrial scale poses significant challenges in hardware resource management, especially for long-form transcription tasks where audio may last for hours. Large Conformer models, despite their capabilities, are limited to processing only 15 minutes of audio on an 80GB GPU. Furthermore, variable input lengths worsen inefficiencies, as standard batching leads to excessive… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Accepted to ICASSP 2025

  3. arXiv:2301.10966  [pdf

    cs.RO eess.SY

    Design of Mobile Manipulator for Fire Extinguisher Testing. Part II: Design and Simulation

    Authors: Thai Nguyen Chau, Xuan Quang Ngo, Van Tu Duong, Trong Trung Nguyen, Huy Hung Nguyen, Tan Tien Nguyen

    Abstract: All flames are extinguished as early as possible, or fire services have to deal with major conflagrations. This leads to the fact that the quality of fire extinguishers has become a very sensitive and important issue in firefighting. Inspired by the development of automatic fire fighting systems, this paper presents a mobile manipulator to evaluate the power of fire extinguishers, which is designe… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 10 pages, 15 figures, the 7th International Conference on Advanced Engineering, Theory and Applications

  4. arXiv:2110.13431  [pdf

    eess.SY

    Meter-Range Wireless Motor Drive for Pipeline Transportation

    Authors: Wei Liu, K. T. Chau, Hui Wang, Tengbo Yang

    Abstract: This paper proposes and implements a meter-range wireless motor drive (WMD) system for promising applications of underground pipeline transportations or in-pipe robots. To power a pipeline network beneath the earth, both the power grid and the control system are usually required to be deployed deep underground, thus increasing the construction cost, maintenance difficulty and system complexity. Th… ▽ More

    Submitted 16 January, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

  5. arXiv:2103.05824  [pdf

    eess.SY

    A Cyber-Physical Perspective to Pinning-Decision for Distributed Multi-Agent Control in Microgrid against Stochastic Communication Disruptions

    Authors: Samson S. Yu, Tat Kei Chau

    Abstract: In this study, we propose a decision-making strategy for pinning-based distributed multi-agent (PDMA) automatic generation control (AGC) in islanded microgrids against stochastic communication disruptions. The target microgrid is construed as a cyber-physical system, wherein the physical microgrid is modeled as an inverter-interfaced autonomous grid with detailed system dynamic formulation, and th… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 8 pages, 7 figures, 2 tables

  6. arXiv:2010.15250  [pdf, other

    cs.CV eess.IV

    Semantic video segmentation for autonomous driving

    Authors: Minh Triet Chau

    Abstract: We aim to solve semantic video segmentation in autonomous driving, namely road detection in real time video, using techniques discussed in (Shelhamer et al., 2016a). While fully convolutional network gives good result, we show that the speed can be halved while preserving the accuracy. The test dataset being used is KITTI, which consists of real footage from Germany's streets.

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: This work was done around 2017. Some minor changes were added

  7. arXiv:2008.07660  [pdf, ps, other

    cs.LG eess.SP

    Revisiting the Application of Feature Selection Methods to Speech Imagery BCI Datasets

    Authors: Javad Rahimipour Anaraki, Jae Moon, Tom Chau

    Abstract: Brain-computer interface (BCI) aims to establish and improve human and computer interactions. There has been an increasing interest in designing new hardware devices to facilitate the collection of brain signals through various technologies, such as wet and dry electroencephalogram (EEG) and functional near-infrared spectroscopy (fNIRS) devices. The promising results of machine learning methods ha… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: 5 pages, 2 figures

    ACM Class: I.2.8

  8. arXiv:2007.08668  [pdf, other

    cs.LG eess.SP stat.ML

    BRP-NAS: Prediction-based NAS using GCNs

    Authors: Łukasz Dudziak, Thomas Chau, Mohamed S. Abdelfattah, Royson Lee, Hyeji Kim, Nicholas D. Lane

    Abstract: Neural architecture search (NAS) enables researchers to automatically explore broad design spaces in order to improve efficiency of neural networks. This efficiency is especially important in the case of on-device deployment, where improvements in accuracy should be balanced out with computational demands of a model. In practice, performance metrics of model are computationally expensive to obtain… ▽ More

    Submitted 19 January, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Published at NeurIPS 2020

  9. arXiv:2002.05022  [pdf, other

    eess.SP cs.LG

    Best of Both Worlds: AutoML Codesign of a CNN and its Hardware Accelerator

    Authors: Mohamed S. Abdelfattah, Łukasz Dudziak, Thomas Chau, Royson Lee, Hyeji Kim, Nicholas D. Lane

    Abstract: Neural architecture search (NAS) has been very successful at outperforming human-designed convolutional neural networks (CNN) in accuracy, and when hardware information is present, latency as well. However, NAS-designed CNNs typically have a complicated topology, therefore, it may be difficult to design a custom hardware (HW) accelerator for such CNNs. We automate HW-CNN codesign using NAS by incl… ▽ More

    Submitted 6 March, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: accepted at DAC 2020

  10. arXiv:1912.04828  [pdf

    eess.SP q-bio.NC

    Navigating in Virtual Reality using Thought: The Development and Assessment of a Motor Imagery based Brain-Computer Interface

    Authors: Behnam Reyhani-Masoleh, Tom Chau

    Abstract: Brain-computer interface (BCI) systems have potential as assistive technologies for individuals with severe motor impairments. Nevertheless, individuals must first participate in many training sessions to obtain adequate data for optimizing the classification algorithm and subsequently acquiring brain-based control. Such traditional training paradigms have been dubbed unengaging and unmotivating f… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: 23 pages, 10 figures