
Showing 1–3 of 3 results for author: Byun, S

  1. arXiv:2411.09838

    eess.IV cs.CV

    OneNet: A Channel-Wise 1D Convolutional U-Net

    Authors: Sanghyun Byun, Kayvan Shah, Ayushi Gang, Christopher Apton, Jacob Song, Woo Seong Chung

    Abstract: Many state-of-the-art computer vision architectures leverage U-Net for its adaptability and efficient feature extraction. However, the multi-resolution convolutional design often leads to significant computational demands, limiting deployment on edge devices. We present a streamlined alternative: a 1D convolutional encoder that retains accuracy while enhancing its suitability for edge applications…

    Submitted 14 November, 2024; originally announced November 2024.

  2. arXiv:2406.13502

    cs.CL cs.SD eess.AS

    ManWav: The First Manchu ASR Model

    Authors: Jean Seo, Minha Kang, Sungjoo Byun, Sangah Lee

    Abstract: This study addresses the widening gap in Automatic Speech Recognition (ASR) research between high resource and extremely low resource languages, with a particular focus on Manchu, a critically endangered language. Manchu exemplifies the challenges faced by marginalized linguistic communities in accessing state-of-the-art technologies. In a pioneering effort, we introduce the first-ever Manchu ASR…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ACL2024/Field Matters

  3. arXiv:1904.10788

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Speech Emotion Recognition Using Multi-hop Attention Mechanism

    Authors: Seunghyun Yoon, Seokhyun Byun, Subhadeep Dey, Kyomin Jung

    Abstract: In this paper, we are interested in exploiting textual and acoustic data of an utterance for the speech emotion classification task. The baseline approach models the information from audio and text independently using two deep neural networks (DNNs). The outputs from both the DNNs are then fused for classification. As opposed to using knowledge from both the modalities separately, we propose a fra…

    Submitted 9 May, 2019; v1 submitted 23 April, 2019; originally announced April 2019.

    Comments: 5 pages, Accepted as a conference paper at ICASSP 2019 (oral presentation)