Showing 1–3 of 3 results for author: Yeow, J

  1. arXiv:2507.00874  [pdf, ps, other]

    eess.AS

    Improving Stereo 3D Sound Event Localization and Detection: Perceptual Features, Stereo-specific Data Augmentation, and Distance Normalization

    Authors: Jun-Wei Yeow, Ee-Leng Tan, Santi Peksi, Woon-Seng Gan

    Abstract: This technical report presents our submission to Task 3 of the DCASE 2025 Challenge: Stereo Sound Event Localization and Detection (SELD) in Regular Video Content. We address the audio-only task in this report and introduce several key contributions. First, we design perceptually-motivated input features that improve event detection, sound source localization, and distance estimation. Second, we a…

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: Technical report for DCASE 2025 Challenge Task 3

  2. arXiv:2409.11700  [pdf, other]

    eess.SP

    Real-Time Sound Event Localization and Detection: Deployment Challenges on Edge Devices

    Authors: Jun Wei Yeow, Ee-Leng Tan, Jisheng Bai, Santi Peksi, Woon-Seng Gan

    Abstract: Sound event localization and detection (SELD) is critical for various real-world applications, including smart monitoring and Internet of Things (IoT) systems. Although deep neural networks (DNNs) represent the state-of-the-art approach for SELD, their significant computational complexity and model sizes present challenges for deployment on resource-constrained edge devices, especially under real-…

    Submitted 18 September, 2024; originally announced September 2024.

    Comments: Submitted to ICASSP'25. Code is available at https://github.com/itsjunwei/Realtime-SELD-Edge

  3. arXiv:2407.09021  [pdf, other]

    eess.AS

    Squeeze-and-Excite ResNet-Conformers for Sound Event Localization, Detection, and Distance Estimation for DCASE 2024 Challenge

    Authors: Jun Wei Yeow, Ee-Leng Tan, Jisheng Bai, Santi Peksi, Woon-Seng Gan

    Abstract: This technical report details our systems submitted for Task 3 of the DCASE 2024 Challenge: Audio and Audiovisual Sound Event Localization and Detection (SELD) with Source Distance Estimation (SDE). We address only the audio-only SELD with SDE (SELDDE) task in this report. We propose to improve the existing ResNet-Conformer architectures with Squeeze-and-Excitation blocks in order to introduce add…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Technical report for DCASE 2024 Challenge Task 3