Showing 1–3 of 3 results for author: Gulin, J

Searching in archive eess.
  1. arXiv:2408.17166  [pdf, other]

    eess.AS cs.LG

    Learning Multi-Target TDOA Features for Sound Event Localization and Detection

    Authors: Axel Berg, Johanna Engman, Jens Gulin, Karl Åström, Magnus Oskarsson

    Abstract: Sound event localization and detection (SELD) systems using audio recordings from a microphone array rely on spatial cues for determining the location of sound events. As a consequence, the localization performance of such systems is to a large extent determined by the quality of the audio features that are used as inputs to the system. We propose a new feature, based on neural generalized cross-c…

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: DCASE 2024

  2. wav2pos: Sound Source Localization using Masked Autoencoders

    Authors: Axel Berg, Jens Gulin, Mark O'Connor, Chuteng Zhou, Karl Åström, Magnus Oskarsson

    Abstract: We present a novel approach to the 3D sound source localization task for distributed ad-hoc microphone arrays by formulating it as a set-to-set regression problem. By training a multi-modal masked autoencoder model that operates on audio recordings and microphone coordinates, we show that such a formulation allows for accurate localization of the sound source, by reconstructing coordinates masked…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: IPIN 2024

  3. arXiv:2309.02961  [pdf, other]

    eess.SP cs.CV cs.SD eess.AS

    LuViRA Dataset Validation and Discussion: Comparing Vision, Radio, and Audio Sensors for Indoor Localization

    Authors: Ilayda Yaman, Guoda Tian, Erik Tegler, Jens Gulin, Nikhil Challa, Fredrik Tufvesson, Ove Edfors, Kalle Astrom, Steffen Malkowsky, Liang Liu

    Abstract: We present a unique comparative analysis, and evaluation of vision, radio, and audio based localization algorithms. We create the first baseline for the aforementioned sensors using the recently published Lund University Vision, Radio, and Audio (LuViRA) dataset, where all the sensors are synchronized and measured in the same environment. Some of the challenges of using each specific sensor for in…

    Submitted 25 April, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 10 pages, 11 figures

    Journal ref: IEEE Journal of Indoor and Seamless Positioning and Navigation (2024) 1-11