Showing 1–9 of 9 results for author: Åström, K

Searching in archive eess.
  1. arXiv:2411.13179  [pdf, other]

    cs.SD cs.CV eess.AS

    SONNET: Enhancing Time Delay Estimation by Leveraging Simulated Audio

    Authors: Erik Tegler, Magnus Oskarsson, Kalle Åström

    Abstract: Time delay estimation, or Time-Difference-Of-Arrival estimation, is a critical component of multiple localization applications such as multilateration, direction of arrival, and self-calibration. The task is to estimate the time difference between a signal arriving at two different sensors. For the audio sensor modality, most current systems are based on classical methods such as the Generalized Cro…

    Submitted 20 November, 2024; originally announced November 2024.

  2. arXiv:2408.17166  [pdf, other]

    eess.AS cs.LG

    Learning Multi-Target TDOA Features for Sound Event Localization and Detection

    Authors: Axel Berg, Johanna Engman, Jens Gulin, Karl Åström, Magnus Oskarsson

    Abstract: Sound event localization and detection (SELD) systems using audio recordings from a microphone array rely on spatial cues for determining the location of sound events. As a consequence, the localization performance of such systems is to a large extent determined by the quality of the audio features that are used as inputs to the system. We propose a new feature, based on neural generalized cross-c…

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: DCASE 2024

  3. wav2pos: Sound Source Localization using Masked Autoencoders

    Authors: Axel Berg, Jens Gulin, Mark O'Connor, Chuteng Zhou, Karl Åström, Magnus Oskarsson

    Abstract: We present a novel approach to the 3D sound source localization task for distributed ad-hoc microphone arrays by formulating it as a set-to-set regression problem. By training a multi-modal masked autoencoder model that operates on audio recordings and microphone coordinates, we show that such a formulation allows for accurate localization of the sound source, by reconstructing coordinates masked…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: IPIN 2024

  4. arXiv:2309.02961  [pdf, other]

    eess.SP cs.CV cs.SD eess.AS

    LuViRA Dataset Validation and Discussion: Comparing Vision, Radio, and Audio Sensors for Indoor Localization

    Authors: Ilayda Yaman, Guoda Tian, Erik Tegler, Jens Gulin, Nikhil Challa, Fredrik Tufvesson, Ove Edfors, Kalle Åström, Steffen Malkowsky, Liang Liu

    Abstract: We present a unique comparative analysis and evaluation of vision, radio, and audio based localization algorithms. We create the first baseline for the aforementioned sensors using the recently published Lund University Vision, Radio, and Audio (LuViRA) dataset, where all the sensors are synchronized and measured in the same environment. Some of the challenges of using each specific sensor for in…

    Submitted 25 April, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 10 pages, 11 figures

    Journal ref: IEEE Journal of Indoor and Seamless Positioning and Navigation (2024) 1-11

  5. arXiv:2302.05309  [pdf, other]

    eess.SP cs.CV cs.SD eess.AS

    The LuViRA Dataset: Synchronized Vision, Radio, and Audio Sensors for Indoor Localization

    Authors: Ilayda Yaman, Guoda Tian, Martin Larsson, Patrik Persson, Michiel Sandra, Alexander Dürr, Erik Tegler, Nikhil Challa, Henrik Garde, Fredrik Tufvesson, Kalle Åström, Ove Edfors, Steffen Malkowsky, Liang Liu

    Abstract: We present a synchronized multisensory dataset for accurate and robust indoor localization: the Lund University Vision, Radio, and Audio (LuViRA) Dataset. The dataset includes color images, corresponding depth maps, inertial measurement unit (IMU) readings, channel response between a 5G massive multiple-input and multiple-output (MIMO) testbed and user equipment, audio recorded by 12 microphones,…

    Submitted 26 April, 2024; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: 7 pages, 7 figures, Accepted to ICRA 2024

  6. Extending GCC-PHAT using Shift Equivariant Neural Networks

    Authors: Axel Berg, Mark O'Connor, Kalle Åström, Magnus Oskarsson

    Abstract: Speaker localization using microphone arrays depends on accurate time delay estimation techniques. For decades, methods based on the generalized cross correlation with phase transform (GCC-PHAT) have been widely adopted for this purpose. Recently, the GCC-PHAT has also been used to provide input features to neural networks in order to remove the effects of noise and reverberation, but at the cost…

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Proceedings of INTERSPEECH

    Journal ref: Proc. Interspeech 2022, 1791-1795

  7. arXiv:2205.11299  [pdf, other]

    cs.SD eess.AS eess.SP

    Multiple Offsets Multilateration: a new paradigm for sensor network calibration with unsynchronized reference nodes

    Authors: Luca Ferranti, Kalle Åström, Magnus Oskarsson, Jani Boutellier, Juho Kannala

    Abstract: Positioning using wave signal measurements is used in several applications, such as GPS systems, structure from sound, and Wi-Fi-based positioning. Mathematically, such problems require the computation of the positions of receivers and/or transmitters as well as time offsets if the devices are unsynchronized. In this paper, we expand the previous state-of-the-art on positioning formulations by intro…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: accepted to ICASSP2022

  8. arXiv:2110.01099  [pdf, other]

    eess.SY

    Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration

    Authors: Marcus Greiff, Patrik Persson, Zhiyong Sun, Karl Åström, Anders Robertsson

    Abstract: We present a trajectory tracking controller for a quadrotor unmanned aerial vehicle (UAV) configured on $SU(2)\times R^3$, and relate this result to a family of geometric tracking controllers on $SO(3)\times R^3$. The theoretical results are complemented by simulation examples, and the controller is subsequently implemented in practice and integrated with a simultaneous localization and mapping (S…

    Submitted 3 October, 2021; originally announced October 2021.

    Comments: 18 pages, 9 figures, extended version of ACC'22 paper

  9. arXiv:2005.10298  [pdf, ps, other]

    eess.SP cs.NI

    Sensor Networks TDOA Self-Calibration: 2D Complexity Analysis and Solutions

    Authors: Luca Ferranti, Kalle Åström, Magnus Oskarsson, Jani Boutellier, Juho Kannala

    Abstract: Given a network of receivers and transmitters, the process of determining their positions from measured pseudoranges is known as network self-calibration. In this paper we consider 2D networks with synchronized receivers but unsynchronized transmitters and the corresponding calibration techniques, known as Time-Difference-Of-Arrival (TDOA) techniques. Despite previous work, TDOA self-calibration i…

    Submitted 22 October, 2020; v1 submitted 20 May, 2020; originally announced May 2020.