Showing 1–6 of 6 results for author: Grumiaux, P

Searching in archive cs.
  1. arXiv:2311.07363  [pdf, other]

    cs.SD eess.AS

    Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model

    Authors: Pierre-Amaury Grumiaux, Mathieu Lagrange

    Abstract: The task of bandwidth extension addresses the generation of missing high frequencies of audio signals based on knowledge of the low-frequency part of the sound. This task applies to various problems, such as audio coding or audio restoration. In this article, we focus on efficient bandwidth extension of monophonic and polyphonic musical signals using a differentiable digital signal processing (DDS…

    Submitted 27 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in EURASIP Journal on Audio, Speech, and Music Processing

  2. arXiv:2109.03465  [pdf, other]

    cs.SD cs.LG eess.AS

    A Survey of Sound Source Localization with Deep Learning Methods

    Authors: Pierre-Amaury Grumiaux, Srđan Kitić, Laurent Girin, Alexandre Guérin

    Abstract: This article is a survey on deep learning methods for single and multiple sound source localization. We are particularly interested in sound source localization in indoor/domestic environment, where reverberation and diffuse noise are present. We provide an exhaustive topography of the neural-based localization literature in this context, organized according to several aspects: the neural network…

    Submitted 17 June, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted for publication in The Journal of the Acoustical Society of America

  3. arXiv:2107.11066  [pdf, other]

    cs.SD eess.AS

    SALADnet: Self-Attentive multisource Localization in the Ambisonics Domain

    Authors: Pierre-Amaury Grumiaux, Srdan Kitic, Prerak Srivastava, Laurent Girin, Alexandre Guérin

    Abstract: In this work, we propose a novel self-attention based neural network for robust multi-speaker localization from Ambisonics recordings. Starting from a state-of-the-art convolutional recurrent neural network, we investigate the benefit of replacing the recurrent layers by self-attention encoders, inherited from the Transformer architecture. We evaluate these models on synthetic and real-world data,…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: Accepted to Workshop on Applications of Signal Processing to Audio and Acoustics

  4. arXiv:2105.01897  [pdf, other]

    cs.SD eess.AS

    Improved feature extraction for CRNN-based multiple sound source localization

    Authors: Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin

    Abstract: In this work, we propose to extend a state-of-the-art multi-source localization system based on a convolutional recurrent neural network and Ambisonics signals. We significantly improve the performance of the baseline network by changing the layout between convolutional and pooling layers. We propose several configurations with more convolutional layers and smaller pooling sizes in-between, so tha…

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 5 pages, 2 figures. Accepted to EUSIPCO 2021

  5. arXiv:2101.01977  [pdf, other]

    cs.SD eess.AS

    Multichannel CRNN for Speaker Counting: an Analysis of Performance

    Authors: Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin

    Abstract: Speaker counting is the task of estimating the number of people that are simultaneously speaking in an audio recording. For several audio processing tasks such as speaker diarization, separation, localization and tracking, knowing the number of speakers at each timestep is a prerequisite, or at least it can be a strong advantage, in addition to enabling a low latency processing. In a previous work…

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: Presented at Forum Acusticum 2020

  6. arXiv:2003.07839  [pdf, other]

    cs.SD eess.AS

    High-Resolution Speaker Counting In Reverberant Rooms Using CRNN With Ambisonics Features

    Authors: Pierre-Amaury Grumiaux, Srdjan Kitic, Laurent Girin, Alexandre Guérin

    Abstract: Speaker counting is the task of estimating the number of people that are simultaneously speaking in an audio recording. For several audio processing tasks such as speaker diarization, separation, localization and tracking, knowing the number of speakers at each timestep is a prerequisite, or at least it can be a strong advantage, in addition to enabling a low latency processing. For that purpose,…

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: 5 pages, 1 figure