Skip to main content

Showing 1–5 of 5 results for author: Hauret, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.04495  [pdf, ps, other

    eess.AS

    French Listening Tests for the Assessment of Intelligibility, Quality, and Identity of Body-Conducted Speech Enhancement

    Authors: Thomas Joubaud, Julien Hauret, Véronique Zimpfer, Éric Bavu

    Abstract: This study evaluates the Extreme Bandwidth Extension Network (EBEN) model on body-conduction sensors through listening tests. Using the Vibravox dataset, we assess intelligibility with a French Modified Rhyme Test, speech quality with a MUSHRA (MUltiple Stimuli with Hidden Reference and Anchor) protocol and speaker identity preservation with an A/B identification task. The experiments involved mal… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Submitted to Interspeech 2025 (accepted)

  2. arXiv:2506.04492  [pdf, ps, other

    eess.AS

    Bringing Interpretability to Neural Audio Codecs

    Authors: Samir Sadok, Julien Hauret, Éric Bavu

    Abstract: The advent of neural audio codecs has increased in popularity due to their potential for efficiently modeling audio with transformers. Such advanced codecs represent audio from a highly continuous waveform to low-sampled discrete units. In contrast to semantic units, acoustic units may lack interpretability because their training objectives primarily focus on reconstruction performance. This paper… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Submitted to Interspeech 2025 (accepted)

  3. arXiv:2407.11828  [pdf, other

    eess.AS cs.LG

    Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors

    Authors: Julien Hauret, Malo Olivier, Thomas Joubaud, Christophe Langrenne, Sarah Poirée, Véronique Zimpfer, Éric Bavu

    Abstract: Vibravox is a dataset compliant with the General Data Protection Regulation (GDPR) containing audio recordings using five different body-conduction audio sensors: two in-ear microphones, two bone conduction vibration pickups, and a laryngophone. The dataset also includes audio data from an airborne microphone used as a reference. The Vibravox corpus contains 45 hours per sensor of speech samples a… ▽ More

    Submitted 26 March, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 23 pages, 42 figures

  4. Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture

    Authors: Julien Hauret, Thomas Joubaud, Véronique Zimpfer, Éric Bavu

    Abstract: This paper presents a configurable version of Extreme Bandwidth Extension Network (EBEN), a Generative Adversarial Network (GAN) designed to improve audio captured with body-conduction microphones. We show that although these microphones significantly reduce environmental noise, this insensitivity to ambient noise happens at the expense of the bandwidth of the speech signal acquired by the wearer… ▽ More

    Submitted 12 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: Accepted in IEEE/ACM Transactions on Audio, Speech and Language Processing on 14/08/2023

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing (2023 - Volume: 31) - pp. 3499 - 3512

  5. EBEN: Extreme bandwidth extension network applied to speech signals captured with noise-resilient body-conduction microphones

    Authors: Julien Hauret, Thomas Joubaud, Véronique Zimpfer, Éric Bavu

    Abstract: In this paper, we present Extreme Bandwidth Extension Network (EBEN), a Generative Adversarial network (GAN) that enhances audio measured with body-conduction microphones. This type of capture equipment suppresses ambient noise at the expense of speech bandwidth, thereby requiring signal enhancement techniques to recover the wideband speech signal. EBEN leverages a multiband decomposition of the r… ▽ More

    Submitted 3 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 5 pages, 5 figures, accepted to ICASSP 2023

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)