Skip to main content

Showing 1–17 of 17 results for author: Brendel, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.16404  [pdf, ps, other

    eess.AS cs.SD

    UBGAN: Enhancing Coded Speech with Blind and Guided Bandwidth Extension

    Authors: Kishan Gupta, Srikanth Korse, Andreas Brendel, Nicola Pia, Guillaume Fuchs

    Abstract: In practical application of speech codecs, a multitude of factors such as the quality of the radio connection, limiting hardware or required user experience necessitate trade-offs between achievable perceptual quality, engendered bitrate and computational complexity. Most conventional and neural speech codecs operate on wideband (WB) speech signals to achieve this compromise. To further enhance th… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  2. arXiv:2504.08470  [pdf, ps, other

    cs.SD cs.AI cs.MM eess.AS

    On the Design of Diffusion-based Neural Speech Codecs

    Authors: Pietro Foti, Andreas Brendel

    Abstract: Recently, neural speech codecs (NSCs) trained as generative models have shown superior performance compared to conventional codecs at low bitrates. Although most state-of-the-art NSCs are trained as Generative Adversarial Networks (GANs), Diffusion Models (DMs), a recent class of generative models, represent a promising alternative due to their superior performance in image generation relative to… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

  3. arXiv:2410.13599  [pdf, other

    eess.AS cs.SD eess.SP

    GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning

    Authors: Shrishti Saha Shetu, Emanuël A. P. Habets, Andreas Brendel

    Abstract: Enhancing speech quality under adverse SNR conditions remains a significant challenge for discriminative deep neural network (DNN)-based approaches. In this work, we propose DisCoGAN, which is a time-frequency-domain generative adversarial network (GAN) conditioned by the latent features of a discriminative model pre-trained for speech enhancement in low SNR scenarios. Our proposed method achieves… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 5 pages, 2 figures

  4. arXiv:2408.14582  [pdf, ps, other

    eess.AS cs.SD

    Comparative Analysis Of Discriminative Deep Learning-Based Noise Reduction Methods In Low SNR Scenarios

    Authors: Shrishti Saha Shetu, Emanuël A. P. Habets, Andreas Brendel

    Abstract: In this study, we conduct a comparative analysis of deep learning-based noise reduction methods in low signal-to-noise ratio (SNR) scenarios. Our investigation primarily focuses on five key aspects: The impact of training data, the influence of various loss functions, the effectiveness of direct and indirect speech estimation techniques, the efficacy of masking, mapping, and deep filtering methodo… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 5 pages, 4 figures

  5. arXiv:2406.08900  [pdf, other

    eess.AS cs.SD eess.SP

    On Improving Error Resilience of Neural End-to-End Speech Coders

    Authors: Kishan Gupta, Nicola Pia, Srikanth Korse, Andreas Brendel, Guillaume Fuchs, Markus Multrus

    Abstract: Error resilient tools like Packet Loss Concealment (PLC) and Forward Error Correction (FEC) are essential to maintain a reliable speech communication for applications like Voice over Internet Protocol (VoIP), where packets are frequently delayed and lost. In recent times, end-to-end neural speech codecs have seen a significant rise, due to their ability to transmit speech signal at low bitrates bu… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2405.08417  [pdf, other

    eess.AS cs.SD

    Neural Speech Coding for Real-time Communications using Constant Bitrate Scalar Quantization

    Authors: Andreas Brendel, Nicola Pia, Kishan Gupta, Lyonel Behringer, Guillaume Fuchs, Markus Multrus

    Abstract: Neural audio coding has emerged as a vivid research direction by promising good audio quality at very low bitrates unachievable by classical coding techniques. Here, end-to-end trainable autoencoder-like models represent the state of the art, where a discrete representation in the bottleneck of the autoencoder is learned. This allows for efficient transmission of the input audio signal. The learne… ▽ More

    Submitted 19 September, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  7. A Unifying View on Blind Source Separation of Convolutive Mixtures based on Independent Component Analysis

    Authors: Andreas Brendel, Thomas Haubner, Walter Kellermann

    Abstract: In many daily-life scenarios, acoustic sources recorded in an enclosure can only be observed with other interfering sources. Hence, convolutive Blind Source Separation (BSS) is a central problem in audio signal processing. Methods based on Independent Component Analysis (ICA) are especially important in this field as they require only few and weak assumptions and allow for blindness regarding the… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  8. arXiv:2201.09946  [pdf, other

    eess.AS cs.SD

    Microphone Utility Estimation in Acoustic Sensor Networks using Single-Channel Signal Features

    Authors: Michael Günther, Andreas Brendel, Walter Kellermann

    Abstract: In multichannel signal processing with distributed sensors, choosing the optimal subset of observed sensor signals to be exploited is crucial in order to maximize algorithmic performance and reduce computational load, ideally both at the same time. In the acoustic domain, signal cross-correlation is a natural choice to quantify the usefulness of microphone signals, i.e., microphone utility, for ar… ▽ More

    Submitted 14 January, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: submitted to EURASIP Journal on Audio, Speech, and Music Processing

  9. arXiv:2110.02189  [pdf, ps, other

    eess.AS cs.SD

    Manifold learning-supported estimation of relative transfer functions for spatial filtering

    Authors: Andreas Brendel, Johannes Zeitler, Walter Kellermann

    Abstract: Many spatial filtering algorithms used for voice capture in, e.g., teleconferencing applications, can benefit from or even rely on knowledge of Relative Transfer Functions (RTFs). Accordingly, many RTF estimators have been proposed which, however, suffer from performance degradation under acoustically adverse conditions or need prior knowledge on the properties of the interfering sources. While st… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  10. arXiv:2107.06253  [pdf, other

    cs.PL

    Bottom-up Synthesis of Recursive Functional Programs using Angelic Execution

    Authors: Anders Miltner, Adrian Trejo Nuñez, Ana Brendel, Swarat Chaudhuri, Isil Dillig

    Abstract: We present a novel bottom-up method for the synthesis of functional recursive programs. While bottom-up synthesis techniques can work better than top-down methods in certain settings, there is no prior technique for synthesizing recursive programs from logical specifications in a purely bottom-up fashion. The main challenge is that effective bottom-up methods need to execute sub-expressions of the… ▽ More

    Submitted 8 December, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  11. arXiv:2106.01262  [pdf, ps, other

    eess.AS cs.SD eess.SP

    End-To-End Deep Learning-Based Adaptation Control for Frequency-Domain Adaptive System Identification

    Authors: Thomas Haubner, Andreas Brendel, Walter Kellermann

    Abstract: We present a novel end-to-end deep learning-based adaptation control algorithm for frequency-domain adaptive system identification. The proposed method exploits a deep neural network to map observed signal features to corresponding step-sizes which control the filter adaptation. The parameters of the network are optimized in an end-to-end fashion by minimizing the average normalized system distanc… ▽ More

    Submitted 4 March, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted for IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022, Singapore, Singapore

  12. A Synergistic Kalman- and Deep Postfiltering Approach to Acoustic Echo Cancellation

    Authors: Thomas Haubner, Mhd. Modar Halimeh, Andreas Brendel, Walter Kellermann

    Abstract: We introduce a synergistic approach to double-talk robust acoustic echo cancellation combining adaptive Kalman filtering with a deep neural network-based postfilter. The proposed algorithm overcomes the well-known limitations of Kalman filter-based adaptation control in scenarios characterized by abrupt echo path changes. As the key innovation, we suggest to exploit the different statistical prope… ▽ More

    Submitted 4 March, 2022; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted for European Signal Processing Conference (EUSIPCO), Dublin, Ireland, August 2021

  13. arXiv:2011.03432  [pdf, ps, other

    eess.AS cs.SD

    Misalignment Recognition in Acoustic Sensor Networks using a Semi-supervised Source Estimation Method and Markov Random Fields

    Authors: Gabriel F Miller, Andreas Brendel, Walter Kellermann, Sharon Gannot

    Abstract: In this paper, we consider the problem of acoustic source localization by acoustic sensor networks (ASNs) using a promising, learning-based technique that adapts to the acoustic environment. In particular, we look at the scenario when a node in the ASN is displaced from its position during training. As the mismatch between the ASN used for learning the localization model and the one after a node d… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

  14. arXiv:2007.01579  [pdf, ps, other

    eess.AS cs.SD

    Noise-Robust Adaptation Control for Supervised Acoustic System Identification Exploiting A Noise Dictionary

    Authors: Thomas Haubner, Andreas Brendel, Mohamed Elminshawi, Walter Kellermann

    Abstract: We present a noise-robust adaptation control strategy for block-online supervised acoustic system identification by exploiting a noise dictionary. The proposed algorithm takes advantage of the pronounced spectral structure which characterizes many types of interfering noise signals. We model the noisy observations by a linear Gaussian Discrete Fourier Transform-domain state space model whose param… ▽ More

    Submitted 3 February, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

  15. arXiv:2007.01543  [pdf, other

    eess.AS cs.SD

    Online Supervised Acoustic System Identification exploiting Prelearned Local Affine Subspace Models

    Authors: Thomas Haubner, Andreas Brendel, Walter Kellermann

    Abstract: In this paper we present a novel algorithm for improved block-online supervised acoustic system identification in adverse noise scenarios by exploiting prior knowledge about the space of Room Impulse Responses (RIRs). The method is based on the assumption that the variability of the unknown RIRs is controlled by only few physical parameters, describing, e.g., source position movements, and thus is… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

  16. arXiv:2006.13769  [pdf, other

    eess.AS cs.SD

    Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Networks

    Authors: Tobias Gburrek, Joerg Schmalenstroeer, Andreas Brendel, Walter Kellermann, Reinhold Haeb-Umbach

    Abstract: We present an approach to deep neural network based (DNN-based) distance estimation in reverberant rooms for supporting geometry calibration tasks in wireless acoustic sensor networks. Signal diffuseness information from acoustic signals is aggregated via the coherent-to-diffuse power ratio to obtain a distance-related feature, which is mapped to a source-to-microphone distance estimate by means o… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: Accepted for EUSIPCO 2020

  17. arXiv:1511.04063  [pdf, ps, other

    cs.SD

    Single-Channel Maximum-Likelihood T60 Estimation Exploiting Subband Information

    Authors: Heinrich Loellmann, Andreas Brendel, Peter Vary, Walter Kellermann

    Abstract: This contribution presents four algorithms developed by the authors for single-channel fullband and subband T60 estimation within the ACE challenge. The blind estimation of the fullband reverberation time (RT) by maximum-likelihood (ML) estimation based on [15] is considered as baseline approach. An improvement of this algorithm is devised where an energy-weighted averaging of the upper subband RT… ▽ More

    Submitted 12 November, 2015; originally announced November 2015.

    Comments: In Proceedings of the ACE Challenge Workshop - a satellite event of IEEE-WASPAA 2015 (arXiv:1510.00383)

    Report number: ACEChallenge/2015/05