Showing 1–9 of 9 results for author: Halimeh, M M

  1. arXiv:2505.19760  [pdf, ps, other]

    eess.AS

    Navigating PESQ: Up-to-Date Versions and Open Implementations

    Authors: Matteo Torcoli, Mhd Modar Halimeh, Emanuël A. P. Habets

    Abstract: Perceptual Evaluation of Speech Quality (PESQ) is an objective quality measure that remains widely used despite its withdrawal by the International Telecommunication Union (ITU). PESQ has evolved over two decades, with multiple versions and publicly available implementations emerging during this time. The numerous versions and their updates can be overwhelming, especially for new PESQ users. This…

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2503.03304  [pdf, ps, other]

    eess.AS

    On the Relation Between Speech Quality and Quantized Latent Representations of Neural Codecs

    Authors: Mhd Modar Halimeh, Matteo Torcoli, Philipp Grundhuber, Emanuël A. P. Habets

    Abstract: Neural audio signal codecs have attracted significant attention in recent years. In essence, the impressive low bitrate achieved by such encoders is enabled by learning an abstract representation that captures the properties of encoded signals, e.g., speech. In this work, we investigate the relation between the latent representation of the input signal learned by a neural codec and the quality of…

    Submitted 5 March, 2025; originally announced March 2025.

  3. arXiv:2409.13502  [pdf, other]

    eess.AS cs.SD

    Neural Directional Filtering: Far-Field Directivity Control With a Small Microphone Array

    Authors: Julian Wechsler, Srikanth Raj Chetupalli, Mhd Modar Halimeh, Oliver Thiergart, Emanuël A. P. Habets

    Abstract: Capturing audio signals with specific directivity patterns is essential in speech communication. This study presents a deep neural network (DNN)-based approach to directional filtering, alleviating the need for explicit signal models. More specifically, our proposed method uses a DNN to estimate a single-channel complex mask from the signals of a microphone array. This mask is then applied to a re…

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: Presented at the International Workshop on Acoustic Signal Enhancement (IWAENC), 2024

  4. arXiv:2408.08729  [pdf, ps, other]

    eess.AS cs.CL cs.SD

    ConcateNet: Dialogue Separation Using Local And Global Feature Concatenation

    Authors: Mhd Modar Halimeh, Matteo Torcoli, Emanuël Habets

    Abstract: Dialogue separation involves isolating a dialogue signal from a mixture, such as a movie or a TV program. This can be a necessary step to enable dialogue enhancement for broadcast-related applications. In this paper, ConcateNet for dialogue separation is proposed, which is based on a novel approach for processing local and global features aimed at better generalization for out-of-domain signals. C…

    Submitted 16 August, 2024; originally announced August 2024.

  5. arXiv:2405.17364  [pdf, other]

    eess.AS

    Speech Loudness in Broadcasting and Streaming

    Authors: Matteo Torcoli, Mhd Modar Halimeh, Thomas Leitz, Yannik Grewe, Michael Kratschmer, Bernhard Neugebauer, Adrian Murtaza, Harald Fuchs, Emanuël A. P. Habets

    Abstract: The introduction and regulation of loudness in broadcasting and streaming brought clear benefits to the audience, e.g., a level of uniformity across programs and channels. Yet, speech loudness is frequently reported as being too low in certain passages, which can hinder the full understanding and enjoyment of movies and TV programs. This paper proposes expanding the set of loudness-based measures…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted for presentation at the Audio Engineering Society (AES) 156th Convention, June 2024, Madrid, Spain

  6. arXiv:2401.00197  [pdf, other]

    eess.AS

    ODAQ: Open Dataset of Audio Quality

    Authors: Matteo Torcoli, Chih-Wei Wu, Sascha Dick, Phillip A. Williams, Mhd Modar Halimeh, William Wolcott, Emanuël A. P. Habets

    Abstract: Research into the prediction and analysis of perceived audio quality is hampered by the scarcity of openly available datasets of audio signals accompanied by corresponding subjective quality scores. To address this problem, we present the Open Dataset of Audio Quality (ODAQ), a new dataset containing the results of a MUSHRA listening test conducted with expert listeners from 2 international labora…

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted paper. IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), Seoul, Korea, April 2024

  7. Exploiting spatial information with the informed complex-valued spatial autoencoder for target speaker extraction

    Authors: Annika Briegleb, Mhd Modar Halimeh, Walter Kellermann

    Abstract: In conventional multichannel audio signal enhancement, spatial and spectral filtering are often performed sequentially. In contrast, it has been shown that for neural spatial filtering a joint approach of spectro-spatial filtering is more beneficial. In this contribution, we investigate the spatial filtering performed by such a time-varying spectro-spatial filter. We extend the recently proposed c…

    Submitted 14 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: Accepted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece. 5 pages, 2 figures

  8. arXiv:2108.03130  [pdf, other]

    eess.AS

    Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement

    Authors: Mhd Modar Halimeh, Walter Kellermann

    Abstract: In this contribution, we present a novel online approach to multichannel speech enhancement. The proposed method estimates the enhanced signal through a filter-and-sum framework. More specifically, complex-valued masks are estimated by a deep complex-valued neural network, termed the complex-valued spatial autoencoder. The proposed network is capable of exploiting as well as manipulating both the…

    Submitted 6 August, 2021; originally announced August 2021.

  9. A Synergistic Kalman- and Deep Postfiltering Approach to Acoustic Echo Cancellation

    Authors: Thomas Haubner, Mhd. Modar Halimeh, Andreas Brendel, Walter Kellermann

    Abstract: We introduce a synergistic approach to double-talk robust acoustic echo cancellation combining adaptive Kalman filtering with a deep neural network-based postfilter. The proposed algorithm overcomes the well-known limitations of Kalman filter-based adaptation control in scenarios characterized by abrupt echo path changes. As the key innovation, we suggest to exploit the different statistical prope…

    Submitted 4 March, 2022; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted for European Signal Processing Conference (EUSIPCO), Dublin, Ireland, August 2021