Skip to main content

Showing 1–17 of 17 results for author: Christensen, M G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.07978  [pdf, other

    eess.AS cs.SD

    Sound Zone Control Robust To Sound Speed Change

    Authors: Sankha Subhra Bhattacharjee, Jesper Rindom Jensen, Mads Græsbøll Christensen

    Abstract: Sound zone control (SZC) implemented using static optimal filters is significantly affected by various perturbations in the acoustic environment, an important one being the fluctuation in the speed of sound, which is in turn influenced by changes in temperature and humidity (TH). This issue arises because control algorithms typically use pre-recorded, static impulse responses (IRs) to design the o… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 5 pages, 4 figures, submitted to ICASSP 2025

  2. arXiv:2407.09324  [pdf, other

    cs.LG cs.AI cs.IT

    Provable Privacy Advantages of Decentralized Federated Learning via Distributed Optimization

    Authors: Wenrui Yu, Qiongxiu Li, Milan Lopuhaä-Zwakenberg, Mads Græsbøll Christensen, Richard Heusdens

    Abstract: Federated learning (FL) emerged as a paradigm designed to improve data privacy by enabling data to reside at its source, thus embedding privacy as a core consideration in FL architectures, whether centralized or decentralized. Contrasting with recent findings by Pasquini et al., which suggest that decentralized FL does not empirically offer any additional privacy or security benefits over centrali… ▽ More

    Submitted 30 November, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2302.12048  [pdf, ps, other

    eess.AS cs.SD

    Frequency bin-wise single channel speech presence probability estimation using multiple DNNs

    Authors: Shuai Tao, Himavanth Reddy, Jesper Rindom Jensen, Mads Græsbøll Christensen

    Abstract: In this work, we propose a frequency bin-wise method to estimate the single-channel speech presence probability (SPP) with multiple deep neural networks (DNNs) in the short-time Fourier transform domain. Since all frequency bins are typically considered simultaneously as input features for conventional DNN-based SPP estimators, high model complexity is inevitable. To reduce the model complexity an… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted for ICASSP 2023

  4. arXiv:2211.09166  [pdf, other

    eess.AS cs.SD

    A Two-Stage Deep Representation Learning-Based Speech Enhancement Method Using Variational Autoencoder and Adversarial Training

    Authors: Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

    Abstract: This paper focuses on leveraging deep representation learning (DRL) for speech enhancement (SE). In general, the performance of the deep neural network (DNN) is heavily dependent on the learning of data representation. However, the DRL's importance is often ignored in many DNN-based SE algorithms. To obtain a higher quality enhanced speech, we propose a two-stage DRL-based SE method through advers… ▽ More

    Submitted 27 September, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing

  5. Privacy-Preserving Distributed Expectation Maximization for Gaussian Mixture Model using Subspace Perturbation

    Authors: Qiongxiu Li, Jaron Skovsted Gundersen, Katrine Tjell, Rafal Wisniewski, Mads Græsbøll Christensen

    Abstract: Privacy has become a major concern in machine learning. In fact, the federated learning is motivated by the privacy concern as it does not allow to transmit the private data but only intermediate updates. However, federated learning does not always guarantee privacy-preservation as the intermediate updates may also reveal sensitive information. In this paper, we give an explicit information-theore… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Journal ref: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 4263-4267

  6. arXiv:2205.05581  [pdf, other

    eess.AS cs.SD

    A deep representation learning speech enhancement method using $β$-VAE

    Authors: Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

    Abstract: In previous work, we proposed a variational autoencoder-based (VAE) Bayesian permutation training speech enhancement (SE) method (PVAE) which indicated that the SE performance of the traditional deep neural network-based (DNN) method could be improved by deep representation learning (DRL). Based on our previous work, we in this paper propose to use $β$-VAE to further improve PVAE's ability of repr… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Submitted to Eurosipco

  7. arXiv:2201.09875  [pdf, other

    eess.AS cs.SD

    A Bayesian Permutation training deep representation learning method for speech enhancement with variational autoencoder

    Authors: Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

    Abstract: Recently, variational autoencoder (VAE), a deep representation learning (DRL) model, has been used to perform speech enhancement (SE). However, to the best of our knowledge, current VAE-based SE methods only apply VAE to the model speech signal, while noise is modeled using the traditional non-negative matrix factorization (NMF) model. One of the most important reasons for using NMF is that these… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: Accepted by ICASSP 2022

  8. arXiv:2105.14416  [pdf, other

    cs.DC eess.SP

    Communication efficient privacy-preserving distributed optimization using adaptive differential quantization

    Authors: Qiongxiu Li, Richard Heusdens, Mads Græsbøll Christensen

    Abstract: Privacy issues and communication cost are both major concerns in distributed optimization. There is often a trade-off between them because the encryption methods required for privacy-preservation often incur expensive communication bandwidth. To address this issue, we, in this paper, propose a quantization-based approach to achieve both communication efficient and privacy-preserving solutions in t… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

  9. arXiv:2105.01302  [pdf, other

    eess.AS cs.SD

    Speech Decomposition Based on a Hybrid Speech Model and Optimal Segmentation

    Authors: Alfredo Esquivel Jaramillo, Jesper Kjær Nielsen, Mads Græsbøll Christensen

    Abstract: In a hybrid speech model, both voiced and unvoiced components can coexist in a segment. Often, the voiced speech is regarded as the deterministic component, and the unvoiced speech and additive noise are the stochastic components. Typically, the speech signal is considered stationary within fixed segments of 20-40 ms, but the degree of stationarity varies over time. For decomposing noisy speech in… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: 5 pages, 3 figures, Interspeech conference

  10. arXiv:2009.01098  [pdf, other

    cs.CR eess.SP

    Privacy-Preserving Distributed Processing: Metrics, Bounds, and Algorithms

    Authors: Qiongxiu Li, Jaron Skovsted Gundersen, Richard Heusdens, Mads Græsbøll Christensen

    Abstract: Privacy-preserving distributed processing has recently attracted considerable attention. It aims to design solutions for conducting signal processing tasks over networks in a decentralized fashion without violating privacy. Many algorithms can be adopted to solve this problem such as differential privacy, secure multiparty computation, and the recently proposed distributed optimization based subsp… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: 12 pages, 3 figures

  11. arXiv:2006.16689  [pdf, other

    eess.AS cs.SD

    A Speech Enhancement Algorithm based on Non-negative Hidden Markov Model and Kullback-Leibler Divergence

    Authors: Yang Xiang, Liming Shi, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

    Abstract: In this paper, we propose a novel supervised single-channel speech enhancement method combing the the Kullback-Leibler divergence-based non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM). With the application of HMM, the temporal dynamics information of speech signals can be taken into account. In the training stage, the sum of Poisson, leading to the KL divergence measure,… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

  12. Privacy-Preserving Distributed Optimization via Subspace Perturbation: A General Framework

    Authors: Qiongxiu Li, Richard Heusdens, Mads Græsbøll Christensen

    Abstract: As the modern world becomes increasingly digitized and interconnected, distributed signal processing has proven to be effective in processing its large volume of data. However, a main challenge limiting the broad use of distributed signal processing techniques is the issue of privacy in handling sensitive data. To address this privacy issue, we propose a novel yet general subspace perturbation met… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

  13. arXiv:1905.11785  [pdf, other

    eess.AS cs.SD

    Automatic Quality Control and Enhancement for Voice-Based Remote Parkinson's Disease Detection

    Authors: Amir Hossein Poorjam, Mathew Shaji Kavalekalam, Liming Shi, Yordan P. Raykov, Jesper Rindom Jensen, Max A. Little, Mads Græsbøll Christensen

    Abstract: The performance of voice-based Parkinson's disease (PD) detection systems degrades when there is an acoustic mismatch between training and operating conditions caused mainly by degradation in test signals. In this paper, we address this mismatch by considering three types of degradation commonly encountered in remote voice analysis, namely background noise, reverberation and nonlinear distortion,… ▽ More

    Submitted 31 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Preprint, 12 pages, 6 figures

  14. arXiv:1905.08557  [pdf, other

    cs.SD cs.LG eess.AS

    Bayesian Pitch Tracking Based on the Harmonic Model

    Authors: Liming Shi, Jesper Kjaer Nielsen, Jesper Rindom Jensen, Max A. Little, Mads Graesboll Christensen

    Abstract: Fundamental frequency is one of the most important characteristics of speech and audio signals. Harmonic model-based fundamental frequency estimators offer a higher estimation accuracy and robustness against noise than the widely used autocorrelation-based methods. However, the traditional harmonic model-based estimators do not take the temporal smoothness of the fundamental frequency, the model o… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  15. arXiv:1806.04885  [pdf, other

    eess.AS cs.SD

    Model-based Speech Enhancement for Intelligibility Improvement in Binaural Hearing Aids

    Authors: Mathew Shaji Kavalekalam, Jesper K. Nielsen, Jesper B. Boldt, Mads G. Christensen

    Abstract: Speech intelligibility is often severely degraded among hearing impaired individuals in situations such as the cocktail party scenario. The performance of the current hearing aid technology has been observed to be limited in these scenarios. In this paper, we propose a binaural speech enhancement framework that takes into consideration the speech production model. The enhancement framework propose… ▽ More

    Submitted 1 October, 2018; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: after revision

  16. arXiv:1706.07927  [pdf, other

    cs.SD cs.LG

    A Variational EM Method for Pole-Zero Modeling of Speech with Mixed Block Sparse and Gaussian Excitation

    Authors: Liming Shi, Jesper Kjær Nielsen, Jesper Rindom Jensen, Mads Græsbøll Christensen

    Abstract: The modeling of speech can be used for speech synthesis and speech recognition. We present a speech analysis method based on pole-zero modeling of speech with mixed block sparse and Gaussian excitation. By using a pole-zero model, instead of the all-pole model, a better spectral fitting can be expected. Moreover, motivated by the block sparse glottal flow excitation during voiced speech and the wh… ▽ More

    Submitted 24 June, 2017; originally announced June 2017.

    Comments: Accepted in the 25th European Signal Processing Conference (EUSIPCO 2017), published by EUROSIP, scheduled for Aug. 28 - Sep. 2 in Kos island, Greece

  17. arXiv:1609.04167  [pdf, other

    math.NA cs.CV cs.IT cs.LG math.OC

    Proceedings of the third "international Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST'16)

    Authors: V. Abrol, O. Absil, P. -A. Absil, S. Anthoine, P. Antoine, T. Arildsen, N. Bertin, F. Bleichrodt, J. Bobin, A. Bol, A. Bonnefoy, F. Caltagirone, V. Cambareri, C. Chenot, V. Crnojević, M. Daňková, K. Degraux, J. Eisert, J. M. Fadili, M. Gabrié, N. Gac, D. Giacobello, A. Gonzalez, C. A. Gomez Gonzalez, A. González , et al. (36 additional authors not shown)

    Abstract: The third edition of the "international - Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST) took place in Aalborg, the 4th largest city in Denmark situated beautifully in the northern part of the country, from the 24th to 26th of August 2016. The workshop venue was at the Aalborg University campus. One implicit objective of this biennial workshop is to foster collab… ▽ More

    Submitted 14 September, 2016; originally announced September 2016.

    Comments: 69 pages, 22 extended abstracts, iTWIST'16 website: http://www.itwist16.es.aau.dk