Showing 1–7 of 7 results for author: Samarasinghe, P N

Searching in archive eess.

  1. arXiv:2405.11792  [pdf, other]

    eess.AS

    Source Localization by Multidimensional Steered Response Power Mapping with Sparse Bayesian Learning

    Authors: Wei-Ting Lai, Lachlan Birnie, Xingyu Chen, Amy Bastine, Thushara D. Abhayapala, Prasanga N. Samarasinghe

    Abstract: We propose an advanced Steered Response Power (SRP) method for localizing multiple sources. While conventional SRP performs well in adverse conditions, it still struggles in scenarios with closely neighboring sources, resulting in ambiguous SRP maps. We address this issue by applying sparsity optimization in SRP to obtain high-resolution maps. Our approach represents SRP maps as multidimensiona…

    Submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2309.08290  [pdf, other]

    eess.AS cs.SD

    Head-Related Transfer Function Interpolation with a Spherical CNN

    Authors: Xingyu Chen, Fei Ma, Yile Zhang, Amy Bastine, Prasanga N. Samarasinghe

    Abstract: Head-related transfer functions (HRTFs) are crucial for spatial soundfield reproduction in virtual reality applications. However, obtaining personalized, high-resolution HRTFs is a time-consuming and costly task. Recently, deep learning-based methods have shown promise in interpolating high-resolution HRTFs from sparse measurements. Some of these methods treat HRTF interpolation as an image super-reso…

    Submitted 15 September, 2023; originally announced September 2023.

  3. arXiv:2308.00242  [pdf, ps, other]

    eess.AS

    Circumvent spherical Bessel function nulls for open sphere microphone arrays with physics informed neural network

    Authors: Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe

    Abstract: Open sphere microphone arrays (OSMAs) are simple to design and do not introduce scattering fields, and thus can be more advantageous than other arrays for implementing spatial acoustic algorithms under spherical model decomposition. However, an OSMA suffers from spherical Bessel function nulls, which make it hard to obtain some sound field coefficients at certain frequencies. This paper proposes to assi…

    Submitted 31 July, 2023; originally announced August 2023.

  4. arXiv:2307.14650  [pdf, other]

    eess.AS

    Spatial Upsampling of Head-Related Transfer Functions Using a Physics-Informed Neural Network

    Authors: Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Xingyu Chen

    Abstract: Head-related transfer functions (HRTFs) capture the information that a person uses to localize sound sources in space, and thus are crucial for creating personalized virtual acoustic experiences. However, practical HRTF measurement systems may only measure a person's HRTFs sparsely, and this necessitates HRTF upsampling. This paper proposes a physics-informed neural network (PINN) method for HRTF ups…

    Submitted 10 December, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  5. arXiv:2206.09298  [pdf, ps, other]

    cs.SD cs.RO eess.AS

    GMM based multi-stage Wiener filtering for low SNR speech enhancement

    Authors: Wageesha Manamperi, Prasanga N. Samarasinghe, Thushara D. Abhayapala, Jihui Zhang

    Abstract: This paper proposes a single-channel speech enhancement method to reduce noise and enhance speech at low signal-to-noise ratio (SNR) levels and under non-stationary noise conditions. Specifically, we focus on modeling the noise using a Gaussian mixture model (GMM) based on a multi-stage process with a parametric Wiener filter. The proposed noise model estimates a more accurate noise power spectral d…

    Submitted 14 July, 2022; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: 5 pages, 3 figures, submitted to a conference

  6. arXiv:2105.08219  [pdf, other]

    eess.AS eess.SP

    A time-domain nearfield frequency-invariant beamforming method

    Authors: Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe

    Abstract: Most existing beamforming methods are frequency-domain methods, and are designed for enhancing a farfield target source over a narrow frequency band. They have found diverse applications and are still under active development. However, they struggle to achieve the desired performance if the target source is in the nearfield with a broadband output. This paper proposes a time-domain nearfield frequency…

    Submitted 17 May, 2021; originally announced May 2021.

  7. Multi-Source DOA Estimation through Pattern Recognition of the Modal Coherence of a Reverberant Soundfield

    Authors: A. Fahim, P. N. Samarasinghe, T. D. Abhayapala

    Abstract: We propose a novel multi-source direction of arrival (DOA) estimation technique using a convolutional neural network algorithm which learns the modal coherence patterns of an incident soundfield through measured spherical harmonic coefficients. We train our model for individual time-frequency bins in the short-time Fourier transform spectrum by analyzing the unique snapshot of modal coherence for…

    Submitted 18 March, 2020; originally announced March 2020.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2019) 605 - 618