Showing 1–13 of 13 results for author: Pezzoli, M

  1. arXiv:2505.00897  [pdf, other]

    eess.AS eess.SP

    Physics-Informed Neural Network-Driven Sparse Field Discretization Method for Near-Field Acoustic Holography

    Authors: Xinmeng Luan, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti

    Abstract: We propose the Physics-Informed Neural Network-driven Sparse Field Discretization method (PINN-SFD), a novel self-supervised, physics-informed deep learning approach for addressing the Near-Field Acoustic Holography (NAH) problem. Unlike existing deep learning methods for NAH, which are predominantly supervised by large datasets, our approach does not require a training phase and it is physics-inf…

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 12 pages, 7 figures

  2. arXiv:2504.20625  [pdf, other]

    cs.SD cs.AI eess.AS

    DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models

    Authors: Sagi Della Torre, Mirco Pezzoli, Fabio Antonacci, Sharon Gannot

    Abstract: Room Impulse Responses (RIRs) characterize acoustic environments and are crucial in multiple audio signal processing tasks. High-quality RIR estimates drive applications such as virtual microphones, sound source localization, augmented reality, and data augmentation. However, obtaining RIR measurements with high spatial resolution is resource-intensive, making it impractical for large spaces or wh…

    Submitted 29 April, 2025; originally announced April 2025.

  3. arXiv:2501.02871  [pdf, other]

    cs.SD eess.AS

    Towards HRTF Personalization using Denoising Diffusion Models

    Authors: Juan Camilo Albarracín Sánchez, Luca Comanducci, Mirco Pezzoli, Fabio Antonacci

    Abstract: Head-Related Transfer Functions (HRTFs) have fundamental applications for realistic rendering in immersive audio scenarios. However, they are strongly subject-dependent as they vary considerably depending on the shape of the ears, head and torso. Thus, personalization procedures are required for accurate binaural rendering. Recently, Denoising Diffusion Probabilistic Models (DDPMs), a class of gen…

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: to appear in ICASSP 2025

  4. arXiv:2412.18348  [pdf, other]

    eess.AS eess.SP

    A Zero-Shot Physics-Informed Dictionary Learning Approach for Sound Field Reconstruction

    Authors: Stefano Damiano, Federico Miotello, Mirco Pezzoli, Alberto Bernardini, Fabio Antonacci, Augusto Sarti, Toon van Waterschoot

    Abstract: Sound field reconstruction aims to estimate pressure fields in areas lacking direct measurements. Existing techniques often rely on strong assumptions or face challenges related to data availability or the explicit modeling of physical properties. To bridge these gaps, this study introduces a zero-shot, physics-informed dictionary learning approach to perform sound field reconstruction. Our method…

    Submitted 24 December, 2024; originally announced December 2024.

    Comments: Accepted for publication at ICASSP 2025

  5. arXiv:2408.14731  [pdf, other]

    cs.SD eess.AS

    Physics-Informed Machine Learning For Sound Field Estimation

    Authors: Shoichi Koyama, Juliano G. C. Ribeiro, Tomohiko Nakamura, Natsuki Ueno, Mirco Pezzoli

    Abstract: The area of study concerning the estimation of spatial sound, i.e., the distribution of a physical quantity of sound such as acoustic pressure, is called sound field estimation, which is the basis for various applied technologies related to spatial audio processing. The sound field estimation problem is formulated as a function interpolation problem in machine learning in a simplified scenario. Ho…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Accepted to IEEE Signal Processing Magazine, Special Issue on Model-based and Data-Driven Audio Signal Processing

  6. arXiv:2407.18732  [pdf, ps, other]

    eess.AS cs.LG cs.SD eess.SP

    A Physics-Informed Neural Network-Based Approach for the Spatial Upsampling of Spherical Microphone Arrays

    Authors: Federico Miotello, Ferdinando Terminiello, Mirco Pezzoli, Alberto Bernardini, Fabio Antonacci, Augusto Sarti

    Abstract: Spherical microphone arrays are convenient tools for capturing the spatial characteristics of a sound field. However, achieving superior spatial resolution requires arrays with numerous capsules, consequently leading to expensive devices. To address this issue, we present a method for spatially upsampling spherical microphone arrays with a limited number of capsules. Our approach exploits a physic…

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at IWAENC 2024

  7. arXiv:2403.09524  [pdf, other]

    eess.AS

    Physics-Informed Neural Network for Volumetric Sound field Reconstruction of Speech Signals

    Authors: Marco Olivieri, Xenofon Karakonstantis, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti, Efren Fernandez-Grande

    Abstract: Recent developments in acoustic signal processing have seen the integration of deep learning methodologies, alongside the continued prominence of classical wave expansion-based approaches, particularly in sound field reconstruction. Physics-Informed Neural Networks (PINNs) have emerged as a novel framework, bridging the gap between data-driven and model-based techniques for addressing physical phe…

    Submitted 23 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  8. arXiv:2402.13896  [pdf, other]

    eess.AS eess.SP

    HOMULA-RIR: A Room Impulse Response Dataset for Teleconferencing and Spatial Audio Applications Acquired Through Higher-Order Microphones and Uniform Linear Microphone Arrays

    Authors: Federico Miotello, Paolo Ostan, Mirco Pezzoli, Luca Comanducci, Alberto Bernardini, Fabio Antonacci, Augusto Sarti

    Abstract: In this paper, we present HOMULA-RIR, a dataset of room impulse responses (RIRs) acquired using both higher-order microphones (HOMs) and a uniform linear array (ULA), in order to model a remote attendance teleconferencing scenario. Specifically, measurements were performed in a seminar room, where a 64-microphone ULA was used as a multichannel audio acquisition system in the proximity of the speak…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted for publication at ICASSP 2024 - HSCMA Workshop

  9. arXiv:2402.04866  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Room Transfer Function Reconstruction Using Complex-valued Neural Networks and Irregularly Distributed Microphones

    Authors: Francesca Ronchini, Luca Comanducci, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti

    Abstract: Reconstructing the room transfer functions needed to calculate the complex sound field in a room has several important real-world applications. However, an unpractical number of microphones is often required. Recently, in addition to classical signal processing methods, deep learning techniques have been applied to reconstruct the room transfer function starting from a very limited set of measurem…

    Submitted 11 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at EUSIPCO 2024

  10. arXiv:2312.08821  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Reconstruction of Sound Field through Diffusion Models

    Authors: Federico Miotello, Luca Comanducci, Mirco Pezzoli, Alberto Bernardini, Fabio Antonacci, Augusto Sarti

    Abstract: Reconstructing the sound field in a room is an important task for several applications, such as sound control and augmented (AR) or virtual reality (VR). In this paper, we propose a data-driven generative model for reconstructing the magnitude of acoustic fields in rooms with a focus on the modal frequency range. We introduce, for the first time, the use of a conditional Denoising Diffusion Probab…

    Submitted 21 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted for publication at ICASSP 2024

  11. arXiv:2306.11509  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Implicit neural representation with physics-informed neural networks for the reconstruction of the early part of room impulse responses

    Authors: Mirco Pezzoli, Fabio Antonacci, Augusto Sarti

    Abstract: Recently deep learning and machine learning approaches have been widely employed for various applications in acoustics. Nonetheless, in the area of sound field processing and reconstruction classic methods based on the solutions of wave equation are still widespread. Recently, physics-informed neural networks have been proposed as a deep learning paradigm for solving partial differential equations…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted for publication at Forum Acusticum 2023

    Journal ref: Proceedings of the 10th Convention of the European Acoustics Association Forum Acusticum 2023 (pp. 2177-2184)

  12. Acoustic source localization in the spherical harmonics domain exploiting low-rank approximations

    Authors: Maximo Cobos, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti

    Abstract: Acoustic signal processing in the spherical harmonics domain (SHD) is an active research area that exploits the signals acquired by higher order microphone arrays. A very important task is that concerning the localization of active sound sources. In this paper, we propose a simple yet effective method to localize prominent acoustic sources in adverse acoustic scenarios. By using a proper normaliza…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: To appear in ICASSP 2023

    Journal ref: 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  13. arXiv:2103.16935  [pdf, other]

    cs.SD cs.LG eess.AS

    Near field Acoustic Holography on arbitrary shapes using Convolutional Neural Network

    Authors: Marco Olivieri, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti

    Abstract: Near-field Acoustic Holography (NAH) is a well-known problem aimed at estimating the vibrational velocity field of a structure by means of acoustic measurements. In this paper, we propose a NAH technique based on Convolutional Neural Network (CNN). The devised CNN predicts the vibrational field on the surface of arbitrary shaped plates (violin plates) with orthotropic material properties from a li…

    Submitted 29 June, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: accepted for publication in EUSIPCO21