Fully Reversing the Shoebox Image Source Method: From Impulse Responses to Room Parameters

Sprunck, Tom; Deleforge, Antoine; Privat, Yannick; Foy, Cédric

doi:10.1109/TASLPRO.2025.3536841

Computer Science > Sound

arXiv:2405.03385 (cs)

[Submitted on 6 May 2024 (v1), last revised 10 Mar 2025 (this version, v2)]

Title:Fully Reversing the Shoebox Image Source Method: From Impulse Responses to Room Parameters

Authors:Tom Sprunck (IRMA, MACARON), Antoine Deleforge (IRMA, MACARON), Yannick Privat (IECL, SPHINX, IUF), Cédric Foy (UMRAE, Cerema Direction Est)

View PDF

Abstract:We present an algorithm that fully reverses the shoebox image source method (ISM), a popular and widely used room impulse response (RIR) simulator for cuboid rooms introduced by Allen and Berkley in 1979. More precisely, given a discrete multichannel RIR generated by the shoebox ISM for a microphone array of known geometry, the algorithm reliably recovers the 18 input parameters. These are the 3D source position, the 3 dimensions of the room, the 6-degrees-of-freedom room translation and orientation, and an absorption coefficient for each of the 6 room boundaries. The approach builds on a recently proposed gridless image source localization technique combined with new procedures for room axes recovery and first-order-reflection identification. Extensive simulated experiments reveal that near-exact recovery of all parameters is achieved for a 32-element, 8.4-cm-wide spherical microphone array and a sampling rate of 16~kHz using fully randomized input parameters within rooms of size 2X2X2 to 10X10X5 meters. Estimation errors decay towards zero when increasing the array size and sampling rate. The method is also shown to strongly outperform a known baseline, and its ability to extrapolate RIRs at new positions is demonstrated. Crucially, the approach is strictly limited to low-passed discrete RIRs simulated using the vanilla shoebox ISM. Nonetheless, it represents to our knowledge the first algorithmic demonstration that this difficult inverse problem is in-principle fully solvable over a wide range of configurations.

Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP); Classical Physics (physics.class-ph)
Cite as:	arXiv:2405.03385 [cs.SD]
	(or arXiv:2405.03385v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2405.03385
Journal reference:	IEEE transactions on acoustics, speech, and signal processing, 2025, 33, pp.1022-1033
Related DOI:	https://doi.org/10.1109/TASLPRO.2025.3536841

Submission history

From: Tom Sprunck [view email] [via CCSD proxy]
[v1] Mon, 6 May 2024 11:43:49 UTC (1,178 KB)
[v2] Mon, 10 Mar 2025 09:48:50 UTC (1,315 KB)

Computer Science > Sound

Title:Fully Reversing the Shoebox Image Source Method: From Impulse Responses to Room Parameters

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Fully Reversing the Shoebox Image Source Method: From Impulse Responses to Room Parameters

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators