Sound Field Translation and Mixed Source Model for Virtual Applications with Perceptual Validation

Birnie, Lachlan; Abhayapala, Thushara; Tourbabin, Vladimir; Samarasinghe, Prasanga

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2007.11795 (eess)

[Submitted on 23 Jul 2020]

Title:Sound Field Translation and Mixed Source Model for Virtual Applications with Perceptual Validation

Authors:Lachlan Birnie (1), Thushara Abhayapala (1), Vladimir Tourbabin (2), Prasanga Samarasinghe (1) ((1) The Australian National University, (2) Facebook Reality Labs)

View PDF

Abstract:Non-interactive and linear experiences like cinema film offer high quality surround sound audio to enhance immersion, however the listener's experience is usually fixed to a single acoustic perspective. With the rise of virtual reality, there is a demand for recording and recreating real-world experiences in a way that allows for the user to interact and move within the reproduction. Conventional sound field translation techniques take a recording and expand it into an equivalent environment of virtual sources. However, the finite sampling of a commercial higher order microphone produces an acoustic sweet-spot in the virtual reproduction. As a result, the technique remains to restrict the listener's navigable region. In this paper, we propose a method for listener translation in an acoustic reproduction that incorporates a mixture of near-field and far-field sources in a sparsely expanded virtual environment. We perceptually validate the method through a Multiple Stimulus with Hidden Reference and Anchor (MUSHRA) experiment. Compared to the planewave benchmark, the proposed method offers both improved source localizability and robustness to spectral distortions at translated positions. A cross-examination with numerical simulations demonstrated that the sparse expansion relaxes the inherent sweet-spot constraint, leading to the improved localizability for sparse environments. Additionally, the proposed method is seen to better reproduce the intensity and binaural room impulse response spectra of near-field environments, further supporting the strong perceptual results.

Comments:	12 pages, 11 figures This work has been submitted to the IEEE for possible publication
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2007.11795 [eess.AS]
	(or arXiv:2007.11795v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2007.11795

Submission history

From: Lachlan Birnie [view email]
[v1] Thu, 23 Jul 2020 05:16:01 UTC (1,464 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Sound Field Translation and Mixed Source Model for Virtual Applications with Perceptual Validation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Sound Field Translation and Mixed Source Model for Virtual Applications with Perceptual Validation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators