A Data-Driven Exploration of Elevation Cues in HRTFs: An Explainable AI Perspective Across Multiple Datasets

De Rus, Juan Antonio; Montagud, Mario; Lopez-Ballester, Jesus; Ferri, Francesc J.; Cobos, Maximo

arXiv:2503.11312 (eess)

[Submitted on 14 Mar 2025]

Abstract: Precise elevation perception in binaural audio remains a challenge, despite extensive research on head-related transfer functions (HRTFs) and spectral cues. While prior studies have advanced our understanding of sound localization cues, the interplay between spectral features and elevation perception is still not fully understood. This paper presents a comprehensive analysis of over 600 subjects from 11 diverse public HRTF datasets, employing a convolutional neural network (CNN) model combined with explainable artificial intelligence (XAI) techniques to investigate elevation cues. In addition to testing various HRTF pre-processing methods, we focus on both within-dataset and inter-dataset generalization and explainability, assessing the model's robustness across different HRTF variations stemming from subjects and measurement setups. By leveraging class activation mapping (CAM) saliency maps, we identify key frequency bands that may contribute to elevation perception, providing deeper insights into the spectral features that drive elevation-specific classification. This study offers new perspectives on HRTF modeling and elevation perception by analyzing diverse datasets and pre-processing techniques, expanding our understanding of these cues across a wide range of conditions.
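As a rough illustration of the class activation mapping idea mentioned in the abstract, the sketch below computes a one-dimensional CAM over a frequency axis for a classifier that ends in global average pooling followed by a linear layer. This is not the authors' model or code: the function name `class_activation_map`, the array shapes (8 channels, 64 frequency bins, 5 elevation classes), and the random inputs are all assumptions made purely for illustration.

```python
import numpy as np

def class_activation_map(feature_maps, class_weights, class_idx):
    """Return the raw CAM for one class.

    feature_maps:  (K, F) activations of the last conv layer
                   (K channels, F frequency bins) -- hypothetical shapes.
    class_weights: (C, K) weights of the linear layer that follows
                   global average pooling (C classes).
    """
    # CAM_c(f) = sum_k w[c, k] * A[k, f]
    return class_weights[class_idx] @ feature_maps

def normalize_for_display(cam):
    """Clip negatives and scale to [0, 1]; a plotting convention only."""
    cam = np.maximum(cam, 0.0)
    peak = cam.max()
    return cam / peak if peak > 0 else cam

# Random stand-ins for real activations and weights (illustrative only).
rng = np.random.default_rng(0)
feature_maps = rng.random((8, 64))
class_weights = rng.standard_normal((5, 8))

cam = class_activation_map(feature_maps, class_weights, class_idx=2)
saliency = normalize_for_display(cam)

# With global average pooling, the class logit (ignoring any bias)
# equals the mean of the CAM over the frequency axis.
logit = class_weights[2] @ feature_maps.mean(axis=1)
```

Under these assumptions, bins where `saliency` is large are the frequency regions that push the network toward the chosen elevation class, which is the kind of evidence a CAM-based analysis of spectral cues would inspect.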

Comments:	14 pages, 9 figures
Subjects:	Signal Processing (eess.SP); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2503.11312 [eess.SP]
	(or arXiv:2503.11312v1 [eess.SP] for this version)
	https://doi.org/10.48550/arXiv.2503.11312

Submission history

From: Juan Antonio De Rus Arance
[v1] Fri, 14 Mar 2025 11:27:50 UTC (31,419 KB)
