Skip to main content

Showing 1–32 of 32 results for author: Rafaely, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2507.04108  [pdf, ps, other

    eess.AS eess.SP

    Ambisonics Encoder for Wearable Array with Improved Binaural Reproduction

    Authors: Yhonatan Gayer, Vladimir Tourbabin, Zamir Ben-Hur, David Alon, Boaz Rafaely

    Abstract: Ambisonics Signal Matching (ASM) is a recently proposed signal-independent approach to encoding Ambisonic signal from wearable microphone arrays, enabling efficient and standardized spatial sound reproduction. However, reproduction accuracy is currently limited due to the non-ideal layout of the microphones. This research introduces an enhanced ASM encoder that reformulates the loss function by in… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: Published in Forum Acousticum 2025, 6 pages, 2 figures

  2. arXiv:2506.19404  [pdf, ps, other

    eess.AS cs.SD

    Loss functions incorporating auditory spatial perception in deep learning -- a review

    Authors: Boaz Rafaely, Stefan Weinzierl, Or Berebi, Fabian Brinkmann

    Abstract: Binaural reproduction aims to deliver immersive spatial audio with high perceptual realism over headphones. Loss functions play a central role in optimizing and evaluating algorithms that generate binaural signals. However, traditional signal-related difference measures often fail to capture the perceptual properties that are essential to spatial audio quality. This review paper surveys recent los… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: Submitted to I3DA 2025

  3. BSM-iMagLS: ILD Informed Binaural Signal Matching for Reproduction with Head-Mounted Microphone Arrays

    Authors: Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely

    Abstract: Headphone listening in applications such as augmented and virtual reality (AR and VR) relies on high-quality spatial audio to ensure immersion, making accurate binaural reproduction a critical component. As capture devices, wearable arrays with only a few microphones with irregular arrangement face challenges in achieving a reproduction quality comparable to that of arrays with a large number of m… ▽ More

    Submitted 25 June, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: 14 pages, 8 figures, Accepted to IEEE TASLP (IEEE Transactions on Audio, Speech and Language Processing, 2025)

  4. Ambisonics Binaural Rendering via Masked Magnitude Least Squares

    Authors: Or Berebi, Fabian Brinkmann, Stefan Weinzierl, Boaz Rafaely

    Abstract: Ambisonics rendering has become an integral part of 3D audio for headphones. It works well with existing recording hardware, the processing cost is mostly independent of the number of sound sources, and it elegantly allows for rotating the scene and listener. One challenge in Ambisonics headphone rendering is to find a perceptually well behaved low-order representation of the Head-Related Transfer… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 5 pages, 4 figures, Accepted to IEEE ICASSP 2025 (IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India, 2025)

  5. arXiv:2410.11453  [pdf, other

    eess.AS cs.SD

    The importance of spatial and spectral information in multiple speaker tracking

    Authors: Hanan Beit-On, Vladimir Tourbabin, Boaz Rafaely

    Abstract: Multi-speaker localization and tracking using microphone array recording is of importance in a wide range of applications. One of the challenges with multi-speaker tracking is to associate direction estimates with the correct speaker. Most existing association approaches rely on spatial or spectral information alone, leading to performance degradation when one of these information channels is part… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  6. arXiv:2409.15484  [pdf, other

    eess.AS cs.SD

    Blind Localization of Early Room Reflections with Arbitrary Microphone Array

    Authors: Yogev Hadadi, Vladimir Tourbabin, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely

    Abstract: Blindly estimating the direction of arrival (DoA) of early room reflections without prior knowledge of the room impulse response or source signal is highly valuable in audio signal processing applications. The FF-PHALCOR (Frequency Focusing PHase ALigned CORrelation) method was recently developed for this purpose, extending the original PHALCOR method to work with arbitrary arrays rather than just… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  7. arXiv:2409.14346  [pdf, other

    eess.AS cs.SD

    Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting

    Authors: Daniel A. Mitchell, Boaz Rafaely, Anurag Kumar, Vladimir Tourbabin

    Abstract: Direction-of-arrival estimation of multiple speakers in a room is an important task for a wide range of applications. In particular, challenging environments with moving speakers, reverberation and noise, lead to significant performance degradation for current methods. With the aim of better understanding factors affecting performance and improving current methods, in this paper multi-speaker dire… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

  8. arXiv:2409.11731  [pdf, other

    eess.AS cs.SD

    Performance and Robustness of Signal-Dependent vs. Signal-Independent Binaural Signal Matching with Wearable Microphone Arrays

    Authors: Ami Berger, Vladimir Tourbabin, Jacob Donley, Zamir Ben-Hur, Boaz Rafaely

    Abstract: The increasing popularity of spatial audio in applications such as teleconferencing, entertainment, and virtual reality has led to the recent developments of binaural reproduction methods. However, only a few of these methods are well-suited for wearable and mobile arrays, which typically consist of a small number of microphones. One such method is binaural signal matching (BSM), which has been sh… ▽ More

    Submitted 14 February, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

  9. arXiv:2408.04288  [pdf, other

    eess.AS cs.SD

    Assessing the Potential Impact of Direction-Dependent HRTF Selection on Sound Localization Accuracy

    Authors: Sapir Goldring, Zamir Ben Hur, David Lou Alon, Boaz Rafaely

    Abstract: This study investigates the approach of direction-dependent selection of Head-Related Transfer Functions (HRTFs) and its impact on sound localization accuracy. For applications such as virtual reality (VR) and teleconferencing, obtaining individualized HRTFs can be beneficial yet challenging, the objective of this work is therefore to assess whether incorporating HRTFs in a direction-dependent man… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Accepted for publication in the 2024 AES International Conference on Audio for Virtual and Augmented Reality, 5 pages, 4 figures

  10. arXiv:2408.03611  [pdf, other

    eess.AS cs.SD

    Feasibility of iMagLS-BSM -- ILD Informed Binaural Signal Matching with Arbitrary Microphone Arrays

    Authors: Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely

    Abstract: Binaural reproduction for headphone-centric listening has become a focal point in ongoing research, particularly within the realm of advancing technologies such as augmented and virtual reality (AR and VR). The demand for high-quality spatial audio in these applications is essential to uphold a seamless sense of immersion. However, challenges arise from wearable recording devices equipped with onl… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Paper accepted for publication in IWAENC 2024, 4 pages, 2 figures

  11. Design and Analysis of Binaural Signal Matching with Arbitrary Microphone Arrays and Listener Head Rotations

    Authors: Lior Madmoni, Zamir Ben-Hur, Jacob Donley, Vladimir Tourbabin, Boaz Rafaely

    Abstract: Binaural reproduction is rapidly becoming a topic of great interest in the research community, especially with the surge of new and popular devices, such as virtual reality headsets, smart glasses, and head-tracked headphones. In order to immerse the listener in a virtual or remote environment with such devices, it is essential to generate realistic and accurate binaural signals. This is challengi… ▽ More

    Submitted 29 April, 2025; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: Published on EURASIP Journal on audio speech and music processing

  12. On HRTF Notch Frequency Prediction Using Anthropometric Features and Neural Networks

    Authors: Lior Arbel, Ishwarya Ananthabhotla, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely

    Abstract: High fidelity spatial audio often performs better when produced using a personalized head-related transfer function (HRTF). However, the direct acquisition of HRTFs is cumbersome and requires specialized equipment. Thus, many personalization methods estimate HRTF features from easily obtained anthropometric features of the pinna, head, and torso. The first HRTF notch frequency (N1) is known to be… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  13. arXiv:2402.18968  [pdf, other

    eess.AS cs.SD

    Ambisonics Networks -- The Effect Of Radial Functions Regularization

    Authors: Bar Shaybet, Anurag Kumar, Vladimir Tourbabin, Boaz Rafaely

    Abstract: Ambisonics, a popular format of spatial audio, is the spherical harmonic (SH) representation of the plane wave density function of a sound field. Many algorithms operate in the SH domain and utilize the Ambisonics as their input signal. The process of encoding Ambisonics from a spherical microphone array involves dividing by the radial functions, which may amplify noise at low frequencies. This ca… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: to be published in Icassp 2024

  14. Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction

    Authors: Yhonatan Gayer, Vladimir Tourbabin, Zamir Ben-Hur, Jacob Donley, Boaz Rafaely

    Abstract: In the rapidly evolving fields of virtual and augmented reality, accurate spatial audio capture and reproduction are essential. For these applications, Ambisonics has emerged as a standard format. However, existing methods for encoding Ambisonics signals from arbitrary microphone arrays face challenges, such as errors due to the irregular array configurations and limited spatial resolution resulti… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted for presentation at HSCMA 2024

  15. arXiv:2401.03493  [pdf, ps, other

    eess.AS cs.SD

    Theory and investigation of acoustic multiple-input multiple-output systems based on spherical arrays in a room

    Authors: Hai Morgenstern, Boaz Rafaely, Franz Zotter

    Abstract: Spatial attributes of room acoustics have been widely studied using microphone and loudspeaker arrays. However, systems that combine both arrays, referred to as multiple-input multiple-output (MIMO) systems, have only been studied to a limited degree in this context. These systems can potentially provide a powerful tool for room acoustics analysis due to the ability to simultaneously control both… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Journal ref: J. Acoust. Soc. Am., vol. 138, no. 5, pp. 2998-3009, November 2015

  16. arXiv:2401.03458  [pdf, ps, other

    eess.AS cs.SD

    Modal smoothing for analysis of room reflections measured with spherical microphone and loudspeaker arrays

    Authors: Hai Morgenstern, Boaz Rafaely

    Abstract: Spatial analysis of room acoustics is an ongoing research topic. Microphone arrays have been employed for spatial analyses with an important objective being the estimation of the direction-of-arrival (DOA) of direct sound and early room reflections using room impulse responses (RIRs). An optimal method for DOA estimation is the multiple signal classification algorithm. When RIRs are considered, th… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Journal ref: J. Acoust. Soc. Am., vol. 143, no. 2, pp. 1008-1018, 2018

  17. Spatial Reverberation and Dereverberation using an Acoustic Multiple-Input Multiple-Output System

    Authors: Hai Morgenstern, Boaz Rafaely

    Abstract: Methods are proposed for modifying the reverberation characteristics of sound fields in rooms by employing a loudspeaker with adjustable directivity, realized with a compact spherical loudspeaker array (SLA). These methods are based on minimization and maximization of clarity and direct-to-reverberant sound ratio. Significant modification of reverberation is achieved by these methods, as shown in… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Journal ref: J. Audio Eng. Soc, vol. 65, no. 1/2, pp. 42-55, 2017

  18. arXiv:2401.03291  [pdf, ps, other

    eess.AS cs.SD

    Design framework for spherical microphone and loudspeaker arrays in a multiple-input multiple-output system

    Authors: Hai Morgenstern, Boaz Rafaely, Markus Noisternig

    Abstract: Spherical microphone arrays (SMAs) and spherical loudspeaker arrays (SLAs) facilitate the study of room acoustics due to the three-dimensional analysis they provide. More recently, systems that combine both arrays, referred to as multiple-input multiple-output (MIMO) systems, have been proposed due to the added spatial diversity they facilitate. The literature provides frameworks for designing SMA… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Journal ref: J. Acoust. Soc. Am. 2017, vol 141, no 3, 2024-2038

  19. arXiv:2401.03286  [pdf, ps, other

    eess.AS cs.RO cs.SD

    Theoretical Framework for the Optimization of Microphone Array Configuration for Humanoid Robot Audition

    Authors: Vladimir Tourbabin, Boaz Rafaely

    Abstract: An important aspect of a humanoid robot is audition. Previous work has presented robot systems capable of sound localization and source segregation based on microphone arrays with various configurations. However, no theoretical framework for the design of these arrays has been presented. In the current paper, a design framework is proposed based on a novel array quality measure. The measure is bas… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Journal ref: in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, no. 12, 1803-1814, 2014

  20. arXiv:2401.02386  [pdf, ps, other

    eess.AS cs.RO cs.SD

    Direction of Arrival Estimation Using Microphone Array Processing for Moving Humanoid Robots

    Authors: Vladimir Tourbabin, Boaz Rafaely

    Abstract: The auditory system of humanoid robots has gained increased attention in recent years. This system typically acquires the surrounding sound field by means of a microphone array. Signals acquired by the array are then processed using various methods. One of the widely applied methods is direction of arrival estimation. The conventional direction of arrival estimation methods assume that the array i… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 11, pp. 2046-2058, Nov. 2015

  21. Optimal Real-Weighted Beamforming With Application to Linear and Spherical Arrays

    Authors: V. Tourbabin, M. Agmon, B. Rafaely, J. Tabrikian

    Abstract: One of the uses of sensor arrays is for spatial filtering or beamforming. Current digital signal processing methods facilitate complex-weighted beamforming, providing flexibility in array design. Previous studies proposed the use of real-valued beamforming weights, which although reduce flexibility in design, may provide a range of benefits, e.g., simplified beamformer implementation or efficient… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: n IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 9, pp. 2575-2585, Nov. 2012

  22. The role of direct sound spherical harmonics representation in externalization using binaural reproduction

    Authors: Eran Miller, Boaz Rafaely

    Abstract: The importance of the information in the direct sound to human perception of spatial sound sources is an ongoing research topic. The classification between direct sound and diffuse or reverberant sound forms the basis of numerous studies in the field of spatial audio. In particular, parametric spatial audio representation methods use this classification and employ signal processing in order to enh… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Journal ref: Applied Acoustics, Volume 148, 2019, Pages 40-45

  23. arXiv:2312.13707  [pdf, other

    eess.AS cs.SD

    Blind Localization of Room Reflections with Application to Spatial Audio

    Authors: Yogev Hadadi, Vladimir Tourbabin, Paul Calamia, Boaz Rafaely

    Abstract: Blind estimation of early room reflections, without knowledge of the room impulse response, holds substantial value. The FF-PHALCOR (Frequency Focusing PHase ALigned CORrelation), method was recently developed for this objective, extending the original PHALCOR method from spherical to arbitrary arrays. However, previous studies only compared the two methods under limited conditions without present… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Journal ref: in 2023 Immersive and 3D Audio: from Architecture to Automotive (I3DA 2023), Bologna, Italy, September 2023

  24. arXiv:2311.16927  [pdf, other

    eess.AS

    Study of speaker localization under dynamic and reverberant environments

    Authors: Daniel A. Mitchell, Boaz Rafaely

    Abstract: Speaker localization in a reverberant environment is a fundamental problem in audio signal processing. Many solutions have been developed to tackle this problem. However, previous algorithms typically assume a stationary environment in which both the microphone array and the sound sources are not moving. With the emergence of wearable microphone arrays, acoustic scenes have become dynamic with mov… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Journal ref: in Proceedings of the 24rd International Congress on Acoustics (ICA 2022), no. ABS-0359, Oct 2022

  25. iMagLS: Interaural Level Difference with Magnitude Least-Squares Loss for Optimized First-Order Head-Related Transfer Function

    Authors: Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely

    Abstract: Binaural reproduction for headphone-based listening is an active research area due to its widespread use in evolving technologies such as augmented and virtual reality (AR and VR). On the one hand, these applications demand high quality spatial audio perception to preserve the sense of immersion. On the other hand, recording devices may only have a few microphones, leading to low-order representat… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 3 pages, 2 figures, Forum Acusticum 2023

  26. arXiv:2311.13390  [pdf, other

    eess.AS

    Performance Analysis Of Binaural Signal Matching (BSM) in the Time-Frequency Domain

    Authors: Ami Berger, Vladimir Tourbabin, Jacob Donley, Zamir Ben-Hur, Boaz Rafaely

    Abstract: The capture and reproduction of spatial audio is becoming increasingly popular, with the mushrooming of applications in teleconferencing, entertainment and virtual reality. Many binaural reproduction methods have been developed and studied extensively for spherical and other specially designed arrays. However, the recent increased popularity of wearable and mobile arrays requires the development o… ▽ More

    Submitted 23 November, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

    Journal ref: in Proceedings of the 24th International Congress on Acoustics (ICA 2022), ABS-0302, 2022

  27. Study of speaker localization with binaural microphone array incorporating auditory filters and lateral angle estimation

    Authors: Yanir Maymon, Israel Nelken, Boaz Rafaely

    Abstract: Speaker localization for binaural microphone arrays has been widely studied for applications such as speech communication, video conferencing, and robot audition. Many methods developed for this task, including the direct path dominance (DPD) test, share common stages in their processing, which include transformation using the short-time Fourier transform (STFT), and a direction of arrival (DOA) s… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Journal ref: Applied Acoustics, Volume 213,2023, 109632

  28. Optimal model-based beamforming and independent steering for spherical loudspeaker arrays

    Authors: Boaz Rafaely, Dima Khaykin

    Abstract: Spherical loudspeaker arrays have been recently studied for directional sound radiation, where the compact arrangement of the loudspeaker units around a sphere facilitated the control of sound radiation in three-dimensional space. Directivity of sound radiation, or beamforming, was achieved by driving each loudspeaker unit independently, where the design of beamforming weights was typically achiev… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Journal ref: in IEEE Trans. Audio, Speech, and Lang. Proc., vol. 19, no. 7, pp. 2234-2238, Sept. 2011

  29. Zones of quiet in a broadband diffuse sound field

    Authors: Boaz Rafaely

    Abstract: The zones of quiet in pure-tone diffuse sound fields have been studied extensively in the past, both theoretically and experimentally, with the well known result of the 10\,dB attenuation extending to about a tenth of a wavelength. Recent results on the spatial-temporal correlation of broadband diffuse sound fields are used in this study to develop a theoretical framework for predicting the extens… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Journal ref: J. Acoust. Soc. Am., vol. 110, no. 1, pp. 296-302, July 2001

  30. Spatial sampling and beamforming for spherical microphone arrays

    Authors: Boaz Rafaely

    Abstract: Spherical microphone arrays have been recently studied for spatial sound recording, speech communication, and sound field analysis for room acoustics and noise control. Complementary theoretical studies presented progress in spatial sampling and beamforming methods. This paper reviews recent results in spatial sampling that facilitate a wide range of spherical array configurations, from a single r… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Journal ref: 2008 Hands-Free Speech Communication and Microphone Arrays, Trento, Italy, 2008, pp. 5-8

  31. Speaker localization using direct path dominance test based on sound field directivity

    Authors: Boaz Rafaely, Koby Alhaiany

    Abstract: Estimation of the direction-of-arrival (DoA) of a speaker in a room is important in many audio signal processing applications. Environments with reverberation that masks the DoA information are particularly challenging. Recently, a DoA estimation method that is robust to reverberation has been developed. This method identifies time-frequency bins dominated by the contribution from the direct path,… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Journal ref: Signal Processing, vol. 143, pp. 42 - 47, 2018

  32. arXiv:1812.04942  [pdf, other

    cs.SD eess.AS

    Description of algorithms for Ben-Gurion University Submission to the LOCATA challenge

    Authors: Lior Madmoni, Hanan Beit-On, Hai Morgenstern, Boaz Rafaely

    Abstract: This paper summarizes the methods used to localize the sources recorded for the LOCalization And TrAcking (LOCATA) challenge. The tasks of stationary sources and arrays were considered, i.e., tasks 1 and 2 of the challenge, which were recorded with the Nao robot array, and the Eigenmike array. For both arrays, direction of arrival (DOA) estimation has been performed with measurements in the short… ▽ More

    Submitted 12 December, 2018; originally announced December 2018.

    Comments: In Proceedings of the LOCATA Challenge Workshop - a satellite event of IWAENC 2018 (arXiv:1811.08482 )

    Report number: LOCATAchallenge/2018/03