Skip to main content

Showing 1–3 of 3 results for author: Zacharov, N

Searching in archive eess. Search in all archives.
.
  1. arXiv:2502.05139  [pdf, other

    cs.SD cs.LG eess.AS

    Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

    Authors: Andros Tjandra, Yi-Chiao Wu, Baishan Guo, John Hoffman, Brian Ellis, Apoorv Vyas, Bowen Shi, Sanyuan Chen, Matt Le, Nick Zacharov, Carleigh Wood, Ann Lee, Wei-Ning Hsu

    Abstract: The quantification of audio aesthetics remains a complex challenge in audio processing, primarily due to its subjective nature, which is influenced by human perception and cultural context. Traditional methods often depend on human listeners for evaluation, leading to inconsistencies and high resource demands. This paper addresses the growing need for automated systems capable of predicting audio… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: Repository: https://github.com/facebookresearch/audiobox-aesthetics Website: https://ai.meta.com/research/publications/meta-audiobox-aesthetics-unified-automatic-quality-assessment-for-speech-music-and-sound/

  2. arXiv:2205.08007  [pdf, other

    cs.MM cs.SD eess.AS eess.IV

    Perceptual Evaluation on Audio-visual Dataset of 360 Content

    Authors: Randy F Fela, Andréas Pastor, Patrick Le Callet, Nick Zacharov, Toinon Vigier, Søren Forchhammer

    Abstract: To open up new possibilities to assess the multimodal perceptual quality of omnidirectional media formats, we proposed a novel open source 360 audiovisual (AV) quality dataset. The dataset consists of high-quality 360 video clips in equirectangular (ERP) format and higher-order ambisonic (4th order) along with the subjective scores. Three subjective quality experiments were conducted for audio, vi… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 6 pages, 5 figures, International Conference on Multimedia and Expo 2022

  3. arXiv:2112.12273  [pdf, other

    cs.MM cs.SD eess.AS

    Perceptual Evaluation of 360 Audiovisual Quality and Machine Learning Predictions

    Authors: Randy Frans Fela, Nick Zacharov, Søren Forchhammer

    Abstract: In an earlier study, we gathered perceptual evaluations of the audio, video, and audiovisual quality for 360 audiovisual content. This paper investigates perceived audiovisual quality prediction based on objective quality metrics and subjective scores of 360 video and spatial audio content. Thirteen objective video quality metrics and three objective audio quality metrics were evaluated for five s… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.