Bias-Aware Loss for Training Image and Speech Quality Prediction Models from Multiple Datasets

Mittag, Gabriel; Zadtootaghaj, Saman; Michael, Thilo; Naderi, Babak; Möller, Sebastian

doi:10.1109/QoMEX51781.2021.9465384

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2104.10217 (eess)

[Submitted on 20 Apr 2021]

Title:Bias-Aware Loss for Training Image and Speech Quality Prediction Models from Multiple Datasets

Authors:Gabriel Mittag, Saman Zadtootaghaj, Thilo Michael, Babak Naderi, Sebastian Möller

View PDF

Abstract:The ground truth used for training image, video, or speech quality prediction models is based on the Mean Opinion Scores (MOS) obtained from subjective experiments. Usually, it is necessary to conduct multiple experiments, mostly with different test participants, to obtain enough data to train quality models based on machine learning. Each of these experiments is subject to an experiment-specific bias, where the rating of the same file may be substantially different in two experiments (e.g. depending on the overall quality distribution). These different ratings for the same distortion levels confuse neural networks during training and lead to lower performance. To overcome this problem, we propose a bias-aware loss function that estimates each dataset's biases during training with a linear function and considers it while optimising the network weights. We prove the efficiency of the proposed method by training and validating quality prediction models on synthetic and subjective image and speech quality datasets.

Comments:	Accepted at QoMEX 2021
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Image and Video Processing (eess.IV)
Cite as:	arXiv:2104.10217 [eess.AS]
	(or arXiv:2104.10217v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2104.10217
Related DOI:	https://doi.org/10.1109/QoMEX51781.2021.9465384

Submission history

From: Gabriel Mittag [view email]
[v1] Tue, 20 Apr 2021 19:20:11 UTC (196 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Bias-Aware Loss for Training Image and Speech Quality Prediction Models from Multiple Datasets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Bias-Aware Loss for Training Image and Speech Quality Prediction Models from Multiple Datasets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators