PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms

Diener, Lorenz; Purin, Marju; Sootla, Sten; Saabas, Ando; Aichner, Robert; Cutler, Ross

Computer Science > Sound

arXiv:2305.15127 (cs)

[Submitted on 24 May 2023]

Title:PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms

Authors:Lorenz Diener, Marju Purin, Sten Sootla, Ando Saabas, Robert Aichner, Ross Cutler

View PDF

Abstract:Speech quality assessment is a problem for every researcher working on models that produce or process speech. Human subjective ratings, the gold standard in speech quality assessment, are expensive and time-consuming to acquire in a quantity that is sufficient to get reliable data, while automated objective metrics show a low correlation with gold standard ratings. This paper presents PLCMOS, a non-intrusive data-driven tool for generating a robust, accurate estimate of the mean opinion score a human rater would assign an audio file that has been processed by being transmitted over a degraded packet-switched network with missing packets being healed by a packet loss concealment algorithm. Our new model shows a model-wise Pearson's correlation of ~0.97 and rank correlation of ~0.95 with human ratings, substantially above all other available intrusive and non-intrusive metrics. The model is released as an ONNX model for other researchers to use when building PLC systems.

Comments:	to appear: INTERSPEECH 2023, associated model release: this https URL
Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2305.15127 [cs.SD]
	(or arXiv:2305.15127v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2305.15127

Submission history

From: Lorenz Diener [view email]
[v1] Wed, 24 May 2023 13:21:22 UTC (78 KB)

Computer Science > Sound

Title:PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:PLCMOS -- a data-driven non-intrusive metric for the evaluation of packet loss concealment algorithms

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators