Dense Limit of the Dawid-Skene Model for Crowdsourcing and Regions of Sub-optimality of Message Passing Algorithms

Schmidt, Christian; Zdeborová, Lenka

doi:10.1088/1751-8121/ab757f

Statistics > Machine Learning

arXiv:1803.04924 (stat)

[Submitted on 13 Mar 2018 (v1), last revised 15 Mar 2018 (this version, v2)]

Title:Dense Limit of the Dawid-Skene Model for Crowdsourcing and Regions of Sub-optimality of Message Passing Algorithms

Authors:Christian Schmidt, Lenka Zdeborová

View PDF

Abstract:Crowdsourcing is a strategy to categorize data through the contribution of many individuals. A wide range of theoretical and algorithmic contributions are based on the model of Dawid and Skene [1]. Recently it was shown in [2,3] that, in certain regimes, belief propagation is asymptotically optimal for data generated from the Dawid-Skene model. This paper is motivated by this recent progress. We analyze the dense limit of the Dawid-Skene model. It is shown that it belongs to a larger class of low-rank matrix estimation problems for which it is possible to express the asymptotic, Bayes-optimal, performance in a simple closed form. In the dense limit the mapping to a low-rank matrix estimation problem provides an approximate message passing algorithm that solves the problem algorithmically. We identify the regions where the algorithm efficiently computes the Bayes-optimal estimates. Our analysis refines the results of [2,3] about optimality of message passing algorithms by characterizing regions of parameters where these algorithms do not match the Bayes-optimal performance. We further study numerically the performance of approximate message passing, derived in the dense limit, on sparse instances and carry out experiments on a real world dataset.

Comments:	16 pages, 7 figures
Subjects:	Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Data Analysis, Statistics and Probability (physics.data-an)
Cite as:	arXiv:1803.04924 [stat.ML]
	(or arXiv:1803.04924v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1803.04924
Journal reference:	J. Phys. A: Math. Theor. 53 124001 (2020)
Related DOI:	https://doi.org/10.1088/1751-8121/ab757f

Submission history

From: Hinnerk Christian Schmidt [view email]
[v1] Tue, 13 Mar 2018 16:31:37 UTC (1,005 KB)
[v2] Thu, 15 Mar 2018 11:46:09 UTC (1,005 KB)

Statistics > Machine Learning

Title:Dense Limit of the Dawid-Skene Model for Crowdsourcing and Regions of Sub-optimality of Message Passing Algorithms

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Dense Limit of the Dawid-Skene Model for Crowdsourcing and Regions of Sub-optimality of Message Passing Algorithms

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators