Robust Learning from Untrusted Sources

Konstantinov, Nikola; Lampert, Christoph

Computer Science > Machine Learning

arXiv:1901.10310 (cs)

[Submitted on 29 Jan 2019 (v1), last revised 17 May 2019 (this version, v2)]

Title:Robust Learning from Untrusted Sources

Authors:Nikola Konstantinov, Christoph Lampert

View PDF

Abstract:Modern machine learning methods often require more data for training than a single expert can provide. Therefore, it has become a standard procedure to collect data from external sources, e.g. via crowdsourcing. Unfortunately, the quality of these sources is not always guaranteed. As additional complications, the data might be stored in a distributed way, or might even have to remain private. In this work, we address the question of how to learn robustly in such scenarios. Studying the problem through the lens of statistical learning theory, we derive a procedure that allows for learning from all available sources, yet automatically suppresses irrelevant or corrupted data. We show by extensive experiments that our method provides significant improvements over alternative approaches from robust statistics and distributed optimization.

Comments:	Accepted to International Conference on Machine Learning (ICML), 2019; Camera-ready version
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1901.10310 [cs.LG]
	(or arXiv:1901.10310v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1901.10310

Submission history

From: Nikola Konstantinov [view email]
[v1] Tue, 29 Jan 2019 14:33:42 UTC (82 KB)
[v2] Fri, 17 May 2019 16:21:28 UTC (2,621 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-01

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Nikola Konstantinov
Christoph Lampert

export BibTeX citation

Computer Science > Machine Learning

Title:Robust Learning from Untrusted Sources

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robust Learning from Untrusted Sources

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators