Reducing overfitting in challenge-based competitions

Neto, Elias Chaibub; Hoff, Bruce R; Bare, Chris; Bot, Brian M; Yu, Thomas; Magravite, Lara; Trister, Andrew D; Norman, Thea; Meyer, Pablo; Saez-Rodrigues, Julio; Costello, James C; Guinney, Justin; Stolovitzky, Gustavo

Statistics > Applications

arXiv:1607.00091 (stat)

[Submitted on 1 Jul 2016]

Title:Reducing overfitting in challenge-based competitions

Authors:Elias Chaibub Neto, Bruce R Hoff, Chris Bare, Brian M Bot, Thomas Yu, Lara Magravite, Andrew D Trister, Thea Norman, Pablo Meyer, Julio Saez-Rodrigues, James C Costello, Justin Guinney, Gustavo Stolovitzky

View PDF

Abstract:Over-fitting is a dreaded foe in challenge-based competitions. Because participants rely on public leaderboards to evaluate and refine their models, there is always the danger they might over-fit to the holdout data supporting the leaderboard. The recently published Ladder algorithm aims to address this problem by preventing the participants from exploiting willingly or inadvertently minor fluctuations in public leaderboard scores during model refinement. In this paper, we report a vulnerability of the Ladder that induces severe over-fitting of the leaderboard when the sample size is small. To circumvent this attack, we propose a variation of the Ladder that releases a bootstrapped estimate of the public leaderboard score instead of providing participants with a direct measure of performance. We also extend the scope of the Ladder to arbitrary performance metrics by relying on a more broadly applicable testing procedure based on the Bayesian bootstrap. Our method makes it possible to use a leaderboard, with the technical and social advantages that it provides, even in cases where data is scant.

Subjects:	Applications (stat.AP)
Cite as:	arXiv:1607.00091 [stat.AP]
	(or arXiv:1607.00091v1 [stat.AP] for this version)
	https://doi.org/10.48550/arXiv.1607.00091

Submission history

From: Elias Chaibub Neto [view email]
[v1] Fri, 1 Jul 2016 01:10:35 UTC (788 KB)

Statistics > Applications

Title:Reducing overfitting in challenge-based competitions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Applications

Title:Reducing overfitting in challenge-based competitions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators