Nondeterminism and Instability in Neural Network Optimization

Summers, Cecilia; Dinneen, Michael J.

Computer Science > Machine Learning

arXiv:2103.04514 (cs)

[Submitted on 8 Mar 2021 (v1), last revised 10 Jul 2021 (this version, v3)]

Title:Nondeterminism and Instability in Neural Network Optimization

Authors:Cecilia Summers, Michael J. Dinneen

View PDF

Abstract:Nondeterminism in neural network optimization produces uncertainty in performance, making small improvements difficult to discern from run-to-run variability. While uncertainty can be reduced by training multiple model copies, doing so is time-consuming, costly, and harms reproducibility. In this work, we establish an experimental protocol for understanding the effect of optimization nondeterminism on model diversity, allowing us to isolate the effects of a variety of sources of nondeterminism. Surprisingly, we find that all sources of nondeterminism have similar effects on measures of model diversity. To explain this intriguing fact, we identify the instability of model training, taken as an end-to-end procedure, as the key determinant. We show that even one-bit changes in initial parameters result in models converging to vastly different values. Last, we propose two approaches for reducing the effects of instability on run-to-run variability.

Comments:	ICML 2021
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2103.04514 [cs.LG]
	(or arXiv:2103.04514v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2103.04514

Submission history

From: Cecilia Summers [view email]
[v1] Mon, 8 Mar 2021 02:28:18 UTC (103 KB)
[v2] Tue, 1 Jun 2021 00:54:16 UTC (2,268 KB)
[v3] Sat, 10 Jul 2021 21:58:40 UTC (2,268 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Cecilia Summers
Michael J. Dinneen

export BibTeX citation

Computer Science > Machine Learning

Title:Nondeterminism and Instability in Neural Network Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Nondeterminism and Instability in Neural Network Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators