Parallelizing Over Artificial Neural Network Training Runs with Multigrid

Schroder, Jacob B.

Computer Science > Numerical Analysis

arXiv:1708.02276 (cs)

[Submitted on 7 Aug 2017 (v1), last revised 1 Oct 2017 (this version, v2)]

Title:Parallelizing Over Artificial Neural Network Training Runs with Multigrid

Authors:Jacob B. Schroder

View PDF

Abstract:Artificial neural networks are a popular and effective machine learning technique. Great progress has been made parallelizing the expensive training phase of an individual network, leading to highly specialized pieces of hardware, many based on GPU-type architectures, and more concurrent algorithms such as synthetic gradients. However, the training phase continues to be a bottleneck, where the training data must be processed serially over thousands of individual training runs. This work considers a multigrid reduction in time (MGRIT) algorithm that is able to parallelize over the thousands of training runs and converge to the exact same solution as traditional training would provide. MGRIT was originally developed to provide parallelism for time evolution problems that serially step through a finite number of time-steps. This work recasts the training of a neural network similarly, treating neural network training as an evolution equation that evolves the network weights from one step to the next. Thus, this work concerns distributed computing approaches for neural networks, but is distinct from other approaches which seek to parallelize only over individual training runs. The work concludes with supporting numerical results for two model problems.

Comments:	Version 2: - Added more complete references to basic neural network literature - Corrected typos - Condensed results in Section 3 to be more concise - 22 pages
Subjects:	Numerical Analysis (math.NA); Machine Learning (cs.LG)
Report number:	LLNL-JRNL-736173
Cite as:	arXiv:1708.02276 [cs.NA]
	(or arXiv:1708.02276v2 [cs.NA] for this version)
	https://doi.org/10.48550/arXiv.1708.02276

Submission history

From: Jacob Schroder [view email]
[v1] Mon, 7 Aug 2017 19:42:24 UTC (532 KB)
[v2] Sun, 1 Oct 2017 18:14:19 UTC (500 KB)

Computer Science > Numerical Analysis

Title:Parallelizing Over Artificial Neural Network Training Runs with Multigrid

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Numerical Analysis

Title:Parallelizing Over Artificial Neural Network Training Runs with Multigrid

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators