Impact of GPU uncertainty on the training of predictive deep neural networks

Pietrowski, Maciej; Gajda, Andrzej; Yamamoto, Takuto; Kobayashi, Taisuke; Sinapayen, Lana; Watanabe, Eiji

Computer Science > Machine Learning

arXiv:2109.01451 (cs)

This paper has been withdrawn by Lana Sinapayen

[Submitted on 3 Sep 2021 (v1), last revised 6 Oct 2021 (this version, v4)]

Title:Impact of GPU uncertainty on the training of predictive deep neural networks

Authors:Maciej Pietrowski, Andrzej Gajda, Takuto Yamamoto, Taisuke Kobayashi, Lana Sinapayen, Eiji Watanabe

No PDF available, click to view other formats

Abstract:[retracted] We found out that the difference was dependent on the Chainer library, and does not replicate with another library (pytorch) which indicates that the results are probably due to a bug in Chainer, rather than being hardware-dependent. -- old abstract Deep neural networks often present uncertainties such as hardware- and software-derived noise and randomness. We studied the effects of such uncertainty on learning outcomes, with a particular focus on the function of graphics processing units (GPUs), and found that GPU-induced uncertainty increased learning accuracy of a certain deep neural network. When training a predictive deep neural network using only the CPU without the GPU, the learning error is higher than when training the same number of epochs using the GPU, suggesting that the GPU plays a different role in the learning process than just increasing the computational speed. Because this effect cannot be observed in learning by a simple autoencoder, it could be a phenomenon specific to certain types of neural networks. GPU-specific computational processing is more indeterminate than that by CPUs, and hardware-derived uncertainties, which are often considered obstacles that need to be eliminated, might, in some cases, be successfully incorporated into the training of deep neural networks. Moreover, such uncertainties might be interesting phenomena to consider in brain-related computational processing, which comprises a large mass of uncertain signals.

Comments:	The results obtained in Chainer did not replicate with a different python library, pointing to a software bug rather than hardware cause. The title and discussion of the paper are therefore irrelevant to the real cause
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Performance (cs.PF)
Cite as:	arXiv:2109.01451 [cs.LG]
	(or arXiv:2109.01451v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.01451

Submission history

From: Lana Sinapayen [view email]
[v1] Fri, 3 Sep 2021 11:21:40 UTC (1,934 KB)
[v2] Mon, 20 Sep 2021 09:33:02 UTC (1,934 KB)
[v3] Sat, 25 Sep 2021 03:07:34 UTC (2,079 KB)
[v4] Wed, 6 Oct 2021 07:09:40 UTC (1 KB) (withdrawn)

Computer Science > Machine Learning

Title:Impact of GPU uncertainty on the training of predictive deep neural networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Impact of GPU uncertainty on the training of predictive deep neural networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators