Training of Quantized Deep Neural Networks using a Magnetic Tunnel Junction-Based Synapse

Toledo, Tzofnat Greenberg; Perach, Ben; Hubara, Itay; Soudry, Daniel; Kvatinsky, Shahar

doi:10.1088/1361-6641/ac251b

Computer Science > Emerging Technologies

arXiv:1912.12636 (cs)

[Submitted on 29 Dec 2019 (v1), last revised 29 May 2022 (this version, v2)]

Title:Training of Quantized Deep Neural Networks using a Magnetic Tunnel Junction-Based Synapse

Authors:Tzofnat Greenberg Toledo, Ben Perach, Itay Hubara, Daniel Soudry, Shahar Kvatinsky

View PDF

Abstract:Quantized neural networks (QNNs) are being actively researched as a solution for the computational complexity and memory intensity of deep neural networks. This has sparked efforts to develop algorithms that support both inference and training with quantized weight and activation values, without sacrificing accuracy. A recent example is the GXNOR framework for stochastic training of ternary (TNN) and binary (BNN) neural networks. In this paper, we show how magnetic tunnel junction (MTJ) devices can be used to support QNN training. We introduce a novel hardware synapse circuit that uses the MTJ stochastic behavior to support the quantize update. The proposed circuit enables processing near memory (PNM) of QNN training, which subsequently reduces data movement. We simulated MTJ-based stochastic training of a TNN over the MNIST, SVHN, and CIFAR10 datasets and achieved an accuracy of 98.61%, 93.99% and 82.71%, respectively (less than 1% degradation compared to the GXNOR algorithm). We evaluated the synapse array performance potential and showed that the proposed synapse circuit can train ternary networks in situ, with 18.3TOPs/W for feedforward and 3TOPs/W for weight update.

Comments:	Published in Semiconductor Science and Technology, Vol 36
Subjects:	Emerging Technologies (cs.ET); Hardware Architecture (cs.AR); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1912.12636 [cs.ET]
	(or arXiv:1912.12636v2 [cs.ET] for this version)
	https://doi.org/10.48550/arXiv.1912.12636
Journal reference:	Semicond. Sci. Technol. 36 114003 (2021)
Related DOI:	https://doi.org/10.1088/1361-6641/ac251b

Submission history

From: Tzofnat Greenberg-Toledo [view email]
[v1] Sun, 29 Dec 2019 11:36:32 UTC (1,504 KB)
[v2] Sun, 29 May 2022 07:16:41 UTC (2,731 KB)

Computer Science > Emerging Technologies

Title:Training of Quantized Deep Neural Networks using a Magnetic Tunnel Junction-Based Synapse

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Emerging Technologies

Title:Training of Quantized Deep Neural Networks using a Magnetic Tunnel Junction-Based Synapse

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators