GPTPU: Accelerating Applications using Edge Tensor Processing Units

Hsu, Kuan-Chieh; Tseng, Hung-Wei

arXiv:2107.05473 (cs)

[Submitted on 22 Jun 2021 (v1), last revised 13 Jul 2021 (this version, v2)]

Abstract: Neural network (NN) accelerators have been integrated into a wide spectrum of computer systems to accommodate the rapidly growing demands for artificial intelligence (AI) and machine learning (ML) applications. NN accelerators share the idea of providing native hardware support for operations on multidimensional tensor data. Therefore, NN accelerators are theoretically tensor processors that can improve system performance for any problem that uses tensors as inputs/outputs. Unfortunately, commercially available NN accelerators only expose computation capabilities through AI/ML-specific interfaces. Furthermore, NN accelerators reveal very few hardware design details, so applications cannot easily leverage the tensor operations NN accelerators provide.
This paper introduces General-Purpose Computing on Edge Tensor Processing Units (GPTPU), an open-source, open-architecture framework that allows the developer and research communities to discover opportunities that NN accelerators enable for applications. GPTPU includes a powerful programming interface with efficient runtime system-level support -- similar to that of CUDA/OpenCL in GPGPU computing -- to bridge the gap between application demands and mismatched hardware/software interfaces.
We built a GPTPU machine that uses Edge Tensor Processing Units (Edge TPUs), which are widely available and representative of many commercial NN accelerators. We identified several novel use cases and revisited the algorithms. By leveraging the underlying Edge TPUs to perform tensor-algorithm-based compute kernels, our results reveal that GPTPU can achieve a 2.46x speedup over high-end CPUs and reduce energy consumption by 40%.

Comments:	This paper is a pre-print of a paper in the 2021 SC, the International Conference for High Performance Computing, Networking, Storage and Analysis
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF); Signal Processing (eess.SP)
Cite as:	arXiv:2107.05473 [cs.DC]
	(or arXiv:2107.05473v2 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2107.05473

Submission history

From: Hung-Wei Tseng
[v1] Tue, 22 Jun 2021 18:03:22 UTC (17,943 KB)
[v2] Tue, 13 Jul 2021 06:04:21 UTC (17,942 KB)
