HPIPE: Heterogeneous Layer-Pipelined and Sparse-Aware CNN Inference for FPGAs

Hall, Mathew; Betz, Vaughn

Computer Science > Hardware Architecture

arXiv:2007.10451 (cs)

[Submitted on 20 Jul 2020]

Title:HPIPE: Heterogeneous Layer-Pipelined and Sparse-Aware CNN Inference for FPGAs

Authors:Mathew Hall, Vaughn Betz

View PDF

Abstract:We present both a novel Convolutional Neural Network (CNN) accelerator architecture and a network compiler for FPGAs that outperforms all prior work. Instead of having generic processing elements that together process one layer at a time, our network compiler statically partitions available device resources and builds custom-tailored hardware for each layer of a CNN. By building hardware for each layer we can pack our controllers into fewer lookup tables and use dedicated routing. These efficiencies enable our accelerator to utilize 2x the DSPs and operate at more than 2x the frequency of prior work on sparse CNN acceleration on FPGAs. We evaluate the performance of our architecture on both sparse Resnet-50 and dense MobileNet Imagenet classifiers on a Stratix 10 2800 FPGA. We find that the sparse Resnet-50 model has throughput at a batch size of 1 of 4550 images/s, which is nearly 4x the throughput of NVIDIA's fastest machine learning targeted GPU, the V100, and outperforms all prior work on FPGAs.

Comments:	8 Pages, 11 Figures
Subjects:	Hardware Architecture (cs.AR)
ACM classes:	B.5.1
Cite as:	arXiv:2007.10451 [cs.AR]
	(or arXiv:2007.10451v1 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2007.10451

Submission history

From: Mathew Hall [view email]
[v1] Mon, 20 Jul 2020 20:17:58 UTC (7,073 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AR

< prev | next >

new | recent | 2020-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

export BibTeX citation

Computer Science > Hardware Architecture

Title:HPIPE: Heterogeneous Layer-Pipelined and Sparse-Aware CNN Inference for FPGAs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Hardware Architecture

Title:HPIPE: Heterogeneous Layer-Pipelined and Sparse-Aware CNN Inference for FPGAs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators