Fast convolutional neural networks on FPGAs with hls4ml

Aarrestad, Thea; Loncar, Vladimir; Ghielmetti, Nicolò; Pierini, Maurizio; Summers, Sioni; Ngadiuba, Jennifer; Petersson, Christoffer; Linander, Hampus; Iiyama, Yutaro; Di Guglielmo, Giuseppe; Duarte, Javier; Harris, Philip; Rankin, Dylan; Jindariani, Sergo; Pedro, Kevin; Tran, Nhan; Liu, Mia; Kreinar, Edward; Wu, Zhenbin; Hoang, Duc

doi:10.1088/2632-2153/ac0ea1

arXiv:2101.05108 (cs)

[Submitted on 13 Jan 2021 (v1), last revised 29 Apr 2021 (this version, v2)]

Abstract: We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with convolutional layers on FPGAs. By extending the hls4ml library, we demonstrate an inference latency of $5\,\mu$s using convolutional architectures, targeting microsecond-latency applications like those at the CERN Large Hadron Collider. Considering benchmark models trained on the Street View House Numbers Dataset, we demonstrate various methods for model compression in order to fit the computational constraints of a typical FPGA device used in trigger and data acquisition systems of particle detectors. In particular, we discuss pruning and quantization-aware training, and demonstrate how resource utilization can be significantly reduced with little to no loss in model accuracy. We show that the FPGA critical resource consumption can be reduced by 97% with zero loss in model accuracy, and by 99% when tolerating a 6% accuracy degradation.
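To make the two compression ideas named in the abstract concrete, the following is a minimal pure-Python sketch of magnitude-based pruning and fixed-point rounding. It is illustrative only: the function names `magnitude_prune` and `quantize_fixed_point`, the example weights, and the chosen bit widths are assumptions made here, not part of hls4ml and not taken from the paper. Quantization-aware training applies this kind of rounding during training; the sketch shows only the rounding step on already-trained values.

```python
def magnitude_prune(weights, sparsity):
    """Return a copy of `weights` with the smallest-magnitude fraction set to zero.

    Hypothetical helper for illustration; `sparsity` is the fraction to remove.
    """
    n_prune = int(len(weights) * sparsity)
    # Indices ordered by absolute value, smallest first
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def quantize_fixed_point(x, total_bits, int_bits):
    """Round `x` to a signed fixed-point grid and saturate at the range limits.

    Assumed convention: `int_bits` counts the sign bit, and the remaining
    `total_bits - int_bits` bits hold the fractional part.
    """
    frac_bits = total_bits - int_bits
    scale = 2 ** frac_bits
    lowest = -(2 ** (total_bits - 1))
    highest = 2 ** (total_bits - 1) - 1
    q = max(lowest, min(highest, round(x * scale)))
    return q / scale


# Made-up example weights, not data from the paper
weights = [0.8, -0.05, 0.3, 0.01, -0.6, 0.12]
pruned = magnitude_prune(weights, sparsity=0.5)
quantized = [quantize_fixed_point(w, total_bits=6, int_bits=2) for w in pruned]
print(pruned)     # [0.8, 0.0, 0.3, 0.0, -0.6, 0.0]
print(quantized)  # [0.8125, 0.0, 0.3125, 0.0, -0.625, 0.0]
```

Zeroed weights need no multiplier at all, and narrower weights need smaller arithmetic units, which is the general reason both techniques can reduce FPGA resource usage.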

Comments:	18 pages, 18 figures, 4 tables
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); High Energy Physics - Experiment (hep-ex); Instrumentation and Detectors (physics.ins-det); Machine Learning (stat.ML)
Cite as:	arXiv:2101.05108 [cs.LG]
	(or arXiv:2101.05108v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2101.05108
Journal reference:	Mach. Learn.: Sci. Technol. 2 045015 (2021)
Related DOI:	https://doi.org/10.1088/2632-2153/ac0ea1

Submission history

From: Thea Aarrestad [view email]
[v1] Wed, 13 Jan 2021 14:47:11 UTC (6,581 KB)
[v2] Thu, 29 Apr 2021 11:30:02 UTC (5,360 KB)
