xTern: Energy-Efficient Ternary Neural Network Inference on RISC-V-Based Edge Systems

Rutishauser, Georg; Mihali, Joan; Scherer, Moritz; Benini, Luca

arXiv:2405.19065 (cs)

[Submitted on 29 May 2024]

Abstract: Ternary neural networks (TNNs) offer a superior accuracy-energy trade-off compared to binary neural networks. However, until now, they have required specialized accelerators to realize their efficiency potential, which has hindered widespread adoption. To address this, we present xTern, a lightweight extension of the RISC-V instruction set architecture (ISA) targeted at accelerating TNN inference on general-purpose cores. To complement the ISA extension, we developed a set of optimized kernels leveraging xTern, achieving 67% higher throughput than their 2-bit equivalents. Power consumption increases only marginally, by 5.2%, resulting in an energy efficiency improvement of 57.1%. We demonstrate that the proposed xTern extension, integrated into an octa-core compute cluster, incurs a minimal silicon area overhead of 0.9% with no impact on timing. In end-to-end benchmarks, we demonstrate that xTern enables the deployment of TNNs achieving up to 1.6 percentage points higher CIFAR-10 classification accuracy than 2-bit networks at equal inference latency. Our results show that xTern enables RISC-V-based ultra-low-power edge AI platforms to benefit from the efficiency potential of TNNs.

Comments:	Accepted for publication at IEEE ASAP 2024
Subjects:	Hardware Architecture (cs.AR); Machine Learning (cs.LG)
Cite as:	arXiv:2405.19065 [cs.AR]
	(or arXiv:2405.19065v1 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2405.19065

Submission history

From: Georg Rutishauser
[v1] Wed, 29 May 2024 13:16:46 UTC (1,619 KB)
