MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU

Cao, Qingqing; Balasubramanian, Niranjan; Balasubramanian, Aruna

arXiv:1706.00878 (cs)

[Submitted on 3 Jun 2017]

Abstract: In this paper, we explore optimizations to run Recurrent Neural Network (RNN) models locally on mobile devices. RNN models are widely used for Natural Language Processing, Machine Translation, and other tasks. However, existing mobile applications that use RNN models run them in the cloud. To address privacy and efficiency concerns, we show how RNN models can be run locally on mobile devices. Existing work on porting deep learning models to mobile devices focuses on Convolutional Neural Networks (CNNs) and cannot be applied directly to RNN models. In response, we present MobiRNN, a mobile-specific optimization framework that implements GPU offloading specifically for mobile GPUs. Evaluations using an RNN model for activity recognition show that MobiRNN significantly decreases the latency of running RNN models on phones.

Comments:	Published at 1st International Workshop on Embedded and Mobile Deep Learning colocated with MobiSys 2017
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1706.00878 [cs.DC]
	(or arXiv:1706.00878v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.1706.00878

Submission history

From: Qingqing Cao [view email]
[v1] Sat, 3 Jun 2017 00:48:12 UTC (168 KB)
