SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

Laskaridis, Stefanos; Venieris, Stylianos I.; Almeida, Mario; Leontiadis, Ilias; Lane, Nicholas D.

doi:10.1145/3372224.3419194

Computer Science > Machine Learning

arXiv:2008.06402 (cs)

[Submitted on 14 Aug 2020 (v1), last revised 24 Aug 2020 (this version, v2)]

Title:SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

Authors:Stefanos Laskaridis, Stylianos I. Venieris, Mario Almeida, Ilias Leontiadis, Nicholas D. Lane

View PDF

Abstract:Despite the soaring use of convolutional neural networks (CNNs) in mobile applications, uniformly sustaining high-performance inference on mobile has been elusive due to the excessive computational demands of modern CNNs and the increasing diversity of deployed devices. A popular alternative comprises offloading CNN processing to powerful cloud-based servers. Nevertheless, by relying on the cloud to produce outputs, emerging mission-critical and high-mobility applications, such as drone obstacle avoidance or interactive applications, can suffer from the dynamic connectivity conditions and the uncertain availability of the cloud. In this paper, we propose SPINN, a distributed inference system that employs synergistic device-cloud computation together with a progressive inference method to deliver fast and robust CNN inference across diverse settings. The proposed system introduces a novel scheduler that co-optimises the early-exit policy and the CNN splitting at run time, in order to adapt to dynamic conditions and meet user-defined service-level requirements. Quantitative evaluation illustrates that SPINN outperforms its state-of-the-art collaborative inference counterparts by up to 2x in achieved throughput under varying network conditions, reduces the server cost by up to 6.8x and improves accuracy by 20.7% under latency constraints, while providing robust operation under uncertain connectivity conditions and significant energy savings compared to cloud-centric execution.

Comments:	Accepted at the 26th Annual International Conference on Mobile Computing and Networking (MobiCom), 2020
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
Cite as:	arXiv:2008.06402 [cs.LG]
	(or arXiv:2008.06402v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2008.06402
Related DOI:	https://doi.org/10.1145/3372224.3419194

Submission history

From: Stefanos Laskaridis [view email]
[v1] Fri, 14 Aug 2020 15:00:19 UTC (2,571 KB)
[v2] Mon, 24 Aug 2020 10:24:41 UTC (4,225 KB)

Computer Science > Machine Learning

Title:SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators