Skip to main content

Showing 1–4 of 4 results for author: Pacheco, R G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.05929  [pdf, ps, other

    cs.NI cs.LG

    Improving Image-recognition Edge Caches with a Generative Adversarial Network

    Authors: Guilherme B. Souza, Roberto G. Pacheco, Rodrigo S. Couto

    Abstract: Image recognition is an essential task in several mobile applications. For instance, a smartphone can process a landmark photo to gather more information about its location. If the device does not have enough computational resources available, it offloads the processing task to a cloud infrastructure. Although this approach solves resource shortages, it introduces a communication delay. Image-reco… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: to appear in Proc. IEEE International Conference on Communications (ICC) 2022

  2. arXiv:2108.09343  [pdf, other

    cs.CV cs.LG cs.NI

    Early-exit deep neural networks for distorted images: providing an efficient edge offloading

    Authors: Roberto G. Pacheco, Fernanda D. V. R. Oliveira, Rodrigo S. Couto

    Abstract: Edge offloading for deep neural networks (DNNs) can be adaptive to the input's complexity by using early-exit DNNs. These DNNs have side branches throughout their architecture, allowing the inference to end earlier in the edge. The branches estimate the accuracy for a given input. If this estimated accuracy reaches a threshold, the inference ends on the edge. Otherwise, the edge offloads the infer… ▽ More

    Submitted 25 August, 2021; v1 submitted 20 August, 2021; originally announced August 2021.

    Comments: to appear in Proc. IEEE Global Communications Conference (GLOBECOM) 2021

  3. arXiv:2010.16335  [pdf, other

    cs.LG cs.NI

    Calibration-Aided Edge Inference Offloading via Adaptive Model Partitioning of Deep Neural Networks

    Authors: Roberto G. Pacheco, Rodrigo S. Couto, Osvaldo Simeone

    Abstract: Mobile devices can offload deep neural network (DNN)-based inference to the cloud, overcoming local hardware and energy limitations. However, offloading adds communication delay, thus increasing the overall inference time, and hence it should be used only when needed. An approach to address this problem consists of the use of adaptive model partitioning based on early-exit DNNs. Accordingly, the i… ▽ More

    Submitted 28 January, 2021; v1 submitted 30 October, 2020; originally announced October 2020.

    Comments: to appear in Proc. IEEE International Conference on Communications (ICC) 2021

  4. Inference Time Optimization Using BranchyNet Partitioning

    Authors: Roberto G. Pacheco, Rodrigo S. Couto

    Abstract: Deep Neural Network (DNN) applications with edge computing presents a trade-off between responsiveness and computational resources. On one hand, edge computing can provide high responsiveness deploying computational resources close to end devices, which may be prohibitive for the majority of cloud computing services. On the other hand, DNN inference requires computational power to be executed, whi… ▽ More

    Submitted 10 June, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: 8 pages, 11 figures, IEEE Symposium on Computers and Communications 2020