Showing 1–2 of 2 results for author: Stuart, D M

  1. arXiv:1803.03688  [pdf, other]

    cs.NE

    Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How

    Authors: Alberto Delmas, Patrick Judd, Dylan Malone Stuart, Zissis Poulos, Mostafa Mahmoud, Sayeh Sharify, Milos Nikolic, Andreas Moshovos

    Abstract: We show that, during inference with Convolutional Neural Networks (CNNs), more than 2x to 8x ineffectual work can be exposed if instead of targeting those weights and activations that are zero, we target different combinations of value stream properties. We demonstrate a practical application with Bit-Tactical (TCL), a hardware accelerator which exploits weight sparsity, per layer precision varia…

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: An earlier version of this work titled "JaZ: Enabling Innovation Towards Chaff-Free Deep Learning Computing" was submitted for blind review

  2. arXiv:1801.08621  [pdf, other]

    cs.LG

    Quantization Error as a Metric for Dynamic Precision Scaling in Neural Net Training

    Authors: Ian Taras, Dylan Malone Stuart

    Abstract: Recent work has explored reduced numerical precision for parameters, activations, and gradients during neural network training as a way to reduce the computational cost of training (Na & Mukhopadhyay, 2016) (Courbariaux et al., 2014). We present a novel dynamic precision scaling (DPS) scheme. Using stochastic fixed-point rounding, a quantization-error based scaling scheme, and dynamic bit-widths d…

    Submitted 24 January, 2019; v1 submitted 25 January, 2018; originally announced January 2018.