Skip to main content

Showing 1–1 of 1 results for author: Herbordt, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:1901.01007  [pdf, other

    cs.LG cs.AR cs.DC stat.ML

    FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters

    Authors: Tong Geng, Tianqi Wang, Ang Li, Xi Jin, Martin Herbordt

    Abstract: Deep Neural Networks (DNNs) have revolutionized numerous applications, but the demand for ever more performance remains unabated. Scaling DNN computations to larger clusters is generally done by distributing tasks in batch mode using methods such as distributed synchronous SGD. Among the issues with this approach is that to make the distributed cluster work with high utilization, the workload dist… ▽ More

    Submitted 21 June, 2020; v1 submitted 4 January, 2019; originally announced January 2019.

    Comments: Accepted by IEEE TRANSACTIONS ON COMPUTERS (TC)