Showing 1–1 of 1 results for author: Ebrahimi, E

  1. arXiv:1907.13257  [pdf, other]

    cs.LG cs.AI cs.DC stat.ML

    Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training

    Authors: Saptadeep Pal, Eiman Ebrahimi, Arslan Zulfiqar, Yaosheng Fu, Victor Zhang, Szymon Migacz, David Nellans, Puneet Gupta

    Abstract: Deploying deep learning (DL) models across multiple compute devices to train large and complex models continues to grow in importance because of the demand for faster and more frequent training. Data parallelism (DP) is the most widely used parallelization strategy, but as the number of devices in data parallel training grows, so does the communication overhead between devices. Additionally, a lar…

    Submitted 30 July, 2019; originally announced July 2019.