Skip to main content

Showing 1–3 of 3 results for author: U-Chupala, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.02538  [pdf, other

    cs.LG

    Cuttlefish: Low-Rank Model Training without All the Tuning

    Authors: Hongyi Wang, Saurabh Agarwal, Pongsakorn U-chupala, Yoshiki Tanaka, Eric P. Xing, Dimitris Papailiopoulos

    Abstract: Recent research has shown that training low-rank neural networks can effectively reduce the total number of trainable parameters without sacrificing predictive accuracy, resulting in end-to-end speedups. However, low-rank model training necessitates adjusting several additional factorization hyperparameters, such as the rank of the factorization at each layer. In this paper, we tackle this challen… ▽ More

    Submitted 5 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted for presentation at MLSys 2023

  2. arXiv:1811.05233  [pdf

    cs.LG cs.CV

    Massively Distributed SGD: ImageNet/ResNet-50 Training in a Flash

    Authors: Hiroaki Mikami, Hisahiro Suganuma, Pongsakorn U-chupala, Yoshiki Tanaka, Yuichi Kageyama

    Abstract: Scaling the distributed deep learning to a massive GPU cluster level is challenging due to the instability of the large mini-batch training and the overhead of the gradient synchronization. We address the instability of the large mini-batch training with batch-size control and label smoothing. We address the overhead of the gradient synchronization with 2D-Torus all-reduce. Specifically, 2D-Torus… ▽ More

    Submitted 5 March, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

  3. arXiv:1509.08420  [pdf, other

    cs.NI

    PRAGMA-ENT: Exposing SDN Concepts to Domain Scientists in the Pacific Rim

    Authors: Kohei Ichikawa, Mauricio Tsugawa, Jason Haga, Hiroaki Yamanaka, Te-Lung Liu, Yoshiyuki Kido, Pongsakorn U-Chupala, Che Huang, Chawanat Nakasan, Jo-Yu Chang, Li-Chi Ku, Whey-Fone Tsai, Susumu Date, Shinji Shimojo, Philip Papadopoulos, Jose Fortes

    Abstract: The Pacific Rim Application and Grid Middleware Assembly (PRAGMA) is an international community of researchers that actively collaborate to address problems and challenges of common interest in eScience. The PRAGMA Experimental Network Testbed (PRAGMA-ENT) was established with the goal of constructing an international software-defined network (SDN) testbed to offer the necessary networking support… ▽ More

    Submitted 28 September, 2015; originally announced September 2015.

    Comments: 8 pages, 12 figures, PRAGMA-ICDS 2015

    ACM Class: C.2.1; C.2.4