Skip to main content

Showing 1–1 of 1 results for author: Joshi, D C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00101  [pdf, other

    cs.LG cs.AI cs.CC cs.DC cs.NE

    Hybrid Approach to Parallel Stochastic Gradient Descent

    Authors: Aakash Sudhirbhai Vora, Dhrumil Chetankumar Joshi, Aksh Kantibhai Patel

    Abstract: Stochastic Gradient Descent is used for large datasets to train models to reduce the training time. On top of that data parallelism is widely used as a method to efficiently train neural networks using multiple worker nodes in parallel. Synchronous and asynchronous approach to data parallelism is used by most systems to train the model in parallel. However, both of them have their drawbacks. We pr… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.