Skip to main content

Showing 1–1 of 1 results for author: Selfridge, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.12252  [pdf, other

    cs.LG cs.DC physics.comp-ph

    Parallelizing non-linear sequential models over the sequence length

    Authors: Yi Heng Lim, Qi Zhu, Joshua Selfridge, Muhammad Firmansyah Kasim

    Abstract: Sequential models, such as Recurrent Neural Networks and Neural Ordinary Differential Equations, have long suffered from slow training due to their inherent sequential nature. For many years this bottleneck has persisted, as many thought sequential models could not be parallelized. We challenge this long-held belief with our parallel algorithm that accelerates GPU evaluation of sequential models b… ▽ More

    Submitted 16 January, 2024; v1 submitted 21 September, 2023; originally announced September 2023.