Showing 1–3 of 3 results for author: Kusupati, A

  1. arXiv:2002.03231  [pdf, other]

    cs.LG cs.CV stat.ML

    Soft Threshold Weight Reparameterization for Learnable Sparsity

    Authors: Aditya Kusupati, Vivek Ramanujan, Raghav Somani, Mitchell Wortsman, Prateek Jain, Sham Kakade, Ali Farhadi

    Abstract: Sparsity in Deep Neural Networks (DNNs) is studied extensively with the focus of maximizing prediction accuracy given an overall parameter budget. Existing methods rely on uniform or heuristic non-uniform sparsity budgets which have sub-optimal layer-wise parameter allocation resulting in a) lower prediction accuracy or b) higher inference cost (FLOPs). This work proposes Soft Threshold Reparamete…

    Submitted 22 June, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

    Comments: 19 pages, 10 figures, Published at International Conference on Machine Learning (ICML) 2020

  2. Extreme Regression for Dynamic Search Advertising

    Authors: Yashoteja Prabhu, Aditya Kusupati, Nilesh Gupta, Manik Varma

    Abstract: This paper introduces a new learning paradigm called eXtreme Regression (XR) whose objective is to accurately predict the numerical degrees of relevance of an extremely large number of labels to a data point. XR can provide elegant solutions to many large-scale ranking and recommendation applications including Dynamic Search Advertising (DSA). XR can learn more accurate models than the recently po…

    Submitted 20 January, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: 15 pages, 4 figures, published at WSDM 2020 as a Long Oral

  3. arXiv:1901.02358  [pdf, ps, other]

    cs.LG cs.AI cs.NE stat.ML

    FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network

    Authors: Aditya Kusupati, Manish Singh, Kush Bhatia, Ashish Kumar, Prateek Jain, Manik Varma

    Abstract: This paper develops the FastRNN and FastGRNN algorithms to address the twin RNN limitations of inaccurate training and inefficient prediction. Previous approaches have improved accuracy at the expense of prediction costs making them infeasible for resource-constrained and real-time applications. Unitary RNNs have increased accuracy somewhat by restricting the range of the state transition matrix's…

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: 23 pages, 10 figures, Published at Advances in Neural Information Processing Systems (NeurIPS) 2018