Skip to main content

Showing 1–4 of 4 results for author: Daghaghi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.13583  [pdf, other

    cs.LG

    Adaptive Sampling for Deep Learning via Efficient Nonparametric Proxies

    Authors: Shabnam Daghaghi, Benjamin Coleman, Benito Geordie, Anshumali Shrivastava

    Abstract: Data sampling is an effective method to improve the training speed of neural networks, with recent results demonstrating that it can even break the neural scaling laws. These results critically rely on high-quality scores to estimate the importance of an input to the network. We observe that there are two dominant strategies: static sampling, where the scores are determined before training, and dy… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  2. arXiv:2103.10891  [pdf, other

    cs.LG cs.DC cs.PF

    Accelerating SLIDE Deep Learning on Modern CPUs: Vectorization, Quantizations, Memory Optimizations, and More

    Authors: Shabnam Daghaghi, Nicholas Meisburger, Mengnan Zhao, Yong Wu, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava

    Abstract: Deep learning implementations on CPUs (Central Processing Units) are gaining more traction. Enhanced AI capabilities on commodity x86 architectures are commercially appealing due to the reuse of existing hardware and virtualization ease. A notable work in this direction is the SLIDE system. SLIDE is a C++ implementation of a sparse hash table based back-propagation, which was shown to be significa… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  3. arXiv:2012.15843  [pdf, other

    cs.LG cs.AI cs.DS cs.IR

    A Tale of Two Efficient and Informative Negative Sampling Distributions

    Authors: Shabnam Daghaghi, Tharun Medini, Nicholas Meisburger, Beidi Chen, Mengnan Zhao, Anshumali Shrivastava

    Abstract: Softmax classifiers with a very large number of classes naturally occur in many applications such as natural language processing and information retrieval. The calculation of full softmax is costly from the computational and energy perspective. There have been various sampling approaches to overcome this challenge, popularly known as negative sampling (NS). Ideally, NS should sample negative class… ▽ More

    Submitted 28 July, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: Published at ICML 2021

  4. arXiv:1909.04790  [pdf, other

    cs.CV cs.LG

    SDM-Net: A Simple and Effective Model for Generalized Zero-Shot Learning

    Authors: Shabnam Daghaghi, Tharun Medini, Anshumali Shrivastava

    Abstract: Zero-Shot Learning (ZSL) is a classification task where we do not have even a single training labeled example from a set of unseen classes. Instead, we only have prior information (or description) about seen and unseen classes, often in the form of physically realizable or descriptive attributes. Lack of any single training example from a set of classes prohibits use of standard classification tec… ▽ More

    Submitted 31 December, 2020; v1 submitted 10 September, 2019; originally announced September 2019.