Skip to main content

Showing 1–4 of 4 results for author: Shrivastava, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2003.11768   

    cs.LG cs.AI cs.SE stat.ML

    On-the-Fly Adaptation of Source Code Models using Meta-Learning

    Authors: Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

    Abstract: The ability to adapt to unseen, local contexts is an important challenge that successful models of source code must overcome. One of the most popular approaches for the adaptation of such models is dynamic evaluation. With dynamic evaluation, when running a model on an unseen file, the model is updated immediately after having observed each token in that file. In this work, we propose instead to f… ▽ More

    Submitted 19 September, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: This paper has been withdrawn because we found a bug in the FOMAML implementation that invalidates some of the key claims in the paper

  2. arXiv:1906.03574  [pdf, other

    cs.LG cs.AI stat.ML

    Transfer Learning by Modeling a Distribution over Policies

    Authors: Disha Shrivastava, Eeshan Gunesh Dhekane, Riashat Islam

    Abstract: Exploration and adaptation to new tasks in a transfer learning setup is a central challenge in reinforcement learning. In this work, we build on the idea of modeling a distribution over policies in a Bayesian deep reinforcement learning setup to propose a transfer strategy. Recent works have shown to induce diversity in the learned policies by maximizing the entropy of a distribution of policies (… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: Accepted at the ICML 2019 workshop on Multi-Task and Lifelong Reinforcement Learning

  3. arXiv:1708.05840  [pdf, other

    stat.ML cs.AI cs.CV cs.DC cs.LG

    A Data and Model-Parallel, Distributed and Scalable Framework for Training of Deep Networks in Apache Spark

    Authors: Disha Shrivastava, Santanu Chaudhury, Dr. Jayadeva

    Abstract: Training deep networks is expensive and time-consuming with the training period increasing with data size and growth in model parameters. In this paper, we provide a framework for distributed training of deep networks over a cluster of CPUs in Apache Spark. The framework implements both Data Parallelism and Model Parallelism making it suitable to use for deep networks which require huge training d… ▽ More

    Submitted 19 August, 2017; originally announced August 2017.

    Comments: 12 pages

  4. arXiv:1707.05499  [pdf, other

    cs.LG cs.AI stat.ML

    A Machine Learning Approach for Evaluating Creative Artifacts

    Authors: Disha Shrivastava, Saneem Ahmed CG, Anirban Laha, Karthik Sankaranarayanan

    Abstract: Much work has been done in understanding human creativity and defining measures to evaluate creativity. This is necessary mainly for the reason of having an objective and automatic way of quantifying creative artifacts. In this work, we propose a regression-based learning framework which takes into account quantitatively the essential criteria for creativity like novelty, influence, value and unex… ▽ More

    Submitted 18 July, 2017; originally announced July 2017.

    Comments: Accepted at SIGKDD Workshop on Machine Learning for Creativity (ML4Creativity), 2017