Showing 1–11 of 11 results for author: Al-Shedivat, M

  1. arXiv:2102.00127  [pdf, other]

    cs.LG cs.AI stat.ML

    On Data Efficiency of Meta-learning

    Authors: Maruan Al-Shedivat, Liam Li, Eric Xing, Ameet Talwalkar

    Abstract: Meta-learning has enabled learning statistical models that can be quickly adapted to new prediction tasks. Motivated by use-cases in personalized federated learning, we study the often overlooked aspect of the modern meta-learning algorithms -- their data efficiency. To shed more light on which methods are more efficient, we use techniques from algorithmic stability to derive bounds on the transfe…

    Submitted 29 January, 2021; originally announced February 2021.

    Comments: Preliminary version. An updated version is to appear in AISTATS 2021

  2. arXiv:2010.05273  [pdf, other]

    cs.LG cs.AI stat.ML

    Federated Learning via Posterior Averaging: A New Perspective and Practical Algorithms

    Authors: Maruan Al-Shedivat, Jennifer Gillenwater, Eric Xing, Afshin Rostamizadeh

    Abstract: Federated learning is typically approached as an optimization problem, where the goal is to minimize a global loss function by distributing computation across client devices that possess local data and specify different parts of the global objective. We present an alternative perspective and formulate federated learning as a posterior inference problem, where the goal is to infer a global posterio…

    Submitted 29 January, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: ICLR 2021. Code: https://github.com/alshedivat/fedpa

  3. arXiv:2004.03473  [pdf, other]

    cs.LG stat.ML

    Learning from Imperfect Annotations

    Authors: Emmanouil Antonios Platanios, Maruan Al-Shedivat, Eric Xing, Tom Mitchell

    Abstract: Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective, inconsistent, and may contain a variety of human biases. To improve the data quality, practitioners often need to collect multiple annotations per example and aggrega…

    Submitted 7 April, 2020; originally announced April 2020.

  4. arXiv:1906.01431  [pdf, other]

    cs.LG stat.ML

    Regularizing Black-box Models for Improved Interpretability (HILL 2019 Version)

    Authors: Gregory Plumb, Maruan Al-Shedivat, Eric Xing, Ameet Talwalkar

    Abstract: Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, which lack guarantees about their explanation quality. We propose an alternative to these approaches by directly regularizing a black-box model for interpretability at training time. Our approac…

    Submitted 31 May, 2019; originally announced June 2019.

    Comments: Presented at the 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1902.06787

  5. arXiv:1904.02338  [pdf, other]

    cs.LG cs.CL cs.NE stat.ML

    Consistency by Agreement in Zero-shot Neural Machine Translation

    Authors: Maruan Al-Shedivat, Ankur P. Parikh

    Abstract: Generalization and reliability of multilingual translation often highly depend on the amount of available parallel data for each language pair of interest. In this paper, we focus on zero-shot generalization---a challenging setup that tests models on translation directions they have not been optimized for at training time. To solve the problem, we (i) reformulate multilingual translation as probab…

    Submitted 10 April, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: NAACL 2019 (14 pages, 5 figures)

  6. arXiv:1902.06787  [pdf, other]

    cs.LG stat.ML

    Regularizing Black-box Models for Improved Interpretability

    Authors: Gregory Plumb, Maruan Al-Shedivat, Angel Alexander Cabrera, Adam Perer, Eric Xing, Ameet Talwalkar

    Abstract: Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, whose explanation quality can be unpredictable. Our method, ExpO, is a hybridization of these approaches that regularizes a model for explanation quality at training time. Importantly, these reg…

    Submitted 8 November, 2020; v1 submitted 18 February, 2019; originally announced February 2019.

  7. arXiv:1811.06889  [pdf, other]

    cs.LG cs.AI stat.ML

    On the Complexity of Exploration in Goal-Driven Navigation

    Authors: Maruan Al-Shedivat, Lisa Lee, Ruslan Salakhutdinov, Eric Xing

    Abstract: Building agents that can explore their environments intelligently is a challenging open problem. In this paper, we make a step towards understanding how a hierarchical design of the agent's policy can affect its exploration capabilities. First, we design EscapeRoom environments, where the agent must figure out how to navigate to the exit by accomplishing a number of intermediate tasks (\emph{subgo…

    Submitted 16 November, 2018; originally announced November 2018.

    Comments: Relational Representation Learning Workshop (NIPS 2018)

  8. arXiv:1806.06464  [pdf, other]

    cs.MA cs.AI cs.LG cs.NE stat.ML

    Learning Policy Representations in Multiagent Systems

    Authors: Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yura Burda, Harrison Edwards

    Abstract: Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems. Prior work in agent modeling has largely been task-specific and driven by hand-engineering domain-specific prior knowledge. We propose a general learning framework for modeling agent behavior in any multiagent system using only a handful of interaction data. Our framework casts agent model…

    Submitted 31 July, 2018; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  9. arXiv:1705.10301  [pdf, other]

    cs.LG cs.AI stat.ML

    Contextual Explanation Networks

    Authors: Maruan Al-Shedivat, Avinava Dubey, Eric P. Xing

    Abstract: Modern learning algorithms excel at producing accurate but complex models of the data. However, deploying such models in the real-world requires extra care: we must ensure their reliability, robustness, and absence of undesired biases. This motivates the development of models that are equally accurate but can be also easily inspected and assessed beyond their predictive performance. To this end, w…

    Submitted 9 September, 2020; v1 submitted 29 May, 2017; originally announced May 2017.

    Comments: 48 pages, 18 figures, to appear in JMLR

  10. arXiv:1610.08936  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Scalable Deep Kernels with Recurrent Structure

    Authors: Maruan Al-Shedivat, Andrew Gordon Wilson, Yunus Saatchi, Zhiting Hu, Eric P. Xing

    Abstract: Many applications in speech, robotics, finance, and biology deal with sequential data, where ordering matters and recurrent structures are common. However, this structure cannot be easily captured by standard kernel functions. To model such structure, we propose expressive closed-form kernel functions for Gaussian processes. The resulting model, GP-LSTM, fully encapsulates the inductive biases of…

    Submitted 4 October, 2017; v1 submitted 27 October, 2016; originally announced October 2016.

    Comments: 37 pages, 7 figures, 5 tables. Updated to the final version that appears in JMLR, 18(82):1-37, 2017

    Journal ref: Journal of Machine Learning Research (JMLR), JMLR 18(82):1-37, 2017

  11. arXiv:1609.06390  [pdf, other]

    stat.ML cs.LG

    Learning HMMs with Nonparametric Emissions via Spectral Decompositions of Continuous Matrices

    Authors: Kirthevasan Kandasamy, Maruan Al-Shedivat, Eric P. Xing

    Abstract: Recently, there has been a surge of interest in using spectral methods for estimating latent variable models. However, it is usually assumed that the distribution of the observations conditioned on the latent variables is either discrete or belongs to a parametric family. In this paper, we study the estimation of an $m$-state hidden Markov model (HMM) with only smoothness assumptions, such as Höld…

    Submitted 20 September, 2016; originally announced September 2016.

    Comments: To appear in NIPS 2016