Showing 1–43 of 43 results for author: Vinyals, O

Searching in archive stat.
  1. arXiv:2106.04615  [pdf, other]

    cs.LG cs.AI stat.ML

    Vector Quantized Models for Planning

    Authors: Sherjil Ozair, Yazhe Li, Ali Razavi, Ioannis Antonoglou, Aäron van den Oord, Oriol Vinyals

    Abstract: Recent developments in the field of model-based RL have proven successful in a range of environments, especially ones where planning is essential. However, such successes have been limited to deterministic fully-observed environments. We present a new approach that handles stochastic and partially-observable environments. Our key insight is to use discrete autoencoders to capture the multiple poss…

    Submitted 10 June, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  2. arXiv:2101.11071  [pdf, other]

    cs.LG cs.AI stat.ML

    The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors

    Authors: William H. Guss, Mario Ynocente Castro, Sam Devlin, Brandon Houghton, Noboru Sean Kuno, Crissman Loomis, Stephanie Milani, Sharada Mohanty, Keisuke Nakata, Ruslan Salakhutdinov, John Schulman, Shinya Shiroshita, Nicholay Topin, Avinash Ummadisingu, Oriol Vinyals

    Abstract: Although deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples, affording only a shrinking segment of the AI community access to their development. Resolution of these limitations requires new, sample-efficient methods. To facilitate research in this direction, we propose this second iteration of the MineR…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 37 pages, initial submission, accepted at NeurIPS. arXiv admin note: substantial text overlap with arXiv:1904.10079

  3. arXiv:2007.03629  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Strong Generalization and Efficiency in Neural Programs

    Authors: Yujia Li, Felix Gimeno, Pushmeet Kohli, Oriol Vinyals

    Abstract: We study the problem of learning efficient algorithms that strongly generalize in the framework of neural program induction. By carefully designing the input / output interfaces of the neural model and through imitation, we are able to learn models that produce correct results for arbitrary input sizes, achieving strong generalization. Moreover, by using reinforcement learning, we optimize for pro…

    Submitted 8 July, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

  4. arXiv:2006.06380  [pdf, other]

    stat.ML cs.DS cs.LG

    Pointer Graph Networks

    Authors: Petar Veličković, Lars Buesing, Matthew C. Overlan, Razvan Pascanu, Oriol Vinyals, Charles Blundell

    Abstract: Graph neural networks (GNNs) are typically applied to static graphs that are assumed to be known upfront. This static input structure is often informed purely by insight of the machine learning practitioner, and might not be optimal for the actual task the GNN is solving. In absence of reliable domain expertise, one might resort to inferring the latent graph structure, which is often difficult due…

    Submitted 18 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: To appear at NeurIPS 2020 (Spotlight talk)

  5. arXiv:2003.05012  [pdf, other]

    cs.LG cs.AI stat.ML

    Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning

    Authors: Stephanie Milani, Nicholay Topin, Brandon Houghton, William H. Guss, Sharada P. Mohanty, Keisuke Nakata, Oriol Vinyals, Noboru Sean Kuno

    Abstract: To facilitate research in the direction of sample efficient reinforcement learning, we held the MineRL Competition on Sample Efficient Reinforcement Learning Using Human Priors at the Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019). The primary goal of this competition was to promote the development of algorithms that use human demonstrations alongside reinforcement…

    Submitted 18 June, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: To appear in Proceedings of Machine Learning Research: NeurIPS 2019 Competition & Demonstration Track Postproceedings. 12 pages, 2 figures

  6. arXiv:1910.06464  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder

    Authors: Cristina Gârbacea, Aäron van den Oord, Yazhe Li, Felicia S C Lim, Alejandro Luebs, Oriol Vinyals, Thomas C Walters

    Abstract: In order to efficiently transmit and store speech signals, speech codecs create a minimally redundant representation of the input signal which is then decoded at the receiver with the best possible perceptual quality. In this work we demonstrate that a neural network architecture based on VQ-VAE with a WaveNet decoder can be used to perform very low bit-rate speech coding with high reconstruction…

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: ICASSP 2019

    Journal ref: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 735-739. IEEE, 2019

  7. arXiv:1910.01007  [pdf, other]

    cs.CV cs.LG stat.ML

    Unsupervised Doodling and Painting with Improved SPIRAL

    Authors: John F. J. Mellor, Eunbyung Park, Yaroslav Ganin, Igor Babuschkin, Tejas Kulkarni, Dan Rosenbaum, Andy Ballard, Theophane Weber, Oriol Vinyals, S. M. Ali Eslami

    Abstract: We investigate using reinforcement learning agents as generative models of images (extending arXiv:1804.01118). A generative agent controls a simulated painting environment, and is trained with rewards provided by a discriminator network simultaneously trained to assess the realism of the agent's samples, either unconditional or reconstructions. Compared to prior work, we make a number of improvem…

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: See https://learning-to-paint.github.io for an interactive version of this paper, with videos

    ACM Class: I.2; I.4

  8. arXiv:1909.09157  [pdf, other]

    cs.LG stat.ML

    Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML

    Authors: Aniruddh Raghu, Maithra Raghu, Samy Bengio, Oriol Vinyals

    Abstract: An important research direction in machine learning has centered around developing meta-learning algorithms to tackle few-shot learning. An especially successful algorithm has been Model Agnostic Meta-Learning (MAML), a method that consists of two optimization loops, with the outer loop finding a meta-initialization, from which the inner loop can efficiently learn new tasks. Despite MAML's popular…

    Submitted 12 February, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

    Comments: ICLR 2020

  9. arXiv:1906.00446  [pdf, other]

    cs.LG cs.CV stat.ML

    Generating Diverse High-Fidelity Images with VQ-VAE-2

    Authors: Ali Razavi, Aaron van den Oord, Oriol Vinyals

    Abstract: We explore the use of Vector Quantized Variational AutoEncoder (VQ-VAE) models for large scale image generation. To this end, we scale and enhance the autoregressive priors used in VQ-VAE to generate synthetic samples of much higher coherence and fidelity than possible before. We use simple feed-forward encoder and decoder networks, making our model an attractive candidate for applications where t…

    Submitted 2 June, 2019; originally announced June 2019.

  10. arXiv:1905.10887  [pdf, other]

    cs.LG stat.ML

    Classification Accuracy Score for Conditional Generative Models

    Authors: Suman Ravuri, Oriol Vinyals

    Abstract: Deep generative models (DGMs) of images are now sufficiently mature that they produce nearly photorealistic samples and obtain scores similar to the data distribution on heuristics such as Frechet Inception Distance (FID). These results, especially on large-scale datasets such as ImageNet, suggest that DGMs are learning the data distribution in a perceptually meaningful space and can be used in do…

    Submitted 28 October, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

  11. arXiv:1905.02494  [pdf, other]

    cs.LG stat.ML

    Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs

    Authors: Aditya Paliwal, Felix Gimeno, Vinod Nair, Yujia Li, Miles Lubin, Pushmeet Kohli, Oriol Vinyals

    Abstract: We present a deep reinforcement learning approach to minimizing the execution cost of neural network computation graphs in an optimizing compiler. Unlike earlier learning-based works that require training the optimizer on the same graph to be optimized, we propose a learning approach that trains an optimizer offline and then generalizes to previously unseen graphs without further training. This al…

    Submitted 10 February, 2020; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: Accepted to ICLR 2020 https://openreview.net/forum?id=rkxDoJBYPB

  12. arXiv:1904.12787  [pdf, other]

    cs.LG stat.ML

    Graph Matching Networks for Learning the Similarity of Graph Structured Objects

    Authors: Yujia Li, Chenjie Gu, Thomas Dullien, Oriol Vinyals, Pushmeet Kohli

    Abstract: This paper addresses the challenging problem of retrieval and matching of graph structured objects, and makes two key contributions. First, we demonstrate how Graph Neural Networks (GNN), which have emerged as an effective model for various supervised prediction problems defined on structured data, can be trained to produce embedding of graphs in vector spaces that enables efficient similarity rea…

    Submitted 12 May, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

    Comments: Accepted as a conference paper at ICML 2019

  13. arXiv:1901.05761  [pdf, other]

    cs.LG stat.ML

    Attentive Neural Processes

    Authors: Hyunjik Kim, Andriy Mnih, Jonathan Schwarz, Marta Garnelo, Ali Eslami, Dan Rosenbaum, Oriol Vinyals, Yee Whye Teh

    Abstract: Neural Processes (NPs) (Garnelo et al 2018a;b) approach regression by learning to map a context set of observed input-output pairs to a distribution over regression functions. Each function models the distribution of the output given an input, conditioned on the context. NPs have the benefit of fitting observed data efficiently with linear complexity in the number of context input-output pairs, an…

    Submitted 9 July, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

  14. arXiv:1901.03416  [pdf, other]

    cs.LG stat.ML

    Preventing Posterior Collapse with delta-VAEs

    Authors: Ali Razavi, Aäron van den Oord, Ben Poole, Oriol Vinyals

    Abstract: Due to the phenomenon of "posterior collapse," current latent variable generative models pose a challenging design choice that either weakens the capacity of the decoder or requires augmenting the objective so it does not only maximize the likelihood of the data. In this paper, we propose an alternative that utilizes the most powerful generative models as decoders, whilst optimising the variationa…

    Submitted 10 January, 2019; originally announced January 2019.

  15. arXiv:1812.00898  [pdf, other]

    cs.LG cs.CL cs.CV stat.ML

    Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

    Authors: Aishwarya Agrawal, Mateusz Malinowski, Felix Hill, Ali Eslami, Oriol Vinyals, Tejas Kulkarni

    Abstract: Advances in Deep Reinforcement Learning have led to agents that perform well across a variety of sensory-motor domains. In this work, we study the setting in which an agent must learn to generate programs for diverse scenes conditioned on a given symbolic instruction. Final goals are specified to our agent via images of the scenes. A symbolic instruction consistent with the goal images is used as…

    Submitted 3 December, 2018; originally announced December 2018.

  16. arXiv:1809.10460  [pdf, other]

    cs.LG cs.SD stat.ML

    Sample Efficient Adaptive Text-to-Speech

    Authors: Yutian Chen, Yannis Assael, Brendan Shillingford, David Budden, Scott Reed, Heiga Zen, Quan Wang, Luis C. Cobo, Andrew Trask, Ben Laurie, Caglar Gulcehre, Aäron van den Oord, Oriol Vinyals, Nando de Freitas

    Abstract: We present a meta-learning approach for adaptive text-to-speech (TTS) with few data. During training, we learn a multi-speaker model using a shared conditional WaveNet core and independent learned embeddings for each speaker. The aim of training is not to produce a neural network with fixed weights, which is then deployed as a TTS system. Instead, the aim is to produce a network that requires few…

    Submitted 16 January, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: Accepted by ICLR 2019

  17. arXiv:1807.05960  [pdf, other]

    cs.LG cs.CV stat.ML

    Meta-Learning with Latent Embedding Optimization

    Authors: Andrei A. Rusu, Dushyant Rao, Jakub Sygnowski, Oriol Vinyals, Razvan Pascanu, Simon Osindero, Raia Hadsell

    Abstract: Gradient-based meta-learning techniques are both widely applicable and proficient at solving challenging few-shot learning and fast adaptation problems. However, they have practical difficulties when operating on high-dimensional parameter spaces in extreme low-data regimes. We show that it is possible to bypass these limitations by learning a data-dependent latent generative representation of mod…

    Submitted 26 March, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

  18. arXiv:1807.03819  [pdf, other]

    cs.CL cs.LG stat.ML

    Universal Transformers

    Authors: Mostafa Dehghani, Stephan Gouws, Oriol Vinyals, Jakob Uszkoreit, Łukasz Kaiser

    Abstract: Recurrent neural networks (RNNs) sequentially process data by updating their state with each new data point, and have long been the de facto choice for sequence modeling tasks. However, their inherently sequential computation makes them slow to train. Feed-forward and convolutional architectures have recently been shown to achieve superior results on some sequence modeling tasks such as machine tr…

    Submitted 5 March, 2019; v1 submitted 10 July, 2018; originally announced July 2018.

    Comments: Published at ICLR2019

  19. arXiv:1807.03748  [pdf, other]

    cs.LG stat.ML

    Representation Learning with Contrastive Predictive Coding

    Authors: Aaron van den Oord, Yazhe Li, Oriol Vinyals

    Abstract: While supervised learning has enabled great progress in many applications, unsupervised learning has not seen such widespread adoption, and remains an important and challenging endeavor for artificial intelligence. In this work, we propose a universal unsupervised learning approach to extract useful representations from high-dimensional data, which we call Contrastive Predictive Coding. The key in…

    Submitted 22 January, 2019; v1 submitted 10 July, 2018; originally announced July 2018.

  20. arXiv:1806.11006  [pdf, other]

    cs.LG stat.ML

    Learning Implicit Generative Models with the Method of Learned Moments

    Authors: Suman Ravuri, Shakir Mohamed, Mihaela Rosca, Oriol Vinyals

    Abstract: We propose a method of moments (MoM) algorithm for training large-scale implicit generative models. Moment estimation in this setting encounters two problems: it is often difficult to define the millions of moments needed to learn the model parameters, and it is hard to determine which properties are useful when specifying moments. To address the first issue, we introduce a moment network, and def…

    Submitted 28 June, 2018; originally announced June 2018.

    Comments: ICML 2018, 6 figures, 17 pages

  21. arXiv:1806.01830  [pdf, other]

    cs.LG stat.ML

    Relational Deep Reinforcement Learning

    Authors: Vinicius Zambaldi, David Raposo, Adam Santoro, Victor Bapst, Yujia Li, Igor Babuschkin, Karl Tuyls, David Reichert, Timothy Lillicrap, Edward Lockhart, Murray Shanahan, Victoria Langston, Razvan Pascanu, Matthew Botvinick, Oriol Vinyals, Peter Battaglia

    Abstract: We introduce an approach for deep reinforcement learning (RL) that improves upon the efficiency, generalization capacity, and interpretability of conventional approaches through structured perception and relational reasoning. It uses self-attention to iteratively reason about the relations between entities in a scene and to guide a model-free policy. Our results show that in a novel navigation and…

    Submitted 28 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  22. arXiv:1806.01822  [pdf, other]

    cs.LG stat.ML

    Relational recurrent neural networks

    Authors: Adam Santoro, Ryan Faulkner, David Raposo, Jack Rae, Mike Chrzanowski, Theophane Weber, Daan Wierstra, Oriol Vinyals, Razvan Pascanu, Timothy Lillicrap

    Abstract: Memory-based neural networks model temporal data by leveraging an ability to remember information for long periods. It is unclear, however, whether they also have an ability to perform complex relational reasoning with the information they remember. Here, we first confirm our intuitions that standard memory architectures may struggle at tasks that heavily involve an understanding of the ways in wh…

    Submitted 28 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  23. arXiv:1806.01261  [pdf, other]

    cs.LG cs.AI stat.ML

    Relational inductive biases, deep learning, and graph networks

    Authors: Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, et al. (2 additional authors not shown)

    Abstract: Artificial intelligence (AI) has undergone a renaissance recently, making major progress in key domains such as vision, language, control, and decision-making. This has been due, in part, to cheap data and cheap compute resources, which have fit the natural strengths of deep learning. However, many defining characteristics of human intelligence, which developed under much different pressures, rema…

    Submitted 17 October, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

  24. arXiv:1804.06893  [pdf, other]

    cs.LG stat.ML

    A Study on Overfitting in Deep Reinforcement Learning

    Authors: Chiyuan Zhang, Oriol Vinyals, Remi Munos, Samy Bengio

    Abstract: Recent years have witnessed significant progresses in deep Reinforcement Learning (RL). Empowered with large scale neural networks, carefully designed architectures, novel training algorithms and massively parallel computing devices, researchers are able to attack many challenging RL problems. However, in machine learning, more training power comes with a potential risk of more overfitting. As dee…

    Submitted 20 April, 2018; v1 submitted 18 April, 2018; originally announced April 2018.

  25. arXiv:1804.01118  [pdf, other]

    cs.CV cs.LG stat.ML

    Synthesizing Programs for Images using Reinforced Adversarial Learning

    Authors: Yaroslav Ganin, Tejas Kulkarni, Igor Babuschkin, S. M. Ali Eslami, Oriol Vinyals

    Abstract: Advances in deep generative networks have led to impressive results in recent years. Nevertheless, such models can often waste their capacity on the minutiae of datasets, presumably due to weak inductive biases in their decoders. This is where graphics engines may come in handy since they abstract away low-level details and represent images as high-level programs. Current methods that combine deep…

    Submitted 3 April, 2018; originally announced April 2018.

    Comments: 12 pages, 13 figures

  26. arXiv:1803.03324  [pdf, other]

    cs.LG stat.ML

    Learning Deep Generative Models of Graphs

    Authors: Yujia Li, Oriol Vinyals, Chris Dyer, Razvan Pascanu, Peter Battaglia

    Abstract: Graphs are fundamental data structures which concisely capture the relational structure in many important real-world domains, such as knowledge graphs, physical and social interactions, language, and chemistry. Here we introduce a powerful new approach for learning generative models over graphs, which can capture both their structure and attributes. Our approach uses graph neural networks to expre…

    Submitted 8 March, 2018; originally announced March 2018.

    Comments: 21 pages

  27. arXiv:1802.10542  [pdf, other]

    stat.ML cs.LG

    Memory-based Parameter Adaptation

    Authors: Pablo Sprechmann, Siddhant M. Jayakumar, Jack W. Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Benigno Uria, Oriol Vinyals, Demis Hassabis, Razvan Pascanu, Charles Blundell

    Abstract: Deep neural networks have excelled on a wide range of problems, from vision to language and game playing. Neural networks very gradually incorporate information into weights as they process data, requiring very low learning rates. If the training distribution shifts, the network is slow to adapt, and when it does adapt, it typically performs badly on the training distribution before the shift. Our…

    Submitted 28 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

  28. arXiv:1802.04697  [pdf, other]

    cs.AI cs.LG stat.ML

    Learning to Search with MCTSnets

    Authors: Arthur Guez, Théophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Rémi Munos, David Silver

    Abstract: Planning problems are among the most important and well-studied problems in artificial intelligence. They are most typically solved by tree search algorithms that simulate ahead into the future, evaluate future states, and back-up those evaluations to the root of a search tree. Among these algorithms, Monte-Carlo tree search (MCTS) is one of the most general, powerful and widely used. A typical im…

    Submitted 17 July, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: ICML 2018 (camera-ready version)

  29. arXiv:1711.00436  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Hierarchical Representations for Efficient Architecture Search

    Authors: Hanxiao Liu, Karen Simonyan, Oriol Vinyals, Chrisantha Fernando, Koray Kavukcuoglu

    Abstract: We explore efficient neural architecture search methods and show that a simple yet powerful evolutionary algorithm can discover new architectures with excellent performance. Our approach combines a novel hierarchical genetic representation scheme that imitates the modularized design pattern commonly adopted by human experts, and an expressive search space that supports complex topologies. Our algo…

    Submitted 22 February, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: Accepted as a conference paper at ICLR 2018

  30. arXiv:1707.06203  [pdf, other]

    cs.LG cs.AI stat.ML

    Imagination-Augmented Agents for Deep Reinforcement Learning

    Authors: Théophane Weber, Sébastien Racanière, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adria Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter Battaglia, Demis Hassabis, David Silver, Daan Wierstra

    Abstract: We introduce Imagination-Augmented Agents (I2As), a novel architecture for deep reinforcement learning combining model-free and model-based aspects. In contrast to most existing model-based reinforcement learning and planning methods, which prescribe how a model should be used to arrive at a policy, I2As learn to interpret predictions from a learned environment model to construct implicit plans in…

    Submitted 14 February, 2018; v1 submitted 19 July, 2017; originally announced July 2017.

  31. arXiv:1707.06170  [pdf, other]

    cs.AI cs.LG cs.NE stat.ML

    Learning model-based planning from scratch

    Authors: Razvan Pascanu, Yujia Li, Oriol Vinyals, Nicolas Heess, Lars Buesing, Sebastien Racanière, David Reichert, Théophane Weber, Daan Wierstra, Peter Battaglia

    Abstract: Conventional wisdom holds that model-based planning is a powerful approach to sequential decision-making. It is often very challenging in practice, however, because while a model can be used to evaluate a plan, it does not prescribe how to construct a plan. Here we introduce the "Imagination-based Planner", the first model-based, sequential decision-making agent that can learn to construct, evalua…

    Submitted 19 July, 2017; originally announced July 2017.

  32. arXiv:1704.02798  [pdf, other]

    cs.LG stat.ML

    Bayesian Recurrent Neural Networks

    Authors: Meire Fortunato, Charles Blundell, Oriol Vinyals

    Abstract: In this work we explore a straightforward variational Bayes scheme for Recurrent Neural Networks. Firstly, we show that a simple adaptation of truncated backpropagation through time can yield good quality uncertainty estimates and superior regularisation at only a small extra computational cost during training, also reducing the amount of parameters by 80%. Secondly, we demonstrate how a novel ki…

    Submitted 9 May, 2019; v1 submitted 10 April, 2017; originally announced April 2017.

    Comments: 12th Women in Machine Learning Workshop (WiML 2017), co-located with the 31st Conference on Neural Information Processing Systems (NeurIPS 2017), Long Beach, CA, USA

  33. arXiv:1703.01988  [pdf, other]

    cs.LG stat.ML

    Neural Episodic Control

    Authors: Alexander Pritzel, Benigno Uria, Sriram Srinivasan, Adrià Puigdomènech, Oriol Vinyals, Demis Hassabis, Daan Wierstra, Charles Blundell

    Abstract: Deep reinforcement learning methods attain super-human performance in a wide range of environments. Such methods are grossly inefficient, often taking orders of magnitudes more data than humans to achieve reasonable performance. We propose Neural Episodic Control: a deep reinforcement learning agent that is able to rapidly assimilate new experiences and act upon them. Our agent uses a semi-tabular…

    Submitted 6 March, 2017; originally announced March 2017.

  34. arXiv:1610.01945  [pdf, ps, other]

    cs.LG stat.ML

    Connecting Generative Adversarial Networks and Actor-Critic Methods

    Authors: David Pfau, Oriol Vinyals

    Abstract: Both generative adversarial networks (GAN) in unsupervised learning and actor-critic methods in reinforcement learning (RL) have gained a reputation for being difficult to optimize. Practitioners in both fields have amassed a large number of strategies to mitigate these instabilities and improve training. Here we show that GANs can be viewed as actor-critic methods in an environment where the acto…

    Submitted 18 January, 2017; v1 submitted 6 October, 2016; originally announced October 2016.

    Comments: Added comments on inverse reinforcement learning

  35. arXiv:1606.04080  [pdf, other]

    cs.LG stat.ML

    Matching Networks for One Shot Learning

    Authors: Oriol Vinyals, Charles Blundell, Timothy Lillicrap, Koray Kavukcuoglu, Daan Wierstra

    Abstract: Learning from a few examples remains a key challenge in machine learning. Despite recent advances in important domains such as vision and language, the standard supervised deep learning paradigm does not offer a satisfactory solution for learning new concepts rapidly from little data. In this work, we employ ideas from metric learning based on deep neural features and from recent advances that aug…

    Submitted 29 December, 2017; v1 submitted 13 June, 2016; originally announced June 2016.

  36. arXiv:1511.06391  [pdf, other]

    stat.ML cs.CL cs.LG

    Order Matters: Sequence to sequence for sets

    Authors: Oriol Vinyals, Samy Bengio, Manjunath Kudlur

    Abstract: Sequences have become first class citizens in supervised learning thanks to the resurgence of recurrent neural networks. Many complex tasks that require mapping from or to a sequence of observations can now be formulated with the sequence-to-sequence (seq2seq) framework which employs the chain rule to efficiently represent the joint probability of sequences. In many cases, however, variable sized…

    Submitted 23 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Accepted as a conference paper at ICLR 2015

  37. arXiv:1511.06114  [pdf, ps, other]

    cs.LG cs.CL stat.ML

    Multi-task Sequence to Sequence Learning

    Authors: Minh-Thang Luong, Quoc V. Le, Ilya Sutskever, Oriol Vinyals, Lukasz Kaiser

    Abstract: Sequence to sequence learning has recently emerged as a new paradigm in supervised learning. To date, most of its applications focused on only one task and not much work explored this framework for multiple tasks. This paper examines three multi-task learning (MTL) settings for sequence to sequence models: (a) the one-to-many setting - where the encoder is shared between several tasks such as machi…

    Submitted 1 March, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: 10 pages, 4 figures, ICLR 2016 camera-ready, added parsing SOTA results

  38. arXiv:1508.01211  [pdf, other]

    cs.CL cs.LG cs.NE stat.ML

    Listen, Attend and Spell

    Authors: William Chan, Navdeep Jaitly, Quoc V. Le, Oriol Vinyals

    Abstract: We present Listen, Attend and Spell (LAS), a neural network that learns to transcribe speech utterances to characters. Unlike traditional DNN-HMM models, this model learns all the components of a speech recognizer jointly. Our system has two components: a listener and a speller. The listener is a pyramidal recurrent network encoder that accepts filter bank spectra as inputs. The speller is an atte…

    Submitted 19 August, 2015; v1 submitted 5 August, 2015; originally announced August 2015.

  39. arXiv:1506.03134  [pdf, other]

    stat.ML cs.CG cs.LG cs.NE

    Pointer Networks

    Authors: Oriol Vinyals, Meire Fortunato, Navdeep Jaitly

    Abstract: We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by existent approaches such as sequence-to-sequence and Neural Turing Machines, because the number of target classes in each step of the output depends on the length of t…

    Submitted 2 January, 2017; v1 submitted 9 June, 2015; originally announced June 2015.

  40. arXiv:1503.02531  [pdf, ps, other]

    stat.ML cs.LG cs.NE

    Distilling the Knowledge in a Neural Network

    Authors: Geoffrey Hinton, Oriol Vinyals, Jeff Dean

    Abstract: A very simple way to improve the performance of almost any machine learning algorithm is to train many different models on the same data and then to average their predictions. Unfortunately, making predictions using a whole ensemble of models is cumbersome and may be too computationally expensive to allow deployment to a large number of users, especially if the individual models are large neural n…

    Submitted 9 March, 2015; originally announced March 2015.

    Comments: NIPS 2014 Deep Learning Workshop

  41. arXiv:1412.7449  [pdf, other]

    cs.CL cs.LG stat.ML

    Grammar as a Foreign Language

    Authors: Oriol Vinyals, Lukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever, Geoffrey Hinton

    Abstract: Syntactic constituency parsing is a fundamental problem in natural language processing and has been the subject of intensive research and engineering for decades. As a result, the most accurate parsers are domain specific, complex, and inefficient. In this paper we show that the domain agnostic attention-enhanced sequence-to-sequence model achieves state-of-the-art results on the most widely used…

    Submitted 9 June, 2015; v1 submitted 23 December, 2014; originally announced December 2014.

  42. arXiv:1412.6544  [pdf, other]

    cs.NE cs.LG stat.ML

    Qualitatively characterizing neural network optimization problems

    Authors: Ian J. Goodfellow, Oriol Vinyals, Andrew M. Saxe

    Abstract: Training neural networks involves solving large-scale non-convex optimization problems. This task has long been believed to be extremely difficult, with fear of local minima and other obstacles motivating a variety of schemes to improve optimization, such as unsupervised pretraining. However, modern neural networks are able to achieve negligible training error on complex tasks, using only direct t…

    Submitted 21 May, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

  43. arXiv:1111.4259  [pdf, ps, other]

    stat.ML math.OC

    Krylov Subspace Descent for Deep Learning

    Authors: Oriol Vinyals, Daniel Povey

    Abstract: In this paper, we propose a second order optimization method to learn models where both the dimensionality of the parameter space and the number of training samples is high. In our method, we construct on each iteration a Krylov subspace formed by the gradient and an approximation to the Hessian matrix, and then use a subset of the training data samples to optimize over this subspace. As with the…

    Submitted 17 November, 2011; originally announced November 2011.