-
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples
Authors:
Eleni Triantafillou,
Tyler Zhu,
Vincent Dumoulin,
Pascal Lamblin,
Utku Evci,
Kelvin Xu,
Ross Goroshin,
Carles Gelada,
Kevin Swersky,
Pierre-Antoine Manzagol,
Hugo Larochelle
Abstract:
Few-shot classification refers to learning a classifier for new classes given only a few examples. While a plethora of models have emerged to tackle it, we find the procedure and datasets that are used to assess their progress lacking. To address this limitation, we propose Meta-Dataset: a new benchmark for training and evaluating models that is large-scale, consists of diverse datasets, and presents more realistic tasks. We experiment with popular baselines and meta-learners on Meta-Dataset, along with a competitive method that we propose. We analyze performance as a function of various characteristics of test tasks and examine the models' ability to leverage diverse training sources for improving their generalization. We also propose a new set of baselines for quantifying the benefit of meta-learning in Meta-Dataset. Our extensive experimentation has uncovered important research challenges and we hope to inspire work in these directions.
Submitted 8 April, 2020; v1 submitted 7 March, 2019;
originally announced March 2019.
-
Automatic differentiation in ML: Where we are and where we should be going
Authors:
Bart van Merriënboer,
Olivier Breuleux,
Arnaud Bergeron,
Pascal Lamblin
Abstract:
We review the current state of automatic differentiation (AD) for array programming in machine learning (ML), including the different approaches such as operator overloading (OO) and source transformation (ST) used for AD, graph-based intermediate representations for programs, and source languages. Based on these insights, we introduce a new graph-based intermediate representation (IR) which specifically aims to efficiently support fully-general AD for array programming. Unlike existing dataflow programming representations in ML frameworks, our IR naturally supports function calls, higher-order functions and recursion, making ML models easier to implement. The ability to represent closures allows us to perform AD using ST without a tape, making the resulting derivative (adjoint) program amenable to ahead-of-time optimization using tools from functional language compilers, and enabling higher-order derivatives. Lastly, we introduce a proof of concept compiler toolchain called Myia which uses a subset of Python as a front end.
Submitted 2 January, 2019; v1 submitted 26 October, 2018;
originally announced October 2018.
-
Pylearn2: a machine learning research library
Authors:
Ian J. Goodfellow,
David Warde-Farley,
Pascal Lamblin,
Vincent Dumoulin,
Mehdi Mirza,
Razvan Pascanu,
James Bergstra,
Frédéric Bastien,
Yoshua Bengio
Abstract:
Pylearn2 is a machine learning research library. This does not just mean that it is a collection of machine learning algorithms that share a common API; it means that it has been designed for flexibility and extensibility in order to facilitate research projects that involve new or unusual use cases. In this paper we give a brief history of the library, an overview of its basic philosophy, a summary of the library's architecture, and a description of how the Pylearn2 community functions socially.
Submitted 19 August, 2013;
originally announced August 2013.