Skip to main content

Showing 1–50 of 51 results for author: Goodfellow, I

Searching in archive stat. Search in all archives.
.
  1. arXiv:2003.02365  [pdf, other

    cs.CV eess.IV stat.ML

    Creating High Resolution Images with a Latent Adversarial Generator

    Authors: David Berthelot, Peyman Milanfar, Ian Goodfellow

    Abstract: Generating realistic images is difficult, and many formulations for this task have been proposed recently. If we restrict the task to that of generating a particular class of images, however, the task becomes more tractable. That is to say, instead of generating an arbitrary image as a sample from the manifold of natural images, we propose to sample images from a particular "subspace" of natural i… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

  2. arXiv:1905.02249  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    MixMatch: A Holistic Approach to Semi-Supervised Learning

    Authors: David Berthelot, Nicholas Carlini, Ian Goodfellow, Nicolas Papernot, Avital Oliver, Colin Raffel

    Abstract: Semi-supervised learning has proven to be a powerful paradigm for leveraging unlabeled data to mitigate the reliance on large labeled datasets. In this work, we unify the current dominant approaches for semi-supervised learning to produce a new algorithm, MixMatch, that works by guessing low-entropy labels for data-augmented unlabeled examples and mixing labeled and unlabeled data using MixUp. We… ▽ More

    Submitted 23 October, 2019; v1 submitted 6 May, 2019; originally announced May 2019.

  3. arXiv:1903.10346  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition

    Authors: Yao Qin, Nicholas Carlini, Ian Goodfellow, Garrison Cottrell, Colin Raffel

    Abstract: Adversarial examples are inputs to machine learning models designed by an adversary to cause an incorrect output. So far, adversarial examples have been studied most extensively in the image domain. In this domain, adversarial examples can be constructed by imperceptibly modifying images to cause misclassification, and are practical in the physical world. In contrast, current targeted adversarial… ▽ More

    Submitted 7 June, 2019; v1 submitted 22 March, 2019; originally announced March 2019.

    Comments: International Conference on Machine Learning (ICML), 2019

  4. arXiv:1903.06293  [pdf, ps, other

    cs.LG stat.ML

    A Research Agenda: Dynamic Models to Defend Against Correlated Attacks

    Authors: Ian Goodfellow

    Abstract: In this article I describe a research agenda for securing machine learning models against adversarial inputs at test time. This article does not present results but instead shares some of my thoughts about where I think that the field needs to go. Modern machine learning works very well on I.I.D. data: data for which each example is drawn {\em independently} and for which the distribution generati… ▽ More

    Submitted 14 March, 2019; originally announced March 2019.

  5. arXiv:1902.06705  [pdf, ps, other

    cs.LG cs.CR stat.ML

    On Evaluating Adversarial Robustness

    Authors: Nicholas Carlini, Anish Athalye, Nicolas Papernot, Wieland Brendel, Jonas Rauber, Dimitris Tsipras, Ian Goodfellow, Aleksander Madry, Alexey Kurakin

    Abstract: Correctly evaluating defenses against adversarial examples has proven to be extremely difficult. Despite the significant amount of recent work attempting to design defenses that withstand adaptive attacks, few have succeeded; most papers that propose defenses are quickly shown to be incorrect. We believe a large contributing factor is the difficulty of performing security evaluations. In this pa… ▽ More

    Submitted 20 February, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

    Comments: Living document; source available at https://github.com/evaluating-adversarial-robustness/adv-eval-paper/

  6. arXiv:1811.05512  [pdf, other

    cs.LG stat.ML

    A domain agnostic measure for monitoring and evaluating GANs

    Authors: Paulina Grnarova, Kfir Y Levy, Aurelien Lucchi, Nathanael Perraudin, Ian Goodfellow, Thomas Hofmann, Andreas Krause

    Abstract: Generative Adversarial Networks (GANs) have shown remarkable results in modeling complex distributions, but their evaluation remains an unsettled issue. Evaluations are essential for: (i) relative assessment of different models and (ii) monitoring the progress of a single model throughout training. The latter cannot be determined by simply inspecting the generator and discriminator loss curves as… ▽ More

    Submitted 15 July, 2020; v1 submitted 13 November, 2018; originally announced November 2018.

  7. arXiv:1810.06758  [pdf, other

    stat.ML cs.LG

    Discriminator Rejection Sampling

    Authors: Samaneh Azadi, Catherine Olsson, Trevor Darrell, Ian Goodfellow, Augustus Odena

    Abstract: We propose a rejection sampling scheme using the discriminator of a GAN to approximately correct errors in the GAN generator distribution. We show that under quite strict assumptions, this will allow us to recover the data distribution exactly. We then examine where those strict assumptions break down and design a practical algorithm - called Discriminator Rejection Sampling (DRS) - that can be us… ▽ More

    Submitted 26 February, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Published as a conference paper at ICLR 2019

  8. arXiv:1810.03307  [pdf, other

    cs.CV cs.LG stat.ML

    Local Explanation Methods for Deep Neural Networks Lack Sensitivity to Parameter Values

    Authors: Julius Adebayo, Justin Gilmer, Ian Goodfellow, Been Kim

    Abstract: Explaining the output of a complicated machine learning model like a deep neural network (DNN) is a central challenge in machine learning. Several proposed local explanation methods address this issue by identifying what dimensions of a single input are most responsible for a DNN's output. The goal of this work is to assess the sensitivity of local explanations to DNN parameter values. Somewhat su… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: Workshop Track International Conference on Learning Representations (ICLR)

  9. arXiv:1810.03292  [pdf, other

    cs.CV cs.LG stat.ML

    Sanity Checks for Saliency Maps

    Authors: Julius Adebayo, Justin Gilmer, Michael Muelly, Ian Goodfellow, Moritz Hardt, Been Kim

    Abstract: Saliency methods have emerged as a popular tool to highlight features in an input deemed relevant for the prediction of a learned model. Several saliency methods have been proposed, often guided by visual appeal on image data. In this work, we propose an actionable methodology to evaluate what kinds of explanations a given method can and cannot provide. We find that reliance, solely, on visual ass… ▽ More

    Submitted 6 November, 2020; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: Updating Guided Backprop experiments due to bug. The results and conclusions remain the same

  10. arXiv:1809.08352  [pdf, other

    stat.ML cs.CV cs.LG

    Unrestricted Adversarial Examples

    Authors: Tom B. Brown, Nicholas Carlini, Chiyuan Zhang, Catherine Olsson, Paul Christiano, Ian Goodfellow

    Abstract: We introduce a two-player contest for evaluating the safety and robustness of machine learning systems, with a large prize pool. Unlike most prior work in ML robustness, which studies norm-constrained adversaries, we shift our focus to unconstrained adversaries. Defenders submit machine learning models, and try to achieve high accuracy and coverage on non-adversarial data while making no confident… ▽ More

    Submitted 21 September, 2018; originally announced September 2018.

  11. arXiv:1808.04888  [pdf, other

    stat.ML cs.LG

    Skill Rating for Generative Models

    Authors: Catherine Olsson, Surya Bhupatiraju, Tom Brown, Augustus Odena, Ian Goodfellow

    Abstract: We explore a new way to evaluate generative models using insights from evaluation of competitive games between human players. We show experimentally that tournaments between generators and discriminators provide an effective way to evaluate generative models. We introduce two methods for summarizing tournament outcomes: tournament win rate and skill rating. Evaluations are useful in different cont… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

  12. arXiv:1807.10875  [pdf, other

    stat.ML cs.LG

    TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing

    Authors: Augustus Odena, Ian Goodfellow

    Abstract: Machine learning models are notoriously difficult to interpret and debug. This is particularly true of neural networks. In this work, we introduce automated software testing techniques for neural networks that are well-suited to discovering errors which occur only for rare inputs. Specifically, we develop coverage-guided fuzzing (CGF) methods for neural networks. In CGF, random mutations of inputs… ▽ More

    Submitted 27 July, 2018; originally announced July 2018.

    Comments: Preprint - work in progress

  13. arXiv:1807.07543  [pdf, other

    cs.LG stat.ML

    Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer

    Authors: David Berthelot, Colin Raffel, Aurko Roy, Ian Goodfellow

    Abstract: Autoencoders provide a powerful framework for learning compressed representations by encoding all of the information needed to reconstruct a data point in a latent code. In some cases, autoencoders can "interpolate": By decoding the convex combination of the latent codes for two datapoints, the autoencoder can produce an output which semantically mixes characteristics from the datapoints. In this… ▽ More

    Submitted 23 July, 2018; v1 submitted 19 July, 2018; originally announced July 2018.

  14. arXiv:1807.06732  [pdf, other

    cs.LG stat.ML

    Motivating the Rules of the Game for Adversarial Example Research

    Authors: Justin Gilmer, Ryan P. Adams, Ian Goodfellow, David Andersen, George E. Dahl

    Abstract: Advances in machine learning have led to broad deployment of systems with impressive performance on important problems. Nonetheless, these systems can be induced to make errors on data that are surprisingly similar to examples the learned system handles correctly. The existence of these errors raises a variety of questions about out-of-sample generalization and whether bad actors might use such ex… ▽ More

    Submitted 19 July, 2018; v1 submitted 17 July, 2018; originally announced July 2018.

  15. arXiv:1806.11146  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Adversarial Reprogramming of Neural Networks

    Authors: Gamaleldin F. Elsayed, Ian Goodfellow, Jascha Sohl-Dickstein

    Abstract: Deep neural networks are susceptible to \emph{adversarial} attacks. In computer vision, well-crafted perturbations to images can cause neural networks to make mistakes such as confusing a cat with a computer. Previous adversarial attacks have been designed to degrade performance of models or cause machine learning models to produce specific outputs chosen ahead of time by the attacker. We introduc… ▽ More

    Submitted 29 November, 2018; v1 submitted 28 June, 2018; originally announced June 2018.

    Journal ref: International Conference on Learning Representations 2019

  16. arXiv:1806.04169  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Defense Against the Dark Arts: An overview of adversarial example security research and future research directions

    Authors: Ian Goodfellow

    Abstract: This article presents a summary of a keynote lecture at the Deep Learning Security workshop at IEEE Security and Privacy 2018. This lecture summarizes the state of the art in defenses against adversarial examples and provides recommendations for future research directions on this topic.

    Submitted 11 June, 2018; originally announced June 2018.

  17. arXiv:1805.08318  [pdf, other

    stat.ML cs.LG

    Self-Attention Generative Adversarial Networks

    Authors: Han Zhang, Ian Goodfellow, Dimitris Metaxas, Augustus Odena

    Abstract: In this paper, we propose the Self-Attention Generative Adversarial Network (SAGAN) which allows attention-driven, long-range dependency modeling for image generation tasks. Traditional convolutional GANs generate high-resolution details as a function of only spatially local points in lower-resolution feature maps. In SAGAN, details can be generated using cues from all feature locations. Moreover,… ▽ More

    Submitted 14 June, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

  18. arXiv:1804.09170  [pdf, other

    cs.LG stat.ML

    Realistic Evaluation of Deep Semi-Supervised Learning Algorithms

    Authors: Avital Oliver, Augustus Odena, Colin Raffel, Ekin D. Cubuk, Ian J. Goodfellow

    Abstract: Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain. SSL algorithms based on deep neural networks have recently proven successful on standard benchmark tasks. However, we argue that these benchmarks fail to address many issues that these algorithms would face in real-world applications. After creating a unified r… ▽ More

    Submitted 17 June, 2019; v1 submitted 24 April, 2018; originally announced April 2018.

    Journal ref: NeurIPS 2018 Proceedings

  19. arXiv:1804.07870  [pdf, other

    cs.LG stat.ML

    Gradient Masking Causes CLEVER to Overestimate Adversarial Perturbation Size

    Authors: Ian Goodfellow

    Abstract: A key problem in research on adversarial examples is that vulnerability to adversarial examples is usually measured by running attack algorithms. Because the attack algorithms are not optimal, the attack algorithms are prone to overestimating the size of perturbation needed to fool the target model. In other words, the attack-based methodology provides an upper-bound on the size of a perturbation… ▽ More

    Submitted 20 April, 2018; originally announced April 2018.

  20. arXiv:1804.00097  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    Adversarial Attacks and Defences Competition

    Authors: Alexey Kurakin, Ian Goodfellow, Samy Bengio, Yinpeng Dong, Fangzhou Liao, Ming Liang, Tianyu Pang, Jun Zhu, Xiaolin Hu, Cihang Xie, Jianyu Wang, Zhishuai Zhang, Zhou Ren, Alan Yuille, Sangxia Huang, Yao Zhao, Yuzhe Zhao, Zhonglin Han, Junjiajia Long, Yerkebulan Berdibekov, Takuya Akiba, Seiya Tokui, Motoki Abe

    Abstract: To accelerate research on adversarial examples and robustness of machine learning classifiers, Google Brain organized a NIPS 2017 competition that encouraged researchers to develop new methods to generate adversarial examples as well as to develop new ways to defend against them. In this chapter, we describe the structure and organization of the competition and the solutions developed by several o… ▽ More

    Submitted 30 March, 2018; originally announced April 2018.

    Comments: 36 pages, 10 figures

  21. arXiv:1803.06373  [pdf, ps, other

    cs.LG stat.ML

    Adversarial Logit Pairing

    Authors: Harini Kannan, Alexey Kurakin, Ian Goodfellow

    Abstract: In this paper, we develop improved techniques for defending against adversarial examples at scale. First, we implement the state of the art version of adversarial training at unprecedented scale on ImageNet and investigate whether it remains effective in this setting - an important open scientific question (Athalye et al., 2018). Next, we introduce enhanced defenses using a technique we call logit… ▽ More

    Submitted 16 March, 2018; originally announced March 2018.

    Comments: 10 pages

  22. arXiv:1802.08768  [pdf, other

    stat.ML cs.LG

    Is Generator Conditioning Causally Related to GAN Performance?

    Authors: Augustus Odena, Jacob Buckman, Catherine Olsson, Tom B. Brown, Christopher Olah, Colin Raffel, Ian Goodfellow

    Abstract: Recent work (Pennington et al, 2017) suggests that controlling the entire distribution of Jacobian singular values is an important design consideration in deep learning. Motivated by this, we study the distribution of singular values of the Jacobian of the generator in Generative Adversarial Networks (GANs). We find that this Jacobian generally becomes ill-conditioned at the beginning of training.… ▽ More

    Submitted 18 June, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

  23. arXiv:1802.08195  [pdf, other

    cs.LG cs.CV q-bio.NC stat.ML

    Adversarial Examples that Fool both Computer Vision and Time-Limited Humans

    Authors: Gamaleldin F. Elsayed, Shreya Shankar, Brian Cheung, Nicolas Papernot, Alex Kurakin, Ian Goodfellow, Jascha Sohl-Dickstein

    Abstract: Machine learning models are vulnerable to adversarial examples: small changes to images can cause computer vision models to make mistakes such as identifying a school bus as an ostrich. However, it is still an open question whether humans are prone to similar mistakes. Here, we address this question by leveraging recent techniques that transfer adversarial examples from computer vision models with… ▽ More

    Submitted 21 May, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

    Journal ref: Advances in Neural Information Processing Systems, 2018

  24. arXiv:1801.07736  [pdf, other

    stat.ML cs.AI cs.LG

    MaskGAN: Better Text Generation via Filling in the______

    Authors: William Fedus, Ian Goodfellow, Andrew M. Dai

    Abstract: Neural text generation models are often autoregressive language models or seq2seq models. These models generate text by sampling words sequentially, with each word conditioned on the previous word, and are state-of-the-art for several machine translation and summarization benchmarks. These benchmarks are often defined by validation perplexity even though this is not a direct measure of the quality… ▽ More

    Submitted 1 March, 2018; v1 submitted 23 January, 2018; originally announced January 2018.

    Comments: 16 pages, ICLR 2018

  25. arXiv:1710.08446  [pdf, other

    stat.ML cs.LG

    Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step

    Authors: William Fedus, Mihaela Rosca, Balaji Lakshminarayanan, Andrew M. Dai, Shakir Mohamed, Ian Goodfellow

    Abstract: Generative adversarial networks (GANs) are a family of generative models that do not minimize a single training criterion. Unlike other generative models, the data distribution is learned via a game between a generator (the generative model) and a discriminator (a teacher providing training signal) that each minimize their own cost. GANs are designed to reach a Nash equilibrium at which each playe… ▽ More

    Submitted 20 February, 2018; v1 submitted 23 October, 2017; originally announced October 2017.

    Comments: 18 pages

  26. arXiv:1708.08022  [pdf, ps, other

    stat.ML cs.CR cs.LG

    On the Protection of Private Information in Machine Learning Systems: Two Recent Approaches

    Authors: Martín Abadi, Úlfar Erlingsson, Ian Goodfellow, H. Brendan McMahan, Ilya Mironov, Nicolas Papernot, Kunal Talwar, Li Zhang

    Abstract: The recent, remarkable growth of machine learning has led to intense interest in the privacy of the data on which machine learning relies, and to new techniques for preserving privacy. However, older ideas about privacy may well remain valid and useful. This note reviews two recent works on privacy in the light of the wisdom of some of the early literature, in particular the principles distilled b… ▽ More

    Submitted 26 August, 2017; originally announced August 2017.

    Journal ref: IEEE 30th Computer Security Foundations Symposium (CSF), pages 1--6, 2017

  27. arXiv:1705.07204  [pdf, other

    stat.ML cs.CR cs.LG

    Ensemble Adversarial Training: Attacks and Defenses

    Authors: Florian Tramèr, Alexey Kurakin, Nicolas Papernot, Ian Goodfellow, Dan Boneh, Patrick McDaniel

    Abstract: Adversarial examples are perturbed inputs designed to fool machine learning models. Adversarial training injects such examples into training data to increase robustness. To scale this technique to large datasets, perturbations are crafted using fast single-step methods that maximize a linear approximation of the model's loss. We show that this form of adversarial training converges to a degenerate… ▽ More

    Submitted 26 April, 2020; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: 22 pages, 5 figures, International Conference on Learning Representations (ICLR) 2018 (amended in April 2020 to include subsequent attacks that significantly reduced the robustness of our models)

  28. arXiv:1704.03453  [pdf, other

    stat.ML cs.CR cs.LG

    The Space of Transferable Adversarial Examples

    Authors: Florian Tramèr, Nicolas Papernot, Ian Goodfellow, Dan Boneh, Patrick McDaniel

    Abstract: Adversarial examples are maliciously perturbed inputs designed to mislead machine learning (ML) models at test-time. They often transfer: the same adversarial example fools more than one model. In this work, we propose novel methods for estimating the previously unknown dimensionality of the space of adversarial inputs. We find that adversarial examples span a contiguous subspace of large (~25)… ▽ More

    Submitted 23 May, 2017; v1 submitted 11 April, 2017; originally announced April 2017.

    Comments: 15 pages, 7 figures

  29. arXiv:1702.02284  [pdf, other

    cs.LG cs.CR stat.ML

    Adversarial Attacks on Neural Network Policies

    Authors: Sandy Huang, Nicolas Papernot, Ian Goodfellow, Yan Duan, Pieter Abbeel

    Abstract: Machine learning classifiers are known to be vulnerable to inputs maliciously constructed by adversaries to force misclassification. Such adversarial examples have been extensively studied in the context of computer vision applications. In this work, we show adversarial attacks are also effective when targeting neural network policies in reinforcement learning. Specifically, we show existing adver… ▽ More

    Submitted 7 February, 2017; originally announced February 2017.

  30. arXiv:1611.01236  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    Adversarial Machine Learning at Scale

    Authors: Alexey Kurakin, Ian Goodfellow, Samy Bengio

    Abstract: Adversarial examples are malicious inputs designed to fool machine learning models. They often transfer from one model to another, allowing attackers to mount black box attacks without knowledge of the target model's parameters. Adversarial training is the process of explicitly training a model on adversarial examples, in order to make it more robust to attack or to reduce its test error on clean… ▽ More

    Submitted 10 February, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

    Comments: 17 pages, 5 figures

  31. arXiv:1610.05755  [pdf, other

    stat.ML cs.CR cs.LG

    Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

    Authors: Nicolas Papernot, Martín Abadi, Úlfar Erlingsson, Ian Goodfellow, Kunal Talwar

    Abstract: Some machine learning applications involve training data that is sensitive, such as the medical histories of patients in a clinical trial. A model may inadvertently and implicitly store some of its training data; careful analysis of the model may therefore reveal sensitive information. To address this problem, we demonstrate a generally applicable approach to providing strong privacy guarantees… ▽ More

    Submitted 3 March, 2017; v1 submitted 18 October, 2016; originally announced October 2016.

    Comments: Accepted to ICLR 17 as an oral

  32. arXiv:1610.00768  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Technical Report on the CleverHans v2.1.0 Adversarial Examples Library

    Authors: Nicolas Papernot, Fartash Faghri, Nicholas Carlini, Ian Goodfellow, Reuben Feinman, Alexey Kurakin, Cihang Xie, Yash Sharma, Tom Brown, Aurko Roy, Alexander Matyasko, Vahid Behzadan, Karen Hambardzumyan, Zhishuai Zhang, Yi-Lin Juang, Zhi Li, Ryan Sheatsley, Abhibhav Garg, Jonathan Uesato, Willi Gierke, Yinpeng Dong, David Berthelot, Paul Hendricks, Jonas Rauber, Rujun Long , et al. (1 additional authors not shown)

    Abstract: CleverHans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. The library may be used to develop more robust machine learning models and to provide standardized benchmarks of models' performance in the adversarial setting. Benchmarks constructed without a standardized implementation of adversarial exam… ▽ More

    Submitted 27 June, 2018; v1 submitted 3 October, 2016; originally announced October 2016.

    Comments: Technical report for https://github.com/tensorflow/cleverhans

  33. arXiv:1607.02533  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    Adversarial examples in the physical world

    Authors: Alexey Kurakin, Ian Goodfellow, Samy Bengio

    Abstract: Most existing machine learning classifiers are highly vulnerable to adversarial examples. An adversarial example is a sample of input data which has been modified very slightly in a way that is intended to cause a machine learning classifier to misclassify it. In many cases, these modifications can be so subtle that a human observer does not even notice the modification at all, yet the classifier… ▽ More

    Submitted 10 February, 2017; v1 submitted 8 July, 2016; originally announced July 2016.

    Comments: 14 pages, 6 figures. Demo available at https://youtu.be/zQ_uMenoBCk

  34. arXiv:1607.00133  [pdf, other

    stat.ML cs.CR cs.LG

    Deep Learning with Differential Privacy

    Authors: Martín Abadi, Andy Chu, Ian Goodfellow, H. Brendan McMahan, Ilya Mironov, Kunal Talwar, Li Zhang

    Abstract: Machine learning techniques based on neural networks are achieving remarkable results in a wide variety of domains. Often, the training of models requires large, representative datasets, which may be crowdsourced and contain sensitive information. The models should not expose private information in these datasets. Addressing this goal, we develop new algorithmic techniques for learning and a refin… ▽ More

    Submitted 24 October, 2016; v1 submitted 1 July, 2016; originally announced July 2016.

    Journal ref: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security (ACM CCS), pp. 308-318, 2016

  35. arXiv:1605.07725  [pdf, ps, other

    stat.ML cs.LG

    Adversarial Training Methods for Semi-Supervised Text Classification

    Authors: Takeru Miyato, Andrew M. Dai, Ian Goodfellow

    Abstract: Adversarial training provides a means of regularizing supervised learning algorithms while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting. However, both methods require making small perturbations to numerous entries of the input vector, which is inappropriate for sparse high-dimensional inputs such as one-hot word representations. We ex… ▽ More

    Submitted 16 November, 2021; v1 submitted 25 May, 2016; originally announced May 2016.

    Comments: Published as a conference paper at ICLR 2017

  36. arXiv:1510.01799  [pdf, ps, other

    stat.ML cs.LG

    Efficient Per-Example Gradient Computations

    Authors: Ian Goodfellow

    Abstract: This technical report describes an efficient technique for computing the norm of the gradient of the loss function for a neural network with respect to its parameters. This gradient norm can be computed efficiently for every example.

    Submitted 9 October, 2015; v1 submitted 6 October, 2015; originally announced October 2015.

    Comments: This revision fixed some typos. Many thanks to Hugo Larochelle for reporting them!

  37. arXiv:1412.6572  [pdf, other

    stat.ML cs.LG

    Explaining and Harnessing Adversarial Examples

    Authors: Ian J. Goodfellow, Jonathon Shlens, Christian Szegedy

    Abstract: Several machine learning models, including neural networks, consistently misclassify adversarial examples---inputs formed by applying small but intentionally worst-case perturbations to examples from the dataset, such that the perturbed input results in the model outputting an incorrect answer with high confidence. Early attempts at explaining this phenomenon focused on nonlinearity and overfittin… ▽ More

    Submitted 20 March, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

  38. arXiv:1412.6544  [pdf, other

    cs.NE cs.LG stat.ML

    Qualitatively characterizing neural network optimization problems

    Authors: Ian J. Goodfellow, Oriol Vinyals, Andrew M. Saxe

    Abstract: Training neural networks involves solving large-scale non-convex optimization problems. This task has long been believed to be extremely difficult, with fear of local minima and other obstacles motivating a variety of schemes to improve optimization, such as unsupervised pretraining. However, modern neural networks are able to achieve negligible training error on complex tasks, using only direct t… ▽ More

    Submitted 21 May, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

  39. arXiv:1412.6515  [pdf, other

    stat.ML

    On distinguishability criteria for estimating generative models

    Authors: Ian J. Goodfellow

    Abstract: Two recently introduced criteria for estimation of generative models are both based on a reduction to binary classification. Noise-contrastive estimation (NCE) is an estimation procedure in which a generative model is trained to be able to distinguish data samples from noise samples. Generative adversarial networks (GANs) are pairs of generator and discriminator networks, with the generator networ… ▽ More

    Submitted 21 May, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: This version adds a figure that appeared on the poster at ICLR, changes the template to say that the paper was accepted as a workshop contribution (previously it was under a review as a conference submission), and fixes some typos

  40. arXiv:1406.2661  [pdf, other

    stat.ML cs.LG

    Generative Adversarial Networks

    Authors: Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio

    Abstract: We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G. The training procedure for G is to maximize the probability of D making a mistake. This fram… ▽ More

    Submitted 10 June, 2014; originally announced June 2014.

  41. arXiv:1312.6211  [pdf, other

    stat.ML cs.LG cs.NE

    An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks

    Authors: Ian J. Goodfellow, Mehdi Mirza, Da Xiao, Aaron Courville, Yoshua Bengio

    Abstract: Catastrophic forgetting is a problem faced by many machine learning models and algorithms. When trained on one task, then trained on a second task, many machine learning models "forget" how to perform the first task. This is widely believed to be a serious problem for neural networks. Here, we investigate the extent to which the catastrophic forgetting problem occurs for modern neural networks, co… ▽ More

    Submitted 3 March, 2015; v1 submitted 21 December, 2013; originally announced December 2013.

  42. arXiv:1312.6197  [pdf, other

    stat.ML cs.LG cs.NE

    An empirical analysis of dropout in piecewise linear networks

    Authors: David Warde-Farley, Ian J. Goodfellow, Aaron Courville, Yoshua Bengio

    Abstract: The recently introduced dropout training criterion for neural networks has been the subject of much attention due to its simplicity and remarkable effectiveness as a regularizer, as well as its interpretation as a training procedure for an exponentially large ensemble of networks that share parameters. In this work we empirically investigate several questions related to the efficacy of dropout, sp… ▽ More

    Submitted 2 January, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: Extensive updates; 8 pages plus acknowledgements/references

  43. arXiv:1312.5258  [pdf, other

    stat.ML cs.LG

    On the Challenges of Physical Implementations of RBMs

    Authors: Vincent Dumoulin, Ian J. Goodfellow, Aaron Courville, Yoshua Bengio

    Abstract: Restricted Boltzmann machines (RBMs) are powerful machine learning models, but learning and some kinds of inference in the model require sampling-based approximations, which, in classical digital computers, are implemented using expensive MCMC. Physical computation offers the opportunity to reduce the cost of sampling by building physical systems whose natural dynamics correspond to drawing sample… ▽ More

    Submitted 24 October, 2014; v1 submitted 18 December, 2013; originally announced December 2013.

    Journal ref: Proc. AAAI 2014, pp. 1199-1205

  44. arXiv:1308.4214  [pdf, ps, other

    stat.ML cs.LG cs.MS

    Pylearn2: a machine learning research library

    Authors: Ian J. Goodfellow, David Warde-Farley, Pascal Lamblin, Vincent Dumoulin, Mehdi Mirza, Razvan Pascanu, James Bergstra, Frédéric Bastien, Yoshua Bengio

    Abstract: Pylearn2 is a machine learning research library. This does not just mean that it is a collection of machine learning algorithms that share a common API; it means that it has been designed for flexibility and extensibility in order to facilitate research projects that involve new or unusual use cases. In this paper we give a brief history of the library, an overview of its basic philosophy, a summa… ▽ More

    Submitted 19 August, 2013; originally announced August 2013.

    Comments: 9 pages

  45. arXiv:1307.0414  [pdf, other

    stat.ML cs.LG

    Challenges in Representation Learning: A report on three machine learning contests

    Authors: Ian J. Goodfellow, Dumitru Erhan, Pierre Luc Carrier, Aaron Courville, Mehdi Mirza, Ben Hamner, Will Cukierski, Yichuan Tang, David Thaler, Dong-Hyun Lee, Yingbo Zhou, Chetan Ramaiah, Fangxiang Feng, Ruifan Li, Xiaojie Wang, Dimitris Athanasakis, John Shawe-Taylor, Maxim Milakov, John Park, Radu Ionescu, Marius Popescu, Cristian Grozea, James Bergstra, Jingjing Xie, Lukasz Romaszko , et al. (3 additional authors not shown)

    Abstract: The ICML 2013 Workshop on Challenges in Representation Learning focused on three challenges: the black box learning challenge, the facial expression recognition challenge, and the multimodal learning challenge. We describe the datasets created for these challenges and summarize the results of the competitions. We provide suggestions for organizers of future challenges and some comments on what kin… ▽ More

    Submitted 1 July, 2013; originally announced July 2013.

    Comments: 8 pages, 2 figures

  46. arXiv:1302.4389  [pdf, other

    stat.ML cs.LG

    Maxout Networks

    Authors: Ian J. Goodfellow, David Warde-Farley, Mehdi Mirza, Aaron Courville, Yoshua Bengio

    Abstract: We consider the problem of designing models to leverage a recently introduced approximate model averaging technique called dropout. We define a simple new model called maxout (so named because its output is the max of a set of inputs, and because it is a natural companion to dropout) designed to both facilitate optimization by dropout and improve the accuracy of dropout's fast approximate model av… ▽ More

    Submitted 20 September, 2013; v1 submitted 18 February, 2013; originally announced February 2013.

    Comments: This is the version of the paper that appears in ICML 2013

    Journal ref: JMLR WCP 28 (3): 1319-1327, 2013

  47. arXiv:1301.5088  [pdf, ps, other

    stat.ML cs.LG

    Piecewise Linear Multilayer Perceptrons and Dropout

    Authors: Ian J. Goodfellow

    Abstract: We propose a new type of hidden layer for a multilayer perceptron, and demonstrate that it obtains the best reported performance for an MLP on the MNIST dataset.

    Submitted 22 January, 2013; originally announced January 2013.

  48. arXiv:1301.3568  [pdf, other

    stat.ML cs.LG

    Joint Training Deep Boltzmann Machines for Classification

    Authors: Ian J. Goodfellow, Aaron Courville, Yoshua Bengio

    Abstract: We introduce a new method for training deep Boltzmann machines jointly. Prior methods of training DBMs require an initial learning pass that trains the model greedily, one layer at a time, or do not perform well on classification tasks. In our approach, we train all layers of the DBM simultaneously, using a novel training procedure called multi-prediction training. The resulting model can either b… ▽ More

    Submitted 1 May, 2013; v1 submitted 15 January, 2013; originally announced January 2013.

    Comments: Major revision with new techniques and experiments. This version includes new material put on the poster for the ICLR workshop

  49. arXiv:1212.2686  [pdf, ps, other

    stat.ML cs.LG

    Joint Training of Deep Boltzmann Machines

    Authors: Ian Goodfellow, Aaron Courville, Yoshua Bengio

    Abstract: We introduce a new method for training deep Boltzmann machines jointly. Prior methods require an initial learning pass that trains the deep Boltzmann machine greedily, one layer at a time, or do not perform well on classifi- cation tasks.

    Submitted 11 December, 2012; originally announced December 2012.

    Comments: 4 pages

  50. arXiv:1206.6407  [pdf

    cs.LG stat.ML

    Large-Scale Feature Learning With Spike-and-Slab Sparse Coding

    Authors: Ian Goodfellow, Aaron Courville, Yoshua Bengio

    Abstract: We consider the problem of object recognition with a large number of classes. In order to overcome the low amount of labeled examples available in this setting, we introduce a new feature learning and extraction procedure based on a factor model we call spike-and-slab sparse coding (S3C). Prior work on S3C has not prioritized the ability to exploit parallel architectures and scale S3C to the enorm… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012). arXiv admin note: substantial text overlap with arXiv:1201.3382