Skip to main content

Showing 1–19 of 19 results for author: Fischer, I

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.02217  [pdf, other

    cs.LG cs.CV stat.ML

    Stochastic Sampling from Deterministic Flow Models

    Authors: Saurabh Singh, Ian Fischer

    Abstract: Deterministic flow models, such as rectified flows, offer a general framework for learning a deterministic transport map between two distributions, realized as the vector field for an ordinary differential equation (ODE). However, they are sensitive to model estimation and discretization errors and do not permit different samples conditioned on an intermediate state, limiting their application. We… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Submitted to ICLR 2025

  2. arXiv:2211.09981  [pdf, other

    cs.LG cs.AI stat.ML

    Weighted Ensemble Self-Supervised Learning

    Authors: Yangjun Ruan, Saurabh Singh, Warren Morningstar, Alexander A. Alemi, Sergey Ioffe, Ian Fischer, Joshua V. Dillon

    Abstract: Ensembling has proven to be a powerful technique for boosting model performance, uncertainty estimation, and robustness in supervised learning. Advances in self-supervised learning (SSL) enable leveraging large unlabeled corpora for state-of-the-art few-shot and supervised learning performance. In this paper, we explore how ensemble methods can improve recent SSL techniques by developing a framewo… ▽ More

    Submitted 9 April, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by ICLR 2023

  3. arXiv:2206.04114  [pdf, other

    cs.AI cs.LG cs.RO stat.ML

    Deep Hierarchical Planning from Pixels

    Authors: Danijar Hafner, Kuang-Huei Lee, Ian Fischer, Pieter Abbeel

    Abstract: Intelligent agents need to select long sequences of actions to solve complex tasks. While humans easily break down tasks into subgoals and reach them through millions of muscle commands, current artificial intelligence is limited to tasks with horizons of a few hundred decisions, despite large compute budgets. Research on hierarchical reinforcement learning aims to overcome this limitation but has… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Website: https://danijar.com/director

  4. arXiv:2203.02592  [pdf, other

    stat.ML cs.LG stat.ME

    Sparsity-Inducing Categorical Prior Improves Robustness of the Information Bottleneck

    Authors: Anirban Samaddar, Sandeep Madireddy, Prasanna Balaprakash, Tapabrata Maiti, Gustavo de los Campos, Ian Fischer

    Abstract: The information bottleneck framework provides a systematic approach to learning representations that compress nuisance information in the input and extract semantically meaningful information about predictions. However, the choice of a prior distribution that fixes the dimensionality across all the data can restrict the flexibility of this approach for learning robust representations. We present a… ▽ More

    Submitted 27 October, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

  5. arXiv:2011.08711  [pdf, other

    stat.ML cs.LG

    VIB is Half Bayes

    Authors: Alexander A Alemi, Warren R Morningstar, Ben Poole, Ian Fischer, Joshua V Dillon

    Abstract: In discriminative settings such as regression and classification there are two random variables at play, the inputs X and the targets Y. Here, we demonstrate that the Variational Information Bottleneck can be viewed as a compromise between fully empirical and fully Bayesian objectives, attempting to minimize the risks due to finite sampling of Y only. We argue that this approach provides some of t… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  6. arXiv:2007.12401  [pdf, other

    cs.LG cs.AI cs.IT cs.RO stat.ML

    Predictive Information Accelerates Learning in RL

    Authors: Kuang-Huei Lee, Ian Fischer, Anthony Liu, Yijie Guo, Honglak Lee, John Canny, Sergio Guadarrama

    Abstract: The Predictive Information is the mutual information between the past and the future, I(X_past; X_future). We hypothesize that capturing the predictive information is useful in RL, since the ability to model what will happen next is necessary for success on many tasks. To test our hypothesis, we train Soft Actor-Critic (SAC) agents from pixels with an auxiliary task that learns a compressed repres… ▽ More

    Submitted 25 October, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: To appear at NeurIPS 2020

  7. arXiv:2007.12335  [pdf, other

    cs.LG stat.ML

    Cycles in Causal Learning

    Authors: Katie Everett, Ian Fischer

    Abstract: In the causal learning setting, we wish to learn cause-and-effect relationships between variables such that we can correctly infer the effect of an intervention. While the difference between a cyclic structure and an acyclic structure may be just a single edge, cyclic causal structures have qualitatively different behavior under intervention: cycles cause feedback loops when the downstream effect… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

  8. arXiv:2002.05380  [pdf, other

    cs.LG stat.ML

    CEB Improves Model Robustness

    Authors: Ian Fischer, Alexander A. Alemi

    Abstract: We demonstrate that the Conditional Entropy Bottleneck (CEB) can improve model robustness. CEB is an easy strategy to implement and works in tandem with data augmentation procedures. We report results of a large scale adversarial robustness study on CIFAR-10, as well as the ImageNet-C Common Corruptions Benchmark, ImageNet-A, and PGD attacks.

    Submitted 13 February, 2020; originally announced February 2020.

  9. arXiv:2002.05379  [pdf, other

    cs.LG stat.ML

    The Conditional Entropy Bottleneck

    Authors: Ian Fischer

    Abstract: Much of the field of Machine Learning exhibits a prominent set of failure modes, including vulnerability to adversarial examples, poor out-of-distribution (OoD) detection, miscalibration, and willingness to memorize random labelings of datasets. We characterize these as failures of robust generalization, which extends the traditional measure of generalization as accuracy or related metrics on a he… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  10. arXiv:2001.01878  [pdf, other

    cs.LG cs.IT stat.ML

    Phase Transitions for the Information Bottleneck in Representation Learning

    Authors: Tailin Wu, Ian Fischer

    Abstract: In the Information Bottleneck (IB), when tuning the relative strength between compression and prediction terms, how do the two terms behave, and what's their relationship with the dataset and the learned representation? In this paper, we set out to answer these questions by studying multiple phase transitions in the IB objective: $\text{IB}_β[p(z|x)] = I(X; Z) - βI(Y; Z)$ defined on the encoding d… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

    Comments: ICLR 2020; 27 pages, 7 figures

  11. arXiv:1907.07331  [pdf, other

    cs.LG cs.IT stat.ML

    Learnability for the Information Bottleneck

    Authors: Tailin Wu, Ian Fischer, Isaac L. Chuang, Max Tegmark

    Abstract: The Information Bottleneck (IB) method (\cite{tishby2000information}) provides an insightful and principled approach for balancing compression and prediction for representation learning. The IB objective $I(X;Z)-βI(Y;Z)$ employs a Lagrange multiplier $β$ to tune this trade-off. However, in practice, not only is $β$ chosen empirically without theoretical guidance, there is also a lack of theoretica… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: Accepted at UAI 2019

  12. arXiv:1905.07478  [pdf, other

    cs.LG stat.ML

    Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces

    Authors: Bryan Seybold, Emily Fertig, Alex Alemi, Ian Fischer

    Abstract: Variational autoencoders learn unsupervised data representations, but these models frequently converge to minima that fail to preserve meaningful semantic information. For example, variational autoencoders with autoregressive decoders often collapse into autodecoders, where they learn to ignore the encoder input. In this work, we demonstrate that adding an auxiliary decoder to regularize the laten… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 16 pages, 9 figures, supplemental

  13. arXiv:1811.04551  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Latent Dynamics for Planning from Pixels

    Authors: Danijar Hafner, Timothy Lillicrap, Ian Fischer, Ruben Villegas, David Ha, Honglak Lee, James Davidson

    Abstract: Planning has been very successful for control tasks with known environment dynamics. To leverage planning in unknown environments, the agent needs to learn the dynamics from interactions with the world. However, learning dynamics models that are accurate enough for planning has been a long-standing challenge, especially in image-based domains. We propose the Deep Planning Network (PlaNet), a purel… ▽ More

    Submitted 4 June, 2019; v1 submitted 11 November, 2018; originally announced November 2018.

    Comments: 20 pages, 12 figures, 1 table

  14. arXiv:1807.04162  [pdf, other

    cs.LG cond-mat.stat-mech stat.ML

    TherML: Thermodynamics of Machine Learning

    Authors: Alexander A. Alemi, Ian Fischer

    Abstract: In this work we offer a framework for reasoning about a wide class of existing objectives in machine learning. We develop a formal correspondence between this work and thermodynamics and discuss its implications.

    Submitted 4 October, 2018; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: Presented at the ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models

  15. arXiv:1807.00906  [pdf, other

    cs.LG stat.ML

    Uncertainty in the Variational Information Bottleneck

    Authors: Alexander A. Alemi, Ian Fischer, Joshua V. Dillon

    Abstract: We present a simple case study, demonstrating that Variational Information Bottleneck (VIB) can improve a network's classification calibration as well as its ability to detect out-of-distribution data. Without explicitly being designed to do so, VIB gives two natural metrics for handling and quantifying uncertainty.

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: 10 pages, 7 figures. Accepted to UAI 2018 - Uncertainty in Deep Learning Workshop

  16. arXiv:1802.04874  [pdf, other

    stat.ML cs.LG

    GILBO: One Metric to Measure Them All

    Authors: Alexander A. Alemi, Ian Fischer

    Abstract: We propose a simple, tractable lower bound on the mutual information contained in the joint generative density of any latent variable generative model: the GILBO (Generative Information Lower BOund). It offers a data-independent measure of the complexity of the learned latent variable description, giving the log of the effective description length. It is well-defined for both VAEs and GANs. We com… ▽ More

    Submitted 10 January, 2019; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: Accepted at NeurIPS 2018

  17. arXiv:1711.00464  [pdf, other

    cs.LG stat.ML

    Fixing a Broken ELBO

    Authors: Alexander A. Alemi, Ben Poole, Ian Fischer, Joshua V. Dillon, Rif A. Saurous, Kevin Murphy

    Abstract: Recent work in unsupervised representation learning has focused on learning deep directed latent-variable models. Fitting these models by maximizing the marginal likelihood or evidence is typically intractable, thus a common approximation is to maximize the evidence lower bound (ELBO) instead. However, maximum likelihood training (whether exact or approximate) does not necessarily result in a good… ▽ More

    Submitted 13 February, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 21 pages, 9 figures

  18. arXiv:1705.10762  [pdf, other

    cs.LG cs.CV stat.ML

    Generative Models of Visually Grounded Imagination

    Authors: Ramakrishna Vedantam, Ian Fischer, Jonathan Huang, Kevin Murphy

    Abstract: It is easy for people to imagine what a man with pink hair looks like, even if they have never seen such a person before. We call the ability to create images of novel semantic concepts visually grounded imagination. In this paper, we show how we can modify variational auto-encoders to perform this task. Our method uses a novel training objective, and a novel product-of-experts inference network,… ▽ More

    Submitted 9 November, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: International Conference on Learning Representations (ICLR), 2018

  19. arXiv:1702.06832  [pdf, other

    stat.ML cs.LG

    Adversarial examples for generative models

    Authors: Jernej Kos, Ian Fischer, Dawn Song

    Abstract: We explore methods of producing adversarial examples on deep generative models such as the variational autoencoder (VAE) and the VAE-GAN. Deep learning architectures are known to be vulnerable to adversarial examples, but previous work has focused on the application of adversarial examples to classification tasks. Deep generative models have recently become popular due to their ability to model in… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.