Skip to main content

Showing 1–43 of 43 results for author: Fischer, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.14622  [pdf, ps, other

    cs.CR cs.AI cs.LG

    Adversarial Distilled Retrieval-Augmented Guarding Model for Online Malicious Intent Detection

    Authors: Yihao Guo, Haocheng Bian, Liutong Zhou, Ze Wang, Zhaoyi Zhang, Francois Kawala, Milan Dean, Ian Fischer, Yuantao Peng, Noyan Tokgozoglu, Ivan Barrientos, Riyaaz Shaik, Rachel Li, Chandru Venkataraman, Reza Shifteh Far, Moses Pawar, Venkat Sundaranatha, Michael Xu, Frank Chu

    Abstract: With the deployment of Large Language Models (LLMs) in interactive applications, online malicious intent detection has become increasingly critical. However, existing approaches fall short of handling diverse and complex user queries in real time. To address these challenges, we introduce ADRAG (Adversarial Distilled Retrieval-Augmented Guard), a two-stage framework for robust and efficient online… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

  2. arXiv:2501.09891  [pdf, other

    cs.AI

    Evolving Deeper LLM Thinking

    Authors: Kuang-Huei Lee, Ian Fischer, Yueh-Hua Wu, Dave Marwood, Shumeet Baluja, Dale Schuurmans, Xinyun Chen

    Abstract: We explore an evolutionary search strategy for scaling inference time compute in Large Language Models. The proposed approach, Mind Evolution, uses a language model to generate, recombine and refine candidate responses. The proposed approach avoids the need to formalize the underlying inference problem whenever a solution evaluator is available. Controlling for inference cost, we find that Mind Ev… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  3. arXiv:2412.03206  [pdf, other

    cs.ET physics.optics

    Experimental reservoir computing with diffractively coupled VCSELs

    Authors: Moritz Pflüger, Daniel Brunner, Tobias Heuser, James A. Lott, Stephan Reitzenstein, Ingo Fischer

    Abstract: We present experiments on reservoir computing (RC) using a network of vertical-cavity surface-emitting lasers (VCSELs) that we diffractively couple via an external cavity. Our optical reservoir computer consists of 24 physical VCSEL nodes. We evaluate the system's memory and solve the 2-bit XOR task and the 3-bit header recognition (HR) task with bit error ratios (BERs) below 1\,\% and the 2-bit d… ▽ More

    Submitted 4 December, 2024; originally announced December 2024.

    Journal ref: Optics Letters 49, 2285 (2024)

  4. arXiv:2410.02217  [pdf, other

    cs.LG cs.CV stat.ML

    Stochastic Sampling from Deterministic Flow Models

    Authors: Saurabh Singh, Ian Fischer

    Abstract: Deterministic flow models, such as rectified flows, offer a general framework for learning a deterministic transport map between two distributions, realized as the vector field for an ordinary differential equation (ODE). However, they are sensitive to model estimation and discretization errors and do not permit different samples conditioned on an intermediate state, limiting their application. We… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Submitted to ICLR 2025

  5. arXiv:2405.07236  [pdf, other

    cs.LG nlin.AO

    Adaptive control of recurrent neural networks using conceptors

    Authors: Guillaume Pourcel, Mirko Goldmann, Ingo Fischer, Miguel C. Soriano

    Abstract: Recurrent Neural Networks excel at predicting and generating complex high-dimensional temporal patterns. Due to their inherent nonlinear dynamics and memory, they can learn unbounded temporal dependencies from data. In a Machine Learning setting, the network's parameters are adapted during a training phase to match the requirements of a given task/problem increasing its computational capabilities.… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  6. arXiv:2402.09727  [pdf, other

    cs.CL cs.AI cs.IR

    A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

    Authors: Kuang-Huei Lee, Xinyun Chen, Hiroki Furuta, John Canny, Ian Fischer

    Abstract: Current Large Language Models (LLMs) are not only limited to some maximum context length, but also are not able to robustly consume long inputs. To address these limitations, we propose ReadAgent, an LLM agent system that increases effective context length up to 20x in our experiments. Inspired by how humans interactively read long documents, we implement ReadAgent as a simple prompting system tha… ▽ More

    Submitted 22 July, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Website: https://read-agent.github.io

  7. arXiv:2211.09981  [pdf, other

    cs.LG cs.AI stat.ML

    Weighted Ensemble Self-Supervised Learning

    Authors: Yangjun Ruan, Saurabh Singh, Warren Morningstar, Alexander A. Alemi, Sergey Ioffe, Ian Fischer, Joshua V. Dillon

    Abstract: Ensembling has proven to be a powerful technique for boosting model performance, uncertainty estimation, and robustness in supervised learning. Advances in self-supervised learning (SSL) enable leveraging large unlabeled corpora for state-of-the-art few-shot and supervised learning performance. In this paper, we explore how ensemble methods can improve recent SSL techniques by developing a framewo… ▽ More

    Submitted 9 April, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by ICLR 2023

  8. arXiv:2210.08217  [pdf, other

    cs.RO cs.AI cs.IT cs.LG

    PI-QT-Opt: Predictive Information Improves Multi-Task Robotic Reinforcement Learning at Scale

    Authors: Kuang-Huei Lee, Ted Xiao, Adrian Li, Paul Wohlhart, Ian Fischer, Yao Lu

    Abstract: The predictive information, the mutual information between the past and future, has been shown to be a useful representation learning auxiliary loss for training reinforcement learning agents, as the ability to model what will happen next is critical to success on many control tasks. While existing studies are largely restricted to training specialist agents on single-task settings in simulation,… ▽ More

    Submitted 24 November, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: CoRL 2022. 21 pages, 9 figures. The supplementary video is available at https://kuanghuei.github.io/piqtopt

  9. arXiv:2207.14133  [pdf, other

    cs.LG nlin.CD

    Learning unseen coexisting attractors

    Authors: Daniel J. Gauthier, Ingo Fischer, André Röhm

    Abstract: Reservoir computing is a machine learning approach that can generate a surrogate model of a dynamical system. It can learn the underlying dynamical system using fewer trainable parameters and hence smaller training data sets than competing approaches. Recently, a simpler formulation, known as next-generation reservoir computing, removes many algorithm metaparameters and identifies a well-performin… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: 8 pages, 7 figures

  10. arXiv:2206.04114  [pdf, other

    cs.AI cs.LG cs.RO stat.ML

    Deep Hierarchical Planning from Pixels

    Authors: Danijar Hafner, Kuang-Huei Lee, Ian Fischer, Pieter Abbeel

    Abstract: Intelligent agents need to select long sequences of actions to solve complex tasks. While humans easily break down tasks into subgoals and reach them through millions of muscle commands, current artificial intelligence is limited to tasks with horizons of a few hundred decisions, despite large compute budgets. Research on hierarchical reinforcement learning aims to overcome this limitation but has… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Website: https://danijar.com/director

  11. arXiv:2205.15241  [pdf, other

    cs.AI cs.LG

    Multi-Game Decision Transformers

    Authors: Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch

    Abstract: A longstanding goal of the field of AI is a method for learning a highly capable, generalist agent from diverse experience. In the subfields of vision and language, this was largely achieved by scaling up transformer-based models and training them on large, diverse datasets. Motivated by this progress, we investigate whether the same strategy can be used to produce generalist reinforcement learnin… ▽ More

    Submitted 15 October, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022. 24 pages, 16 figures. Additional information, videos and code can be seen at https://sites.google.com/view/multi-game-transformers

  12. arXiv:2205.07886  [pdf, other

    cs.LG cs.AI

    An Empirical Investigation of Representation Learning for Imitation

    Authors: Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

    Abstract: Imitation learning often needs a large demonstration set in order to handle the full range of situations that an agent might find itself in during deployment. However, collecting expert demonstrations can be expensive. Recent work in vision, reinforcement learning, and NLP has shown that auxiliary representation learning objectives can reduce the need for large amounts of expensive, task-specific… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS2021 Datasets and Benchmarks Track

  13. arXiv:2203.02592  [pdf, other

    stat.ML cs.LG stat.ME

    Sparsity-Inducing Categorical Prior Improves Robustness of the Information Bottleneck

    Authors: Anirban Samaddar, Sandeep Madireddy, Prasanna Balaprakash, Tapabrata Maiti, Gustavo de los Campos, Ian Fischer

    Abstract: The information bottleneck framework provides a systematic approach to learning representations that compress nuisance information in the input and extract semantically meaningful information about predictions. However, the choice of a prior distribution that fixes the dimensionality across all the data can restrict the flexibility of this approach for learning robust representations. We present a… ▽ More

    Submitted 27 October, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

  14. arXiv:2111.03706  [pdf, other

    cs.LG nlin.AO

    Learn one size to infer all: Exploiting translational symmetries in delay-dynamical and spatio-temporal systems using scalable neural networks

    Authors: Mirko Goldmann, Claudio R. Mirasso, Ingo Fischer, Miguel C. Soriano

    Abstract: We design scalable neural networks adapted to translational symmetries in dynamical systems, capable of inferring untrained high-dimensional dynamics for different system sizes. We train these networks to predict the dynamics of delay-dynamical and spatio-temporal systems for a single size. Then, we drive the networks by their own predictions. We demonstrate that by scaling the size of the trained… ▽ More

    Submitted 5 July, 2024; v1 submitted 5 November, 2021; originally announced November 2021.

  15. Tutorial: Photonic Neural Networks in Delay Systems

    Authors: D. Brunner, B. Penkovsky, B. A. Marquez, M. Jaquot, I. Fischer, L. Larger

    Abstract: Photonic delay systems have revolutionized the hardware implementation of Recurrent Neural Networks and Reservoir Computing in particular. The fundamental principles of Reservoir Computing strongly benefit a realization in such complex analog systems. Especially delay systems, potentially providing large numbers of degrees of freedom even in simple architectures, can efficiently be exploited for i… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Journal ref: Journal of Applied Physics 124, 152004 (2018)

  16. arXiv:2109.12909  [pdf, other

    cs.LG cs.CV cs.IT

    Compressive Visual Representations

    Authors: Kuang-Huei Lee, Anurag Arnab, Sergio Guadarrama, John Canny, Ian Fischer

    Abstract: Learning effective visual representations that generalize well without human supervision is a fundamental problem in order to apply Machine Learning to a wide variety of tasks. Recently, two families of self-supervised methods, contrastive learning and latent bootstrapping, exemplified by SimCLR and BYOL respectively, have made significant progress. In this work, we hypothesize that adding explici… ▽ More

    Submitted 4 December, 2021; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: NeurIPS 2021. 27 pages, 4 figures. Code and pretrained models at https://github.com/google-research/compressive-visual-representations

  17. arXiv:2108.04074  [pdf, other

    cs.LG nlin.AO

    Model-free inference of unseen attractors: Reconstructing phase space features from a single noisy trajectory using reservoir computing

    Authors: André Röhm, Daniel J. Gauthier, Ingo Fischer

    Abstract: Reservoir computers are powerful tools for chaotic time series prediction. They can be trained to approximate phase space flows and can thus both predict future values to a high accuracy, as well as reconstruct the general properties of a chaotic attractor without requiring a model. In this work, we show that the ability to learn the dynamics of a complex system can be extended to systems with co-… ▽ More

    Submitted 30 September, 2021; v1 submitted 6 August, 2021; originally announced August 2021.

    Journal ref: Chaos 31, 103127 (2021)

  18. Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture using Feedback-Modulated Delay Loops

    Authors: Florian Stelzer, André Röhm, Raul Vicente, Ingo Fischer, Serhiy Yanchuk

    Abstract: Deep neural networks are among the most widely applied machine learning tools showing outstanding performance in a broad range of tasks. We present a method for folding a deep neural network of arbitrary size into a single neuron with multiple time-delayed feedback loops. This single-neuron deep neural network comprises only a single nonlinearity and appropriately adjusted modulations of the feedb… ▽ More

    Submitted 6 June, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

  19. arXiv:2011.08711  [pdf, other

    stat.ML cs.LG

    VIB is Half Bayes

    Authors: Alexander A Alemi, Warren R Morningstar, Ben Poole, Ian Fischer, Joshua V Dillon

    Abstract: In discriminative settings such as regression and classification there are two random variables at play, the inputs X and the targets Y. Here, we demonstrate that the Variational Information Bottleneck can be viewed as a compromise between fully empirical and fully Bayesian objectives, attempting to minimize the risks due to finite sampling of Y only. We argue that this approach provides some of t… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  20. arXiv:2007.12401  [pdf, other

    cs.LG cs.AI cs.IT cs.RO stat.ML

    Predictive Information Accelerates Learning in RL

    Authors: Kuang-Huei Lee, Ian Fischer, Anthony Liu, Yijie Guo, Honglak Lee, John Canny, Sergio Guadarrama

    Abstract: The Predictive Information is the mutual information between the past and the future, I(X_past; X_future). We hypothesize that capturing the predictive information is useful in RL, since the ability to model what will happen next is necessary for success on many tasks. To test our hypothesis, we train Soft Actor-Critic (SAC) agents from pixels with an auxiliary task that learns a compressed repres… ▽ More

    Submitted 25 October, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: To appear at NeurIPS 2020

  21. arXiv:2007.12335  [pdf, other

    cs.LG stat.ML

    Cycles in Causal Learning

    Authors: Katie Everett, Ian Fischer

    Abstract: In the causal learning setting, we wish to learn cause-and-effect relationships between variables such that we can correctly infer the effect of an intervention. While the difference between a cyclic structure and an acyclic structure may be just a single edge, cyclic causal structures have qualitatively different behavior under intervention: cycles cause feedback loops when the downstream effect… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

  22. arXiv:2006.13933  [pdf, other

    cs.ET physics.app-ph

    Developing of a photonic hardware platform for brain-inspired computing based on $5\times5$ VCSEL arrays

    Authors: T. Heuser, M. Pflüger, I. Fischer, J. A. Lott, D. Brunner, S. Reitzenstein

    Abstract: Brain-inspired computing concepts like artificial neural networks have become promising alternatives to classical von Neumann computer architectures. Photonic neural networks target the realizations of neurons, network connections and potentially learning in photonic substrates. Here, we report the development of a nanophotonic hardware platform of fast and energy-efficient photonic neurons via ar… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

  23. arXiv:2006.06752  [pdf, other

    cs.CV

    An Unsupervised Information-Theoretic Perceptual Quality Metric

    Authors: Sangnie Bhardwaj, Ian Fischer, Johannes Ballé, Troy Chinen

    Abstract: Tractable models of human perception have proved to be challenging to build. Hand-designed models such as MS-SSIM remain popular predictors of human image quality judgements due to their simplicity and speed. Recent modern deep learning approaches can perform better, but they rely on supervised data which can be costly to gather: large sets of class labels such as ImageNet, image quality ratings,… ▽ More

    Submitted 10 January, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 19 pages, 10 figures. Presented at NeurIPS 2020. Code available at https://github.com/google-research/perceptual-quality

  24. arXiv:2002.05380  [pdf, other

    cs.LG stat.ML

    CEB Improves Model Robustness

    Authors: Ian Fischer, Alexander A. Alemi

    Abstract: We demonstrate that the Conditional Entropy Bottleneck (CEB) can improve model robustness. CEB is an easy strategy to implement and works in tandem with data augmentation procedures. We report results of a large scale adversarial robustness study on CIFAR-10, as well as the ImageNet-C Common Corruptions Benchmark, ImageNet-A, and PGD attacks.

    Submitted 13 February, 2020; originally announced February 2020.

  25. arXiv:2002.05379  [pdf, other

    cs.LG stat.ML

    The Conditional Entropy Bottleneck

    Authors: Ian Fischer

    Abstract: Much of the field of Machine Learning exhibits a prominent set of failure modes, including vulnerability to adversarial examples, poor out-of-distribution (OoD) detection, miscalibration, and willingness to memorize random labelings of datasets. We characterize these as failures of robust generalization, which extends the traditional measure of generalization as accuracy or related metrics on a he… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  26. arXiv:2001.01878  [pdf, other

    cs.LG cs.IT stat.ML

    Phase Transitions for the Information Bottleneck in Representation Learning

    Authors: Tailin Wu, Ian Fischer

    Abstract: In the Information Bottleneck (IB), when tuning the relative strength between compression and prediction terms, how do the two terms behave, and what's their relationship with the dataset and the learned representation? In this paper, we set out to answer these questions by studying multiple phase transitions in the IB objective: $\text{IB}_β[p(z|x)] = I(X; Z) - βI(Y; Z)$ defined on the encoding d… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

    Comments: ICLR 2020; 27 pages, 7 figures

  27. arXiv:1907.09578  [pdf, other

    cs.CV cs.IT cs.LG

    Information-Bottleneck Approach to Salient Region Discovery

    Authors: Andrey Zhmoginov, Ian Fischer, Mark Sandler

    Abstract: We propose a new method for learning image attention masks in a semi-supervised setting based on the Information Bottleneck principle. Provided with a set of labeled images, the mask generation model is minimizing mutual information between the input and the masked image while maximizing the mutual information between the same masked image and the image label. In contrast with other approaches, ou… ▽ More

    Submitted 14 February, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

  28. arXiv:1907.07331  [pdf, other

    cs.LG cs.IT stat.ML

    Learnability for the Information Bottleneck

    Authors: Tailin Wu, Ian Fischer, Isaac L. Chuang, Max Tegmark

    Abstract: The Information Bottleneck (IB) method (\cite{tishby2000information}) provides an insightful and principled approach for balancing compression and prediction for representation learning. The IB objective $I(X;Z)-βI(Y;Z)$ employs a Lagrange multiplier $β$ to tune this trade-off. However, in practice, not only is $β$ chosen empirically without theoretical guidance, there is also a lack of theoretica… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: Accepted at UAI 2019

  29. arXiv:1905.07478  [pdf, other

    cs.LG stat.ML

    Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces

    Authors: Bryan Seybold, Emily Fertig, Alex Alemi, Ian Fischer

    Abstract: Variational autoencoders learn unsupervised data representations, but these models frequently converge to minima that fail to preserve meaningful semantic information. For example, variational autoencoders with autoregressive decoders often collapse into autodecoders, where they learn to ignore the encoder input. In this work, we demonstrate that adding an auxiliary decoder to regularize the laten… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 16 pages, 9 figures, supplemental

  30. arXiv:1811.04551  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Latent Dynamics for Planning from Pixels

    Authors: Danijar Hafner, Timothy Lillicrap, Ian Fischer, Ruben Villegas, David Ha, Honglak Lee, James Davidson

    Abstract: Planning has been very successful for control tasks with known environment dynamics. To leverage planning in unknown environments, the agent needs to learn the dynamics from interactions with the world. However, learning dynamics models that are accurate enough for planning has been a long-standing challenge, especially in image-based domains. We propose the Deep Planning Network (PlaNet), a purel… ▽ More

    Submitted 4 June, 2019; v1 submitted 11 November, 2018; originally announced November 2018.

    Comments: 20 pages, 12 figures, 1 table

  31. arXiv:1807.04162  [pdf, other

    cs.LG cond-mat.stat-mech stat.ML

    TherML: Thermodynamics of Machine Learning

    Authors: Alexander A. Alemi, Ian Fischer

    Abstract: In this work we offer a framework for reasoning about a wide class of existing objectives in machine learning. We develop a formal correspondence between this work and thermodynamics and discuss its implications.

    Submitted 4 October, 2018; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: Presented at the ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models

  32. arXiv:1807.00906  [pdf, other

    cs.LG stat.ML

    Uncertainty in the Variational Information Bottleneck

    Authors: Alexander A. Alemi, Ian Fischer, Joshua V. Dillon

    Abstract: We present a simple case study, demonstrating that Variational Information Bottleneck (VIB) can improve a network's classification calibration as well as its ability to detect out-of-distribution data. Without explicitly being designed to do so, VIB gives two natural metrics for handling and quantifying uncertainty.

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: 10 pages, 7 figures. Accepted to UAI 2018 - Uncertainty in Deep Learning Workshop

  33. arXiv:1802.04874  [pdf, other

    stat.ML cs.LG

    GILBO: One Metric to Measure Them All

    Authors: Alexander A. Alemi, Ian Fischer

    Abstract: We propose a simple, tractable lower bound on the mutual information contained in the joint generative density of any latent variable generative model: the GILBO (Generative Information Lower BOund). It offers a data-independent measure of the complexity of the learned latent variable description, giving the log of the effective description length. It is well-defined for both VAEs and GANs. We com… ▽ More

    Submitted 10 January, 2019; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: Accepted at NeurIPS 2018

  34. arXiv:1711.05133  [pdf, other

    cs.NE physics.optics

    Reinforcement Learning in a large scale photonic Recurrent Neural Network

    Authors: Julian Bueno, Sheler Maktoobi, Luc Froehly, Ingo Fischer, Maxime Jacquot, Laurent Larger, Daniel Brunner

    Abstract: Photonic Neural Network implementations have been gaining considerable attention as a potentially disruptive future technology. Demonstrating learning in large scale neural networks is essential to establish photonic machine learning substrates as viable information processing systems. Realizing photonic Neural Networks with numerous nonlinear nodes in a fully parallel and efficient learning hardw… ▽ More

    Submitted 15 November, 2017; v1 submitted 14 November, 2017; originally announced November 2017.

    Journal ref: Optica Vol. 5, Issue 6, pp. 756-760 (2018)

  35. arXiv:1711.00464  [pdf, other

    cs.LG stat.ML

    Fixing a Broken ELBO

    Authors: Alexander A. Alemi, Ben Poole, Ian Fischer, Joshua V. Dillon, Rif A. Saurous, Kevin Murphy

    Abstract: Recent work in unsupervised representation learning has focused on learning deep directed latent-variable models. Fitting these models by maximizing the marginal likelihood or evidence is typically intractable, thus a common approximation is to maximize the evidence lower bound (ELBO) instead. However, maximum likelihood training (whether exact or approximate) does not necessarily result in a good… ▽ More

    Submitted 13 February, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 21 pages, 9 figures

  36. arXiv:1705.10762  [pdf, other

    cs.LG cs.CV stat.ML

    Generative Models of Visually Grounded Imagination

    Authors: Ramakrishna Vedantam, Ian Fischer, Jonathan Huang, Kevin Murphy

    Abstract: It is easy for people to imagine what a man with pink hair looks like, even if they have never seen such a person before. We call the ability to create images of novel semantic concepts visually grounded imagination. In this paper, we show how we can modify variational auto-encoders to perform this task. Our method uses a novel training objective, and a novel product-of-experts inference network,… ▽ More

    Submitted 9 November, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: International Conference on Learning Representations (ICLR), 2018

  37. arXiv:1703.09387  [pdf, other

    cs.NE cs.AI cs.CV

    Adversarial Transformation Networks: Learning to Generate Adversarial Examples

    Authors: Shumeet Baluja, Ian Fischer

    Abstract: Multiple different approaches of generating adversarial examples have been proposed to attack deep neural networks. These approaches involve either directly computing gradients with respect to the image pixels, or directly solving an optimization on the image pixels. In this work, we present a fundamentally new method for generating adversarial examples that is fast to execute and provides excepti… ▽ More

    Submitted 27 March, 2017; originally announced March 2017.

  38. arXiv:1702.06832  [pdf, other

    stat.ML cs.LG

    Adversarial examples for generative models

    Authors: Jernej Kos, Ian Fischer, Dawn Song

    Abstract: We explore methods of producing adversarial examples on deep generative models such as the variational autoencoder (VAE) and the VAE-GAN. Deep learning architectures are known to be vulnerable to adversarial examples, but previous work has focused on the application of adversarial examples to classification tasks. Deep generative models have recently become popular due to their ability to model in… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

  39. arXiv:1612.00410  [pdf, other

    cs.LG cs.IT

    Deep Variational Information Bottleneck

    Authors: Alexander A. Alemi, Ian Fischer, Joshua V. Dillon, Kevin Murphy

    Abstract: We present a variational approximation to the information bottleneck of Tishby et al. (1999). This variational approach allows us to parameterize the information bottleneck model using a neural network and leverage the reparameterization trick for efficient training. We call this method "Deep Variational Information Bottleneck", or Deep VIB. We show that models trained with the VIB objective outpe… ▽ More

    Submitted 23 October, 2019; v1 submitted 1 December, 2016; originally announced December 2016.

    Comments: 19 pages, 8 figures, Accepted to ICLR17

    Journal ref: Proceedings of the International Conference on Learning Representations (ICLR) 2017

  40. arXiv:1611.10012  [pdf, other

    cs.CV

    Speed/accuracy trade-offs for modern convolutional object detectors

    Authors: Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy

    Abstract: The goal of this paper is to serve as a guide for selecting a detection architecture that achieves the right speed/memory/accuracy balance for a given application and platform. To this end, we investigate various ways to trade accuracy for speed and memory usage in modern convolutional object detection systems. A number of successful systems have been proposed in recent years, but apples-to-apples… ▽ More

    Submitted 24 April, 2017; v1 submitted 30 November, 2016; originally announced November 2016.

    Comments: Accepted to CVPR 2017

  41. CONDENSE: A Reconfigurable Knowledge Acquisition Architecture for Future 5G IoT

    Authors: Dejan Vukobratovic, Dusan Jakovetic, Vitaly Skachek, Dragana Bajovic, Dino Sejdinovic, Gunes Karabulut Kurt, Camilla Hollanti, Ingo Fischer

    Abstract: In forthcoming years, the Internet of Things (IoT) will connect billions of smart devices generating and uploading a deluge of data to the cloud. If successfully extracted, the knowledge buried in the data can significantly improve the quality of life and foster economic growth. However, a critical bottleneck for realising the efficient IoT is the pressure it puts on the existing communication inf… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

    Comments: 17 pages, 7 figures in IEEE Access, Vol. 4, 2016

  42. arXiv:1501.02592  [pdf, other

    cs.NE cs.LG

    Photonic Delay Systems as Machine Learning Implementations

    Authors: Michiel Hermans, Miguel Soriano, Joni Dambre, Peter Bienstman, Ingo Fischer

    Abstract: Nonlinear photonic delay systems present interesting implementation platforms for machine learning models. They can be extremely fast, offer great degrees of parallelism and potentially consume far less power than digital processors. So far they have been successfully employed for signal processing using the Reservoir Computing paradigm. In this paper we show that their range of applicability can… ▽ More

    Submitted 12 January, 2015; originally announced January 2015.

    Journal ref: Journal of Machine Learning Research, vol. 16, pp. 2081-2097 (2015)

  43. Reservoir computing with a single time-delay autonomous Boolean node

    Authors: Nicholas D. Haynes, Miguel C. Soriano, David P. Rosin, Ingo Fischer, Daniel J. Gauthier

    Abstract: We demonstrate reservoir computing with a physical system using a single autonomous Boolean logic element with time-delay feedback. The system generates a chaotic transient with a window of consistency lasting between 30 and 300 ns, which we show is sufficient for reservoir computing. We then characterize the dependence of computational performance on system parameters to find the best operating p… ▽ More

    Submitted 30 January, 2015; v1 submitted 4 November, 2014; originally announced November 2014.

    Comments: 5 pages, 5 figures

    Journal ref: Physical Review E 91, 020801(R)(1-5) (2015)