Skip to main content

Showing 1–16 of 16 results for author: Krueger, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.20432  [pdf, other

    cs.LG cs.CR stat.ML

    Integrating uncertainty quantification into randomized smoothing based robustness guarantees

    Authors: Sina Däubener, Kira Maag, David Krueger, Asja Fischer

    Abstract: Deep neural networks have proven to be extremely powerful, however, they are also vulnerable to adversarial attacks which can cause hazardous incorrect predictions in safety-critical applications. Certified robustness via randomized smoothing gives a probabilistic guarantee that the smoothed classifier's predictions will not change within an $\ell_2$-ball around a given input. On the other hand (u… ▽ More

    Submitted 27 October, 2024; originally announced October 2024.

  2. arXiv:2406.15753  [pdf, other

    cs.LG cs.AI stat.ML

    The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

    Authors: Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse

    Abstract: In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by learning the reward function. However, a learned reward model may have a low error on the data distribution, and yet subsequently produce a policy with large regret. We say that such a reward model has an error-regret mismatch. The main source… ▽ More

    Submitted 4 March, 2025; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: 70 pages, 4 figures

  3. arXiv:2211.14827  [pdf, other

    cs.LG cs.AI stat.ML

    Domain Generalization for Robust Model-Based Offline Reinforcement Learning

    Authors: Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger

    Abstract: Existing offline reinforcement learning (RL) algorithms typically assume that training data is either: 1) generated by a known policy, or 2) of entirely unknown origin. We consider multi-demonstrator offline RL, a middle ground where we know which demonstrators generated each dataset, but make no assumptions about the underlying policies of the demonstrators. This is the most natural setting when… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted to the NeurIPS 2022 Workshops on Distribution Shifts and Offline Reinforcement Learning

  4. arXiv:2209.13085  [pdf, other

    cs.LG stat.ML

    Defining and Characterizing Reward Hacking

    Authors: Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger

    Abstract: We provide the first formal definition of reward hacking, a phenomenon where optimizing an imperfect proxy reward function leads to poor performance according to the true reward function. We say that a proxy is unhackable if increasing the expected proxy return can never decrease the expected true return. Intuitively, it might be possible to create an unhackable proxy by leaving some terms out of… ▽ More

    Submitted 5 March, 2025; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: 23 pages; modified (fix typo in Figure 1, update link to code in Appendix, remove unrendered characters from arXiv abstract)

  5. arXiv:2011.06709  [pdf, other

    cs.LG cs.AI stat.ML

    Active Reinforcement Learning: Observing Rewards at a Cost

    Authors: David Krueger, Jan Leike, Owain Evans, John Salvatier

    Abstract: Active reinforcement learning (ARL) is a variant on reinforcement learning where the agent does not observe the reward unless it chooses to pay a query cost c > 0. The central question of ARL is how to quantify the long-term value of reward information. Even in multi-armed bandits, computing the value of this information is intractable and we have to rely on heuristics. We propose and evaluate sev… ▽ More

    Submitted 24 November, 2020; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: Originally appeared at the NeurIPS 2016 "Future of Interactive Learning Machines (FILM)" workshop

  6. arXiv:2009.09153  [pdf, other

    cs.LG cs.AI stat.ML

    Hidden Incentives for Auto-Induced Distributional Shift

    Authors: David Krueger, Tegan Maharaj, Jan Leike

    Abstract: Decisions made by machine learning systems have increasing influence on the world, yet it is common for machine learning algorithms to assume that no such influence exists. An example is the use of the i.i.d. assumption in content recommendation. In fact, the (choice of) content displayed can change users' perceptions and preferences, or even drive them away, causing a shift in the distribution of… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

  7. Stationary vine copula models for multivariate time series

    Authors: Thomas Nagler, Daniel Krüger, Aleksey Min

    Abstract: Multivariate time series exhibit two types of dependence: across variables and across time points. Vine copulas are graphical models for the dependence and can conveniently capture both types of dependence in the same model. We derive the maximal class of graph structures that guarantee stationarity under a natural and verifiable condition called translation invariance. We propose computationally… ▽ More

    Submitted 14 March, 2022; v1 submitted 13 August, 2020; originally announced August 2020.

    Journal ref: Journal of Econometrics, 227(2):305-324, 2022

  8. arXiv:2003.00688  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Out-of-Distribution Generalization via Risk Extrapolation (REx)

    Authors: David Krueger, Ethan Caballero, Joern-Henrik Jacobsen, Amy Zhang, Jonathan Binas, Dinghuai Zhang, Remi Le Priol, Aaron Courville

    Abstract: Distributional shift is one of the major obstacles when transferring machine learning prediction systems from the lab to the real world. To tackle this problem, we assume that variation across training domains is representative of the variation we might encounter at test time, but also that shifts at test time may be more extreme in magnitude. In particular, we show that reducing differences in ri… ▽ More

    Submitted 25 February, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

  9. arXiv:1811.07871  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Scalable agent alignment via reward modeling: a research direction

    Authors: Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg

    Abstract: One obstacle to applying reinforcement learning algorithms to real-world problems is the lack of suitable reward functions. Designing such reward functions is difficult in part because the user only has an implicit understanding of the task objective. This gives rise to the agent alignment problem: how do we create agents that behave in accordance with the user's intentions? We outline a high-leve… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

  10. arXiv:1806.07528  [pdf, other

    stat.ML cs.LG

    Uncertainty in Multitask Transfer Learning

    Authors: Alexandre Lacoste, Boris Oreshkin, Wonchang Chung, Thomas Boquet, Negar Rostamzadeh, David Krueger

    Abstract: Using variational Bayes neural networks, we develop an algorithm capable of accumulating knowledge into a prior from multiple different tasks. The result is a rich and meaningful prior capable of few-shot learning on new tasks. The posterior can go beyond the mean field approximation and yields good uncertainty on the performed experiments. Analysis on toy tasks shows that it can learn from signif… ▽ More

    Submitted 6 July, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

  11. arXiv:1804.00779  [pdf, other

    cs.LG stat.ML

    Neural Autoregressive Flows

    Authors: Chin-Wei Huang, David Krueger, Alexandre Lacoste, Aaron Courville

    Abstract: Normalizing flows and autoregressive models have been successfully combined to produce state-of-the-art results in density estimation, via Masked Autoregressive Flows (MAF), and to accelerate state-of-the-art WaveNet-based speech synthesis to 20x faster than real-time, via Inverse Autoregressive Flows (IAF). We unify and generalize these approaches, replacing the (conditionally) affine univariate… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

    Comments: 16 pages, 10 figures, 3 tables

  12. arXiv:1712.05016  [pdf, other

    stat.ML cs.LG

    Deep Prior

    Authors: Alexandre Lacoste, Thomas Boquet, Negar Rostamzadeh, Boris Oreshkin, Wonchang Chung, David Krueger

    Abstract: The recent literature on deep learning offers new tools to learn a rich probability distribution over high dimensional data such as images or sounds. In this work we investigate the possibility of learning the prior distribution over neural network parameters using such tools. Our resulting variational Bayes algorithm generalizes well to new tasks, even when very few training examples are provided… ▽ More

    Submitted 15 December, 2017; v1 submitted 13 December, 2017; originally announced December 2017.

    Comments: Workshop paper, Accepted at Bayesian Deep Learning workshop, NIPS 2017

  13. arXiv:1710.04759  [pdf, other

    stat.ML cs.AI cs.LG

    Bayesian Hypernetworks

    Authors: David Krueger, Chin-Wei Huang, Riashat Islam, Ryan Turner, Alexandre Lacoste, Aaron Courville

    Abstract: We study Bayesian hypernetworks: a framework for approximate Bayesian inference in neural networks. A Bayesian hypernetwork $\h$ is a neural network which learns to transform a simple noise distribution, $p(\vecε) = \N(\vec 0,\mat I)$, to a distribution $q(\pp) := q(h(\vecε))$ over the parameters $\pp$ of another neural network (the "primary network")\@. We train $q$ with variational inference, us… ▽ More

    Submitted 24 April, 2018; v1 submitted 12 October, 2017; originally announced October 2017.

    Comments: David Krueger and Chin-Wei Huang contributed equally

  14. arXiv:1706.05394  [pdf, other

    stat.ML cs.LG

    A Closer Look at Memorization in Deep Networks

    Authors: Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron Courville, Yoshua Bengio, Simon Lacoste-Julien

    Abstract: We examine the role of memorization in deep learning, drawing connections to capacity, generalization, and adversarial robustness. While deep networks are capable of memorizing noise data, our results suggest that they tend to prioritize learning simple patterns first. In our experiments, we expose qualitative differences in gradient-based optimization of deep neural networks (DNNs) on noise vs. r… ▽ More

    Submitted 1 July, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

    Comments: Appears in Proceedings of the 34th International Conference on Machine Learning (ICML 2017), Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, and David Krueger contributed equally to this work

  15. arXiv:1511.08400  [pdf, other

    cs.NE cs.CL cs.LG stat.ML

    Regularizing RNNs by Stabilizing Activations

    Authors: David Krueger, Roland Memisevic

    Abstract: We stabilize the activations of Recurrent Neural Networks (RNNs) by penalizing the squared distance between successive hidden states' norms. This penalty term is an effective regularizer for RNNs including LSTMs and IRNNs, improving performance on character-level language modeling and phoneme recognition, and outperforming weight noise and dropout. We achieve competitive performance (18.6\% PE… ▽ More

    Submitted 26 April, 2016; v1 submitted 26 November, 2015; originally announced November 2015.

  16. arXiv:1402.3337  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Zero-bias autoencoders and the benefits of co-adapting features

    Authors: Kishore Konda, Roland Memisevic, David Krueger

    Abstract: Regularized training of an autoencoder typically results in hidden unit biases that take on large negative values. We show that negative biases are a natural result of using a hidden layer whose responsibility is to both represent the input data and act as a selection mechanism that ensures sparsity of the representation. We then show that negative biases impede the learning of data distributions… ▽ More

    Submitted 8 April, 2015; v1 submitted 13 February, 2014; originally announced February 2014.