Showing 1–9 of 9 results for author: Dauphin, Y

Searching in archive stat.

  1. arXiv:2503.17338  [pdf, other]

    cs.AI cs.LG stat.ML

    Capturing Individual Human Preferences with Reward Features

    Authors: André Barreto, Vincent Dumoulin, Yiran Mao, Nicolas Perez-Nieves, Bobak Shahriari, Yann Dauphin, Doina Precup, Hugo Larochelle

    Abstract: Reinforcement learning from human feedback usually models preferences using a reward model that does not distinguish between people. We argue that this is unlikely to be a good design choice in contexts with high potential for disagreement, like in the training of large language models. We propose a method to specialise a reward model to a person or group of people. Our approach builds on the obse…

    Submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2502.02407  [pdf, other]

    cs.LG cs.CL stat.ML

    Avoiding spurious sharpness minimization broadens applicability of SAM

    Authors: Sidak Pal Singh, Hossein Mobahi, Atish Agarwala, Yann Dauphin

    Abstract: Curvature regularization techniques like Sharpness Aware Minimization (SAM) have shown great promise in improving generalization on vision tasks. However, we find that SAM performs poorly in domains like natural language processing (NLP), often degrading performance -- even with twice the compute budget. We investigate the discrepancy across domains and find that in the NLP setting, SAM is dominat…

    Submitted 4 February, 2025; originally announced February 2025.

  3. arXiv:1911.05248  [pdf, other]

    cs.LG cs.AI cs.CV cs.HC stat.ML

    What Do Compressed Deep Neural Networks Forget?

    Authors: Sara Hooker, Aaron Courville, Gregory Clark, Yann Dauphin, Andrea Frome

    Abstract: Deep neural network pruning and quantization techniques have demonstrated it is possible to achieve high levels of compression with surprisingly little degradation to test set accuracy. However, this measure of performance conceals significant differences in how different classes and images are impacted by model compression techniques. We find that models with radically different numbers of weight…

    Submitted 5 September, 2021; v1 submitted 12 November, 2019; originally announced November 2019.

  4. arXiv:1903.05168  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    On the Pitfalls of Measuring Emergent Communication

    Authors: Ryan Lowe, Jakob Foerster, Y-Lan Boureau, Joelle Pineau, Yann Dauphin

    Abstract: How do we know if communication is emerging in a multi-agent system? The vast majority of recent papers on emergent communication show that adding a communication channel leads to an increase in reward or task success. This is a useful indicator, but provides only a coarse measure of the agent's learned communication abilities. As we move towards more complex environments, it becomes imperative to…

    Submitted 12 March, 2019; originally announced March 2019.

    Comments: AAMAS 2019. 13 pages

  5. arXiv:1901.09321  [pdf, other]

    cs.LG cs.CV stat.ML

    Fixup Initialization: Residual Learning Without Normalization

    Authors: Hongyi Zhang, Yann N. Dauphin, Tengyu Ma

    Abstract: Normalization layers are a staple in state-of-the-art deep neural network architectures. They are widely believed to stabilize training, enable higher learning rate, accelerate convergence and improve generalization, though the reason for their effectiveness is still an active research topic. In this work, we challenge the commonly-held beliefs by showing that none of the perceived benefits is uni…

    Submitted 11 March, 2019; v1 submitted 27 January, 2019; originally announced January 2019.

    Comments: Updating reference. Accepted for publication at ICLR 2019; see https://openreview.net/forum?id=H1gsz30cKX

  6. arXiv:1710.09412  [pdf, other]

    cs.LG stat.ML

    mixup: Beyond Empirical Risk Minimization

    Authors: Hongyi Zhang, Moustapha Cisse, Yann N. Dauphin, David Lopez-Paz

    Abstract: Large deep neural networks are powerful, but exhibit undesirable behaviors such as memorization and sensitivity to adversarial examples. In this work, we propose mixup, a simple learning principle to alleviate these issues. In essence, mixup trains a neural network on convex combinations of pairs of examples and their labels. By doing so, mixup regularizes the neural network to favor simple linear…

    Submitted 27 April, 2018; v1 submitted 25 October, 2017; originally announced October 2017.

    Comments: ICLR camera ready version. Changes vs V1: fix repo URL; add ablation studies; add mixup + dropout etc

  7. arXiv:1704.08847  [pdf, other]

    stat.ML cs.AI cs.CR cs.LG

    Parseval Networks: Improving Robustness to Adversarial Examples

    Authors: Moustapha Cisse, Piotr Bojanowski, Edouard Grave, Yann Dauphin, Nicolas Usunier

    Abstract: We introduce Parseval networks, a form of deep neural networks in which the Lipschitz constant of linear, convolutional and aggregation layers is constrained to be smaller than 1. Parseval networks are empirically and theoretically motivated by an analysis of the robustness of the predictions made by deep neural networks when their input is subject to an adversarial perturbation. The most importan…

    Submitted 1 May, 2017; v1 submitted 28 April, 2017; originally announced April 2017.

    Comments: submitted

  8. arXiv:1406.2572  [pdf, other]

    cs.LG math.OC stat.ML

    Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

    Authors: Yann Dauphin, Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, Surya Ganguli, Yoshua Bengio

    Abstract: A central challenge to many fields of science and engineering involves minimizing non-convex error functions over continuous, high dimensional spaces. Gradient descent or quasi-Newton methods are almost ubiquitously used to perform such minimizations, and it is often thought that a main source of difficulty for these local methods to find the global minimum is the proliferation of local minima wit…

    Submitted 10 June, 2014; originally announced June 2014.

    Comments: The theoretical review and analysis in this article draw heavily from arXiv:1405.4604 [cs.LG]

  9. arXiv:1206.6434  [pdf]

    cs.LG stat.ML

    A Generative Process for Sampling Contractive Auto-Encoders

    Authors: Salah Rifai, Yoshua Bengio, Yann Dauphin, Pascal Vincent

    Abstract: The contractive auto-encoder learns a representation of the input data that captures the local manifold structure around each data point, through the leading singular vectors of the Jacobian of the transformation from input to representation. The corresponding singular values specify how much local variation is plausible in directions associated with the corresponding singular vectors, while remai…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)
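
Note on result 6 (mixup): the abstract describes training on convex combinations of pairs of examples and their labels. A minimal sketch of that one step is given here, in plain Python on lists of floats. The function names `mixup_pair` and `sample_lambda` are hypothetical, and drawing the mixing weight from a Beta(alpha, alpha) distribution is an assumption not stated in the truncated abstract above.

```python
import random


def mixup_pair(x_i, y_i, x_j, y_j, lam):
    """Blend two examples and their one-hot labels with weight lam in [0, 1].

    Returns lam * (x_i, y_i) + (1 - lam) * (x_j, y_j), computed element-wise.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_i, x_j)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_i, y_j)]
    return x, y


def sample_lambda(alpha, rng=random):
    """Draw a mixing weight; Beta(alpha, alpha) is an assumed choice here."""
    return rng.betavariate(alpha, alpha)


if __name__ == "__main__":
    # Two toy examples with one-hot labels for a two-class problem.
    lam = sample_lambda(0.2)
    mixed_x, mixed_y = mixup_pair([0.0, 2.0], [1.0, 0.0],
                                  [4.0, 6.0], [0.0, 1.0], lam)
    print(lam, mixed_x, mixed_y)
```

Because the labels are mixed with the same weight as the inputs, the blended label stays a valid probability vector whenever both input labels are one-hot.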