Skip to main content

Showing 1–8 of 8 results for author: Bouchacourt, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2311.08815  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To deal with this, current approaches try to retain some style information by tuning the d… ▽ More

    Submitted 20 August, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  2. arXiv:2309.16748  [pdf, other

    cs.LG cs.AI stat.ML

    Discovering environments with XRM

    Authors: Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz

    Abstract: Environment annotations are essential for the success of many out-of-distribution (OOD) generalization methods. Unfortunately, these are costly to obtain and often limited by human annotators' biases. To achieve robust generalization, it is essential to develop algorithms for automatic environment discovery within datasets. Current proposals, which divide examples based on their training error, su… ▽ More

    Submitted 19 July, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Oral at ICML 2024

  3. arXiv:2306.00802  [pdf, other

    stat.ML cs.CL cs.LG

    Birth of a Transformer: A Memory Viewpoint

    Authors: Alberto Bietti, Vivien Cabannes, Diane Bouchacourt, Herve Jegou, Leon Bottou

    Abstract: Large language models based on transformers have achieved great empirical successes. However, as they are deployed more widely, there is a growing need to better understand their internal mechanisms in order to make them more reliable. These models appear to store vast amounts of knowledge from their training data, and to adapt quickly to new information provided in their context or prompt. We stu… ▽ More

    Submitted 6 November, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  4. arXiv:2210.07347  [pdf, other

    cs.LG stat.ML

    Disentanglement of Correlated Factors via Hausdorff Factorized Support

    Authors: Karsten Roth, Mark Ibrahim, Zeynep Akata, Pascal Vincent, Diane Bouchacourt

    Abstract: A grand goal in deep learning research is to learn representations capable of generalizing across distribution shifts. Disentanglement is one promising direction aimed at aligning a model's representation with the underlying factors generating the data (e.g. color or background). Existing disentanglement methods, however, rely on an often unrealistic assumption: that factors are statistically inde… ▽ More

    Submitted 25 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  5. arXiv:2207.09960  [pdf, other

    stat.ML cs.CY cs.LG

    Measuring and signing fairness as performance under multiple stakeholder distributions

    Authors: David Lopez-Paz, Diane Bouchacourt, Levent Sagun, Nicolas Usunier

    Abstract: As learning machines increase their influence on decisions concerning human lives, analyzing their fairness properties becomes a subject of central importance. Yet, our best tools for measuring the fairness of learning systems are rigid fairness metrics encapsulated as mathematical one-liners, offer limited power to the stakeholders involved in the prediction task, and are easy to manipulate when… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

  6. arXiv:2009.13962  [pdf, other

    cs.LG stat.ML

    Think before you act: A simple baseline for compositional generalization

    Authors: Christina Heinze-Deml, Diane Bouchacourt

    Abstract: Contrarily to humans who have the ability to recombine familiar expressions to create novel ones, modern neural networks struggle to do so. This has been emphasized recently with the introduction of the benchmark dataset "gSCAN" (Ruis et al. 2020), aiming to evaluate models' performance at compositional generalization in grounded language understanding. In this work, we challenge the gSCAN benchma… ▽ More

    Submitted 1 October, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

  7. arXiv:1905.11852  [pdf, other

    cs.LG stat.ML

    EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction

    Authors: Diane Bouchacourt, Ludovic Denoyer

    Abstract: Providing explanations along with predictions is crucial in some text processing tasks. Therefore, we propose a new self-interpretable model that performs output prediction and simultaneously provides an explanation in terms of the presence of particular concepts in the input. To do so, our model's prediction relies solely on a low-dimensional binary representation of the input, where each feature… ▽ More

    Submitted 27 September, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

  8. arXiv:1705.08841  [pdf, other

    cs.LG stat.ML

    Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations

    Authors: Diane Bouchacourt, Ryota Tomioka, Sebastian Nowozin

    Abstract: We would like to learn a representation of the data which decomposes an observation into factors of variation which we can independently control. Specifically, we want to use minimal supervision to learn a latent representation that reflects the semantics behind a specific grouping of the data, where within a group the samples share a common factor of variation. For example, consider a collection… ▽ More

    Submitted 24 May, 2017; originally announced May 2017.