Skip to main content

Showing 1–6 of 6 results for author: Cisse, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2006.13485  [pdf, other

    cs.LG stat.ML

    Fairness with Overlapping Groups

    Authors: Forest Yang, Moustapha Cisse, Sanmi Koyejo

    Abstract: In algorithmically fair prediction problems, a standard goal is to ensure the equality of fairness metrics across multiple overlapping groups simultaneously. We reconsider this standard fair classification problem using a probabilistic population analysis, which, in turn, reveals the Bayes-optimal classifier. Our approach unifies a variety of existing group-fair classification methods and enables… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

  2. arXiv:2006.06049  [pdf, other

    cs.LG stat.ML

    On Mixup Regularization

    Authors: Luigi Carratino, Moustapha Cissé, Rodolphe Jenatton, Jean-Philippe Vert

    Abstract: Mixup is a data augmentation technique that creates new examples as convex combinations of training points and labels. This simple technique has empirically shown to improve the accuracy of many state-of-the-art models in different settings and applications, but the reasons behind this empirical success remain poorly understood. In this paper we take a substantial step in explaining the theoretica… ▽ More

    Submitted 17 October, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

  3. arXiv:1711.11443  [pdf, other

    cs.LG cs.AI cs.CV cs.CY stat.ML

    ConvNets and ImageNet Beyond Accuracy: Understanding Mistakes and Uncovering Biases

    Authors: Pierre Stock, Moustapha Cisse

    Abstract: ConvNets and Imagenet have driven the recent success of deep learning for image classification. However, the marked slowdown in performance improvement combined with the lack of robustness of neural networks to adversarial examples and their tendency to exhibit undesirable biases question the reliability of these methods. This work investigates these questions from the perspective of the end-user… ▽ More

    Submitted 20 July, 2018; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: ECCV 2018 camera-ready

  4. arXiv:1710.09412  [pdf, other

    cs.LG stat.ML

    mixup: Beyond Empirical Risk Minimization

    Authors: Hongyi Zhang, Moustapha Cisse, Yann N. Dauphin, David Lopez-Paz

    Abstract: Large deep neural networks are powerful, but exhibit undesirable behaviors such as memorization and sensitivity to adversarial examples. In this work, we propose mixup, a simple learning principle to alleviate these issues. In essence, mixup trains a neural network on convex combinations of pairs of examples and their labels. By doing so, mixup regularizes the neural network to favor simple linear… ▽ More

    Submitted 27 April, 2018; v1 submitted 25 October, 2017; originally announced October 2017.

    Comments: ICLR camera ready version. Changes vs V1: fix repo URL; add ablation studies; add mixup + dropout etc

  5. arXiv:1707.05373  [pdf, other

    stat.ML cs.AI cs.CR cs.CV cs.LG

    Houdini: Fooling Deep Structured Prediction Models

    Authors: Moustapha Cisse, Yossi Adi, Natalia Neverova, Joseph Keshet

    Abstract: Generating adversarial examples is a critical step for evaluating and improving the robustness of learning machines. So far, most existing methods only work for classification and are not designed to alter the true performance measure of the problem at hand. We introduce a novel flexible approach named Houdini for generating adversarial examples specifically tailored for the final performance meas… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: 12 pages, 8 figures, under review

  6. arXiv:1704.08847  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Parseval Networks: Improving Robustness to Adversarial Examples

    Authors: Moustapha Cisse, Piotr Bojanowski, Edouard Grave, Yann Dauphin, Nicolas Usunier

    Abstract: We introduce Parseval networks, a form of deep neural networks in which the Lipschitz constant of linear, convolutional and aggregation layers is constrained to be smaller than 1. Parseval networks are empirically and theoretically motivated by an analysis of the robustness of the predictions made by deep neural networks when their input is subject to an adversarial perturbation. The most importan… ▽ More

    Submitted 1 May, 2017; v1 submitted 28 April, 2017; originally announced April 2017.

    Comments: submitted