Showing 1–25 of 25 results for author: Béthune, L

  1. arXiv:2505.18230  [pdf, other]

    cs.LG cs.AI

    Follow the Energy, Find the Path: Riemannian Metrics from Energy-Based Models

    Authors: Louis Béthune, David Vigouroux, Yilun Du, Rufin VanRullen, Thomas Serre, Victor Boutin

    Abstract: What is the shortest path between two data points lying in a high-dimensional space? While the answer is trivial in Euclidean geometry, it becomes significantly more complex when the data lies on a curved manifold -- requiring a Riemannian metric to describe the space's local curvature. Estimating such a metric, however, remains a major challenge in high dimensions. In this work, we propose a me…

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2504.07151  [pdf, ps, other]

    cs.LG

    Deep Sturm--Liouville: From Sample-Based to 1D Regularization with Learnable Orthogonal Basis Functions

    Authors: David Vigouroux, Joseba Dalmau, Louis Béthune, Victor Boutin

    Abstract: Although Artificial Neural Networks (ANNs) have achieved remarkable success across various tasks, they still suffer from limited generalization. We hypothesize that this limitation arises from the traditional sample-based (0-dimensional) regularization used in ANNs. To overcome this, we introduce Deep Sturm--Liouville (DSL), a novel function approximator that enables continuous 1D regul…

    Submitted 9 April, 2025; originally announced April 2025.

  3. arXiv:2503.10576  [pdf, other]

    stat.ML cs.LG

    Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures

    Authors: Nina Vesseron, Louis Béthune, Marco Cuturi

    Abstract: The canonical approach in generative modeling is to split model fitting into two blocks: define first how to sample noise (e.g. Gaussian) and choose next what to do with it (e.g. using a single map or flows). We explore in this work an alternative route that ties sampling and mapping. We find inspiration in moment measures, a result that states that for any measure $ρ$, there exists a unique conve…

    Submitted 26 May, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  4. arXiv:2502.12786  [pdf, other]

    stat.ML cs.LG

    Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo

    Authors: James Thornton, Louis Bethune, Ruixiang Zhang, Arwen Bradley, Preetum Nakkiran, Shuangfei Zhai

    Abstract: Diffusion models may be formulated as a time-indexed sequence of energy-based models, where the score corresponds to the negative gradient of an energy function. As opposed to learning the score directly, an energy parameterization is attractive as the energy itself can be used to control generation via Monte Carlo samplers. Architectural constraints and training instability in energy parameterize…

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: Initial submission to openreview on October 3, 2024 (https://openreview.net/forum?id=6GyX0YRw8P); accepted to AISTATS 2025

  5. arXiv:2502.06042  [pdf, other]

    cs.LG cs.CL

    Scaling Laws for Forgetting during Finetuning with Pretraining Data Injection

    Authors: Louis Bethune, David Grangier, Dan Busbridge, Eleonora Gualdoni, Marco Cuturi, Pierre Ablin

    Abstract: A widespread strategy to obtain a language model that performs well on a target domain is to finetune a pretrained model to perform unsupervised next-token prediction on data from that target domain. Finetuning presents two challenges: (i) if the amount of target data is limited, as in most practical applications, the model will quickly overfit, and (ii) the model will drift away from the original…

    Submitted 26 May, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: 19 pages, 15 figures, preprint

  6. arXiv:2502.03609  [pdf, other]

    stat.ML cs.LG

    Multivariate Conformal Prediction using Optimal Transport

    Authors: Michal Klein, Louis Bethune, Eugene Ndiaye, Marco Cuturi

    Abstract: Conformal prediction (CP) quantifies the uncertainty of machine learning models by constructing sets of plausible outputs. These sets are constructed by leveraging a so-called conformity score, a quantity computed using the input point of interest, a prediction model, and past observations. CP sets are then obtained by evaluating the conformity score of all possible outputs, and selecting them acc…

    Submitted 5 February, 2025; originally announced February 2025.

  7. arXiv:2411.14402  [pdf, other]

    cs.CV cs.LG

    Multimodal Autoregressive Pre-training of Large Vision Encoders

    Authors: Enrico Fini, Mustafa Shukor, Xiujun Li, Philipp Dufter, Michal Klein, David Haldimann, Sai Aitharaju, Victor Guilherme Turrisi da Costa, Louis Béthune, Zhe Gan, Alexander T Toshev, Marcin Eichner, Moin Nabi, Yinfei Yang, Joshua M. Susskind, Alaaeldin El-Nouby

    Abstract: We introduce a novel method for pre-training of large-scale vision encoders. Building on recent advancements in autoregressive pre-training of vision models, we extend this framework to a multimodal setting, i.e., images and text. In this paper, we present AIMV2, a family of generalist vision encoders characterized by a straightforward pre-training process, scalability, and remarkable performance…

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: https://github.com/apple/ml-aim

  8. arXiv:2410.06025  [pdf, other]

    cs.CV cs.LG stat.ML

    Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

    Authors: Michael Kirchhof, James Thornton, Louis Béthune, Pierre Ablin, Eugene Ndiaye, Marco Cuturi

    Abstract: The adoption of text-to-image diffusion models raises concerns over reliability, drawing scrutiny under the lens of various metrics like calibration, fairness, or compute efficiency. We focus in this work on two issues that arise when deploying these models: a lack of diversity when prompting images, and a tendency to recreate images from the training set. To solve both problems, we propose a meth…

    Submitted 28 May, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

    Comments: Accepted at ICML 2025

  9. arXiv:2407.09505  [pdf, other]

    cs.CV cs.AI cs.GR

    1-Lipschitz Neural Distance Fields

    Authors: Guillaume Coiffier, Louis Bethune

    Abstract: Neural implicit surfaces are a promising tool for geometry processing that represent a solid object as the zero level set of a neural network. Usually trained to approximate a signed distance function of the considered object, these methods exhibit great visual fidelity and quality near the surface, yet their properties tend to degrade with distance, making geometrical queries hard to perform with…

    Submitted 14 June, 2024; originally announced July 2024.

    Comments: 17 pages, 19 figures

  10. arXiv:2407.06723  [pdf, other]

    cs.CV cs.AI cs.LG

    Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

    Authors: Yu-Guan Hsieh, Cheng-Yu Hsieh, Shih-Ying Yeh, Louis Béthune, Hadi Pour Ansari, Pavan Kumar Anasosalu Vasu, Chun-Liang Li, Ranjay Krishna, Oncel Tuzel, Marco Cuturi

    Abstract: Humans describe complex scenes with compositionality, using simple text descriptions enriched with links and relationships. While vision-language research has aimed to develop models with compositional understanding capabilities, this is not reflected yet in existing datasets which, for the most part, still use plain text to describe images. In this work, we propose a new annotation strategy, grap…

    Submitted 26 February, 2025; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 59 pages, 42 figures

  11. arXiv:2407.06076  [pdf, other]

    cs.CV cs.AI

    Understanding Visual Feature Reliance through the Lens of Complexity

    Authors: Thomas Fel, Louis Bethune, Andrew Kyle Lampinen, Thomas Serre, Katherine Hermann

    Abstract: Recent studies suggest that deep learning models' inductive bias towards favoring simpler features may be one of the sources of shortcut learning. Yet, there has been limited focus on understanding the complexity of the myriad features that models learn. In this work, we introduce a new metric for quantifying feature complexity, based on $\mathscr{V}$-information and capturing whether a feature req…

    Submitted 28 October, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Journal ref: Conference on Neural Information Processing Systems (NeurIPS), Dec 2024

  12. arXiv:2312.06499  [pdf, other]

    cs.CL stat.ML

    TaCo: Targeted Concept Erasure Prevents Non-Linear Classifiers From Detecting Protected Attributes

    Authors: Fanny Jourdan, Louis Béthune, Agustin Picard, Laurent Risser, Nicholas Asher

    Abstract: Ensuring fairness in NLP models is crucial, as they often encode sensitive attributes like gender and ethnicity, leading to biased outcomes. Current concept erasure methods attempt to mitigate this by modifying final latent representations to remove sensitive information without retraining the entire model. However, these methods typically rely on linear classifiers, which leave models vulnerable…

    Submitted 16 October, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  13. arXiv:2306.07304  [pdf, other]

    cs.LG cs.AI

    A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation

    Authors: Thomas Fel, Victor Boutin, Mazda Moayeri, Rémi Cadène, Louis Bethune, Léo Andéol, Mathieu Chalvidal, Thomas Serre

    Abstract: In recent years, concept-based approaches have emerged as some of the most promising explainability methods to help us interpret the decisions of Artificial Neural Networks (ANNs). These methods seek to discover intelligible visual 'concepts' buried within the complex patterns of ANN activations in two key steps: (1) concept extraction followed by (2) importance estimation. While these two steps a…

    Submitted 29 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Journal ref: Conference on Neural Information Processing Systems (NeurIPS), 2023

  14. arXiv:2305.16202  [pdf, other]

    cs.LG cs.CR

    DP-SGD Without Clipping: The Lipschitz Neural Network Way

    Authors: Louis Bethune, Thomas Massena, Thibaut Boissin, Yannick Prudent, Corentin Friedrich, Franck Mamalet, Aurelien Bellet, Mathieu Serrurier, David Vigouroux

    Abstract: State-of-the-art approaches for training Differentially Private (DP) Deep Neural Networks (DNN) struggle to estimate tight bounds on the sensitivity of the network's layers, and instead rely on a process of per-sample gradient clipping. This clipping process not only biases the direction of gradients but also proves costly both in memory consumption and in computation. To provide sensitiv…

    Submitted 22 February, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 46 pages, published at International Conference on Learning Representations (ICLR), 2024

  15. arXiv:2303.01978  [pdf, other]

    cs.LG

    Robust One-Class Classification with Signed Distance Function using 1-Lipschitz Neural Networks

    Authors: Louis Bethune, Paul Novello, Thibaut Boissin, Guillaume Coiffier, Mathieu Serrurier, Quentin Vincenot, Andres Troya-Galvis

    Abstract: We propose a new method, dubbed One Class Signed Distance Function (OCSDF), to perform One Class Classification (OCC) by provably learning the Signed Distance Function (SDF) to the boundary of the support of any distribution. The distance to the support can be interpreted as a normality score, and its approximation using 1-Lipschitz neural networks provides robustness bounds against $\ell_2$ adversari…

    Submitted 1 April, 2024; v1 submitted 26 January, 2023; originally announced March 2023.

    Comments: 27 pages, 11 figures, International Conference on Machine Learning (ICML 2023)

  16. arXiv:2211.10154  [pdf, other]

    cs.CV cs.AI

    CRAFT: Concept Recursive Activation FacTorization for Explainability

    Authors: Thomas Fel, Agustin Picard, Louis Bethune, Thibaut Boissin, David Vigouroux, Julien Colin, Rémi Cadène, Thomas Serre

    Abstract: Attribution methods, which employ heatmaps to identify the most influential regions of an image that impact model decisions, have gained widespread popularity as a type of explainability method. However, recent research has exposed the limited practical value of these methods, attributed in part to their narrow focus on the most prominent regions of an image -- revealing "where" the model looks, b…

    Submitted 28 March, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Journal ref: Proceedings of the IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023

  17. arXiv:2210.06784  [pdf, other]

    cs.ET cs.LG quant-ph

    Efficient circuit implementation for coined quantum walks on binary trees and application to reinforcement learning

    Authors: Thomas Mullor, David Vigouroux, Louis Bethune

    Abstract: Quantum walks on binary trees are used in many quantum algorithms to achieve significant speedups over classical algorithms. Formulating this kind of algorithm as a quantum circuit has the advantage of being easily readable, executable on circuit-based quantum computers and simulators, and optimal in its use of resources. We propose a strategy to compose a quantum circuit that performs quan…

    Submitted 14 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: ACM/IEEE International Workshop on Quantum Computing, Dec 2022, Seattle, United States

  18. arXiv:2210.06574  [pdf, other]

    stat.ML cs.LG

    Gaussian Processes on Distributions based on Regularized Optimal Transport

    Authors: François Bachoc, Louis Béthune, Alberto Gonzalez-Sanz, Jean-Michel Loubes

    Abstract: We present a novel kernel over the space of probability measures based on the dual formulation of optimal regularized transport. We propose a Hilbertian embedding of the space of probabilities using their Sinkhorn potentials, which are solutions of the dual entropic relaxed optimal transport between the probabilities and a reference measure $\mathcal{U}$. We prove that this construction enables t…

    Submitted 12 October, 2022; originally announced October 2022.

  19. arXiv:2206.06854  [pdf, other]

    cs.AI cs.CR cs.CV cs.LG stat.ML

    On the explainable properties of 1-Lipschitz Neural Networks: An Optimal Transport Perspective

    Authors: Mathieu Serrurier, Franck Mamalet, Thomas Fel, Louis Béthune, Thibaut Boissin

    Abstract: Input gradients have a pivotal role in a variety of applications, including adversarial attack algorithms for evaluating model robustness, explainable AI techniques for generating Saliency Maps, and counterfactual explanations. However, Saliency Maps generated by traditional neural networks are often noisy and provide limited insights. In this paper, we demonstrate that, on the contrary, the Salien…

    Submitted 2 February, 2024; v1 submitted 14 June, 2022; originally announced June 2022.

    Journal ref: Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, Dec 2023, New Orleans (Louisiana), United States

  20. arXiv:2206.04394  [pdf, other]

    cs.LG cs.AI

    Xplique: A Deep Learning Explainability Toolbox

    Authors: Thomas Fel, Lucas Hervier, David Vigouroux, Antonin Poche, Justin Plakoo, Remi Cadene, Mathieu Chalvidal, Julien Colin, Thibaut Boissin, Louis Bethune, Agustin Picard, Claire Nicodeme, Laurent Gardes, Gregory Flandin, Thomas Serre

    Abstract: Today's most advanced machine-learning models are hardly scrutable. The key challenge for explainability methods is to assist researchers in opening up these black boxes, by revealing the strategy that led to a given decision, by characterizing their internal states or by studying the underlying data representation. To address this challenge, we have developed Xplique: a software library f…

    Submitted 9 June, 2022; originally announced June 2022.

  21. arXiv:2202.07965  [pdf, other]

    stat.ML cs.LG

    GAN Estimation of Lipschitz Optimal Transport Maps

    Authors: Alberto González-Sanz, Lucas de Lara, Louis Béthune, Jean-Michel Loubes

    Abstract: This paper introduces the first statistically consistent estimator of the optimal transport map between two probability distributions, based on neural networks. Building on theoretical and practical advances in the field of Lipschitz neural networks, we define a Lipschitz-constrained generative adversarial network penalized by the quadratic transportation cost. Then, we demonstrate that, under reg…

    Submitted 16 February, 2022; originally announced February 2022.

  22. arXiv:2104.05097  [pdf, other]

    cs.LG cs.AI stat.ML

    Pay attention to your loss: understanding misconceptions about 1-Lipschitz neural networks

    Authors: Louis Béthune, Thibaut Boissin, Mathieu Serrurier, Franck Mamalet, Corentin Friedrich, Alberto González-Sanz

    Abstract: Lipschitz constrained networks have gathered considerable attention in the deep learning community, with usages ranging from Wasserstein distance estimation to the training of certifiably robust classifiers. However, they remain commonly considered as less accurate, and their properties in learning are still not fully understood. In this paper we clarify the matter: when it comes to classification…

    Submitted 17 October, 2022; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: 36 pages, 17 figures, NeurIPS 2022

  23. arXiv:2011.12737  [pdf, ps, other]

    cs.LG

    Ranking Deep Learning Generalization using Label Variation in Latent Geometry Graphs

    Authors: Carlos Lassance, Louis Béthune, Myriam Bontonou, Mounia Hamidouche, Vincent Gripon

    Abstract: Measuring the generalization performance of a Deep Neural Network (DNN) without relying on a validation set is a difficult task. In this work, we propose exploiting Latent Geometry Graphs (LGGs) to represent the latent spaces of trained DNN architectures. Such graphs are obtained by connecting samples that yield similar latent representations at a given layer of the considered DNN. We then obtain…

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: Short paper describing the submission that placed 3rd in the NeurIPS 2020 Predicting Generalization in Deep Learning (PGDL) competition. We hope to update this with more analysis when the full data is made available

  24. arXiv:2007.04238  [pdf, other]

    cs.LG cs.CV stat.ML

    Predicting the Accuracy of a Few-Shot Classifier

    Authors: Myriam Bontonou, Louis Béthune, Vincent Gripon

    Abstract: In the context of few-shot learning, one cannot measure the generalization ability of a trained classifier using validation sets, due to the small number of labeled samples. In this paper, we are interested in finding alternatives to answer the question: is my classifier generalizing well to previously unseen data? We first analyze the reasons for the variability of generalization performances. We…

    Submitted 8 July, 2020; originally announced July 2020.

  25. arXiv:2007.03373  [pdf, other]

    cs.LG cs.CV stat.ML

    Hierarchical and Unsupervised Graph Representation Learning with Loukas's Coarsening

    Authors: Louis Béthune, Yacouba Kaloga, Pierre Borgnat, Aurélien Garivier, Amaury Habrard

    Abstract: We propose a novel algorithm for unsupervised graph representation learning with attributed graphs. It combines three advantages addressing some current limitations of the literature: i) The model is inductive: it can embed new graphs without re-training in the presence of new data; ii) The method takes into account both micro-structures and macro-structures by looking at the attributed graphs at…

    Submitted 17 August, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: 19 pages, 15 figures, submitted