Skip to main content

Showing 1–10 of 10 results for author: Issenhuth, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09570  [pdf, other

    cs.LG cs.AI cs.CV

    Improving Consistency Models with Generator-Augmented Flows

    Authors: Thibaut Issenhuth, Sangchul Lee, Ludovic Dos Santos, Jean-Yves Franceschi, Chansoo Kim, Alain Rakotomamonjy

    Abstract: Consistency models imitate the multi-step sampling of score-based diffusion in a single forward pass of a neural network. They can be learned in two ways: consistency distillation and consistency training. The former relies on the true velocity field of the corresponding differential equation, approximated by a pre-trained neural network. In contrast, the latter uses a single-sample Monte Carlo es… ▽ More

    Submitted 5 February, 2025; v1 submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2305.16150  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Unifying GANs and Score-Based Diffusion as Generative Particle Models

    Authors: Jean-Yves Franceschi, Mike Gartrell, Ludovic Dos Santos, Thibaut Issenhuth, Emmanuel de Bézenac, Mickaël Chen, Alain Rakotomamonjy

    Abstract: Particle-based deep generative models, such as gradient flows and score-based diffusion models, have recently gained traction thanks to their striking performance. Their principle of displacing particle distributions using differential equations is conventionally seen as opposed to the previously widespread generative adversarial networks (GANs), which involve training a pushforward generator netw… ▽ More

    Submitted 21 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, Dec. 2023, New Orleans, LA, USA

  3. arXiv:2207.10541  [pdf, other

    cs.LG cs.AI stat.ML

    Unveiling the Latent Space Geometry of Push-Forward Generative Models

    Authors: Thibaut Issenhuth, Ugo Tanielian, Jérémie Mary, David Picard

    Abstract: Many deep generative models are defined as a push-forward of a Gaussian measure by a continuous generator, such as Generative Adversarial Networks (GANs) or Variational Auto-Encoders (VAEs). This work explores the latent space of such deep generative models. A key issue with these models is their tendency to output samples outside of the support of the target distribution when learning disconnecte… ▽ More

    Submitted 15 May, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

  4. arXiv:2111.15264  [pdf, other

    cs.CV cs.LG

    EdiBERT, a generative model for image editing

    Authors: Thibaut Issenhuth, Ugo Tanielian, Jérémie Mary, David Picard

    Abstract: Advances in computer vision are pushing the limits of im-age manipulation, with generative models sampling detailed images on various tasks. However, a specialized model is often developed and trained for each specific task, even though many image edition tasks share similarities. In denoising, inpainting, or image compositing, one always aims at generating a realistic image from a low-quality one… ▽ More

    Submitted 21 July, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

  5. arXiv:2110.09803  [pdf, other

    cs.LG cs.CV

    Latent reweighting, an almost free improvement for GANs

    Authors: Thibaut Issenhuth, Ugo Tanielian, David Picard, Jeremie Mary

    Abstract: Standard formulations of GANs, where a continuous function deforms a connected latent space, have been shown to be misspecified when fitting different classes of images. In particular, the generator will necessarily sample some low-quality images in between the classes. Rather than modifying the architecture, a line of works aims at improving the sampling quality from pre-trained generators at the… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

  6. arXiv:2007.02721  [pdf, other

    cs.CV cs.LG eess.IV

    Do Not Mask What You Do Not Need to Mask: a Parser-Free Virtual Try-On

    Authors: Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes

    Abstract: The 2D virtual try-on task has recently attracted a great interest from the research community, for its direct potential applications in online shopping as well as for its inherent and non-addressed scientific challenges. This task requires fitting an in-shop cloth image on the image of a person, which is highly challenging because it involves cloth warping, image compositing, and synthesizing. Ca… ▽ More

    Submitted 29 July, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: Accepted at ECCV 2020. arXiv admin note: text overlap with arXiv:1906.01347

  7. arXiv:2006.04596  [pdf, other

    stat.ML cs.LG

    Learning disconnected manifolds: a no GANs land

    Authors: Ugo Tanielian, Thibaut Issenhuth, Elvis Dohmatob, Jeremie Mary

    Abstract: Typical architectures of Generative AdversarialNetworks make use of a unimodal latent distribution transformed by a continuous generator. Consequently, the modeled distribution always has connected support which is cumbersome when learning a disconnected set of manifolds. We formalize this problem by establishing a no free lunch theorem for the disconnected manifold learning stating an upper bound… ▽ More

    Submitted 10 December, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 24 pages

    Journal ref: PMLR 119:9418-9427, 2020

  8. arXiv:1906.01347  [pdf, other

    cs.CV

    End-to-End Learning of Geometric Deformations of Feature Maps for Virtual Try-On

    Authors: Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes

    Abstract: The 2D virtual try-on task has recently attracted a lot of interest from the research community, for its direct potential applications in online shopping as well as for its inherent and non-addressed scientific challenges. This task requires to fit an in-shop cloth image on the image of a person. It is highly challenging because it requires to warp the cloth on the target person while preserving i… ▽ More

    Submitted 10 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

  9. Face Detection in the Operating Room: Comparison of State-of-the-art Methods and a Self-supervised Approach

    Authors: Thibaut Issenhuth, Vinkle Srivastav, Afshin Gangi, Nicolas Padoy

    Abstract: Purpose: Face detection is a needed component for the automatic analysis and assistance of human activities during surgical procedures. Efficient face detection algorithms can indeed help to detect and identify the persons present in the room, and also be used to automatically anonymize the data. However, current algorithms trained on natural images do not generalize well to the operating room (OR… ▽ More

    Submitted 3 December, 2018; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: 13 pages

  10. arXiv:1808.08180  [pdf, other

    cs.CV

    MVOR: A Multi-view RGB-D Operating Room Dataset for 2D and 3D Human Pose Estimation

    Authors: Vinkle Srivastav, Thibaut Issenhuth, Abdolrahim Kadkhodamohammadi, Michel de Mathelin, Afshin Gangi, Nicolas Padoy

    Abstract: Person detection and pose estimation is a key requirement to develop intelligent context-aware assistance systems. To foster the development of human pose estimation methods and their applications in the Operating Room (OR), we release the Multi-View Operating Room (MVOR) dataset, the first public dataset recorded during real clinical interventions. It consists of 732 synchronized multi-view frame… ▽ More

    Submitted 20 August, 2021; v1 submitted 24 August, 2018; originally announced August 2018.

    Comments: Dataset and code is available at https://github.com/camma-public/mvor. The paper was presented in the MICCAI-LABELS 2018 (https://labels.tue-image.nl/previous-editions/labels-2018/)