Showing 1–4 of 4 results for author: Delmas, G

  1. arXiv:2409.06535

    cs.CV

    PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation

    Authors: Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez

    Abstract: Aligning multiple modalities in a latent space, such as images and texts, has shown to produce powerful semantic visual representations, fueling tasks like image captioning, text-to-image generation, or image grounding. In the context of human-centric vision, albeit CLIP-like representations encode most standard human poses relatively well (such as standing or sitting), they lack sufficient acuten…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: Published in ECCV 2024

  2. arXiv:2309.08480

    cs.CV

    PoseFix: Correcting 3D Human Poses with Natural Language

    Authors: Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez

    Abstract: Automatically producing instructions to modify one's posture could open the door to endless applications, such as personalized coaching and in-home physical therapy. Tackling the reverse problem (i.e., refining a 3D pose based on some natural language feedback) could help for assisted 3D character animation or robot teaching, for instance. Although a few recent works explore the connections betwee…

    Submitted 17 January, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Published in ICCV 2023

  3. PoseScript: Linking 3D Human Poses and Natural Language

    Authors: Ginger Delmas, Philippe Weinzaepfel, Thomas Lucas, Francesc Moreno-Noguer, Grégory Rogez

    Abstract: Natural language plays a critical role in many computer vision applications, such as image captioning, visual question answering, and cross-modal retrieval, to provide fine-grained semantic information. Unfortunately, while human pose is key to human understanding, current 3D human pose datasets lack detailed language descriptions. To address this issue, we have introduced the PoseScript dataset.…

    Submitted 10 September, 2024; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: TPAMI 2024, extended version of the ECCV 2022 paper

  4. arXiv:2203.08101

    cs.CV cs.IR

    ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity

    Authors: Ginger Delmas, Rafael Sampaio de Rezende, Gabriela Csurka, Diane Larlus

    Abstract: An intuitive way to search for images is to use queries composed of an example image and a complementary text. While the first provides rich and implicit context for the search, the latter explicitly calls for new traits, or specifies how some elements of the example image should be changed to retrieve the desired target image. Current approaches typically combine the features of each of the two e…

    Submitted 16 May, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Published in ICLR 2022