Skip to main content

Showing 1–10 of 10 results for author: Bourmaud, G

.
  1. arXiv:2503.07561  [pdf, other

    cs.CV

    Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression

    Authors: Thibaut Loiseau, Guillaume Bourmaud, Vincent Lepetit

    Abstract: Pre-training techniques have greatly advanced computer vision, with CroCo's cross-view completion approach yielding impressive results in tasks like 3D reconstruction and pose regression. However, this method requires substantial overlap between training pairs, limiting its effectiveness. We introduce Alligat0R, a novel pre-training approach that reformulates cross-view learning as a co-visibility… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  2. arXiv:2502.19955  [pdf, other

    cs.CV

    RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges

    Authors: Thibaut Loiseau, Guillaume Bourmaud

    Abstract: Camera pose estimation is crucial for many computer vision applications, yet existing benchmarks offer limited insight into method limitations across different geometric challenges. We introduce RUBIK, a novel benchmark that systematically evaluates image matching methods across well-defined geometric difficulty levels. Using three complementary criteria - overlap, scale ratio, and viewpoint angle… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  3. arXiv:2410.21301  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Evaluating the Posterior Sampling Ability of Plug&Play Diffusion Methods in Sparse-View CT

    Authors: Liam Moroy, Guillaume Bourmaud, Frédéric Champagnat, Jean-François Giovannelli

    Abstract: Plug&Play (PnP) diffusion models are state-of-the-art methods in computed tomography (CT) reconstruction. Such methods usually consider applications where the sinogram contains a sufficient amount of information for the posterior distribution to be concentrated around a single mode, and consequently are evaluated using image-to-image metrics such as PSNR/SSIM. Instead, we are interested in reconst… ▽ More

    Submitted 18 March, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

  4. arXiv:2402.08671  [pdf, other

    cs.CV cs.AI

    Are Semi-Dense Detector-Free Methods Good at Matching Local Features?

    Authors: Matthieu Vilain, Rémi Giraud, Hugo Germain, Guillaume Bourmaud

    Abstract: Semi-dense detector-free approaches (SDF), such as LoFTR, are currently among the most popular image matching methods. While SDF methods are trained to establish correspondences between two images, their performances are almost exclusively evaluated using relative pose estimation metrics. Thus, the link between their ability to establish correspondences and the quality of the resulting estimated p… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  5. arXiv:2106.09711  [pdf, other

    cs.CV

    Visual Correspondence Hallucination

    Authors: Hugo Germain, Vincent Lepetit, Guillaume Bourmaud

    Abstract: Given a pair of partially overlapping source and target images and a keypoint in the source image, the keypoint's correspondent in the target image can be either visible, occluded or outside the field of view. Local feature matching methods are only able to identify the correspondent's location when it is visible, while humans can also hallucinate its location when it is occluded or outside the fi… ▽ More

    Submitted 2 February, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

  6. arXiv:2103.07153  [pdf, other

    cs.CV

    Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation

    Authors: Hugo Germain, Vincent Lepetit, Guillaume Bourmaud

    Abstract: Absolute camera pose estimation is usually addressed by sequentially solving two distinct subproblems: First a feature matching problem that seeks to establish putative 2D-3D correspondences, and then a Perspective-n-Point problem that minimizes, with respect to the camera pose, the sum of so-called Reprojection Errors (RE). We argue that generating putative 2D-3D correspondences 1) leads to an im… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  7. arXiv:2004.01673  [pdf, other

    cs.CV

    S2DNet: Learning Accurate Correspondences for Sparse-to-Dense Feature Matching

    Authors: Hugo Germain, Guillaume Bourmaud, Vincent Lepetit

    Abstract: Establishing robust and accurate correspondences is a fundamental backbone to many computer vision algorithms. While recent learning-based feature matching methods have shown promising results in providing robust correspondences under challenging conditions, they are often limited in terms of precision. In this paper, we introduce S2DNet, a novel feature matching pipeline, designed and trained to… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

  8. arXiv:1907.03965  [pdf, other

    cs.CV

    Sparse-to-Dense Hypercolumn Matching for Long-Term Visual Localization

    Authors: Hugo Germain, Guillaume Bourmaud, Vincent Lepetit

    Abstract: We propose a novel approach to feature point matching, suitable for robust and accurate outdoor visual localization in long-term scenarios. Given a query image, we first match it against a database of registered reference images, using recent retrieval techniques. This gives us a first estimate of the camera pose. To refine this estimate, like previous approaches, we match 2D points across the que… ▽ More

    Submitted 20 August, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

  9. arXiv:1812.03707  [pdf, other

    cs.CV

    Improving Nighttime Retrieval-Based Localization

    Authors: Hugo Germain, Guillaume Bourmaud, Vincent Lepetit

    Abstract: Outdoor visual localization is a crucial component to many computer vision systems. We propose an approach to localization from images that is designed to explicitly handle the strong variations in appearance happening between daytime and nighttime. As revealed by recent long-term localization benchmarks, both traditional feature-based and retrieval-based approaches still struggle to handle such c… ▽ More

    Submitted 5 April, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

  10. arXiv:1303.3134  [pdf

    cs.HC cs.CV

    Egocentric vision IT technologies for Alzheimer disease assessment and studies

    Authors: Hugo Boujut, Vincent Buso, Guillaume Bourmaud, Jenny Benois-Pineau, Rémi Mégret, Jean-Philippe Domenger, Yann Gaëstel, Jean-François Dartigues

    Abstract: Egocentric vision technology consists in capturing the actions of persons from their own visual point of view using wearable camera sensors. We apply this new paradigm to instrumental activities monitoring with the objective of providing new tools for the clinical evaluation of the impact of the disease on persons with dementia. In this paper, we introduce the current state of the development of t… ▽ More

    Submitted 13 March, 2013; originally announced March 2013.

    Comments: RITS - Recherche en Imagerie et Technologies pour la Santé, France (2013)