Showing 1–50 of 178 results for author: Fua, P

  1. arXiv:2507.05952  [pdf, ps, other]

    cs.CV

    High-Fidelity and Generalizable Neural Surface Reconstruction with Sparse Feature Volumes

    Authors: Aoxiang Fan, Corentin Dumery, Nicolas Talabot, Hieu Le, Pascal Fua

    Abstract: Generalizable neural surface reconstruction has become a compelling technique to reconstruct from few images without per-scene optimization, where dense 3D feature volume has proven effective as a global representation of scenes. However, the dense representation does not scale well to increasing voxel resolutions, severely limiting the reconstruction quality. We thus present a sparse representati…

    Submitted 8 July, 2025; originally announced July 2025.

  2. arXiv:2507.04408  [pdf, ps, other]

    cs.CV

    A View-consistent Sampling Method for Regularized Training of Neural Radiance Fields

    Authors: Aoxiang Fan, Corentin Dumery, Nicolas Talabot, Pascal Fua

    Abstract: Neural Radiance Fields (NeRF) has emerged as a compelling framework for scene representation and 3D recovery. To improve its performance on real-world data, depth regularizations have proven to be the most effective ones. However, depth estimation models not only require expensive 3D supervision in training, but also suffer from generalization issues. As a result, the depth estimations can be erro…

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: ICCV 2025 accepted

  3. arXiv:2504.08353  [pdf, other]

    cs.GR cs.CV cs.LG

    Single View Garment Reconstruction Using Diffusion Mapping Via Pattern Coordinates

    Authors: Ren Li, Cong Cao, Corentin Dumery, Yingxuan You, Hao Li, Pascal Fua

    Abstract: Reconstructing 3D clothed humans from images is fundamental to applications like virtual try-on, avatar creation, and mixed reality. While recent advances have enhanced human body recovery, accurate reconstruction of garment geometry -- especially for loose-fitting clothing -- remains an open challenge. We present a novel method for high-fidelity 3D garment reconstruction from single images that b…

    Submitted 15 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: SIGGRAPH 2025

  4. arXiv:2503.13317  [pdf, other]

    stat.ML cs.LG

    Do you understand epistemic uncertainty? Think again! Rigorous frequentist epistemic uncertainty estimation in regression

    Authors: Enrico Foglia, Benjamin Bobbia, Nikita Durasov, Michael Bauerheim, Pascal Fua, Stephane Moreau, Thierry Jardin

    Abstract: Quantifying model uncertainty is critical for understanding prediction reliability, yet distinguishing between aleatoric and epistemic uncertainty remains challenging. We extend recent work from classification to regression to provide a novel frequentist approach to epistemic and aleatoric uncertainty estimation. We train models to generate conditional predictions by feeding their initial output b…

    Submitted 17 March, 2025; originally announced March 2025.

  5. arXiv:2503.06748  [pdf, other]

    cs.CV

    DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion

    Authors: Hantao Zhang, Yuhe Liu, Jiancheng Yang, Weidong Guo, Xinyuan Wang, Pascal Fua

    Abstract: Accurate medical image segmentation is crucial for precise anatomical delineation. Deep learning models like U-Net have shown great success but depend heavily on large datasets and struggle with domain shifts, complex structures, and limited training samples. Recent studies have explored diffusion models for segmentation by iteratively refining masks. However, these methods still retain the conven…

    Submitted 9 March, 2025; originally announced March 2025.

    Comments: 11 pages

  6. arXiv:2503.06740  [pdf, other]

    cs.CV

    D3DR: Lighting-Aware Object Insertion in Gaussian Splatting

    Authors: Vsevolod Skorokhodov, Nikita Durasov, Pascal Fua

    Abstract: Gaussian Splatting has become a popular technique for various 3D Computer Vision tasks, including novel view synthesis, scene reconstruction, and dynamic scene rendering. However, the challenge of natural-looking object insertion, where the object's appearance seamlessly matches the scene, remains unsolved. In this work, we propose a method, dubbed D3DR, for inserting a 3DGS-parametrized object in…

    Submitted 9 March, 2025; originally announced March 2025.

  7. arXiv:2502.12985  [pdf, other]

    cs.CV cs.AI

    PartSDF: Part-Based Implicit Neural Representation for Composite 3D Shape Parametrization and Optimization

    Authors: Nicolas Talabot, Olivier Clerc, Arda Cinar Demirtas, Doruk Oner, Pascal Fua

    Abstract: Accurate 3D shape representation is essential in engineering applications such as design, optimization, and simulation. In practice, engineering workflows require structured, part-aware representations, as objects are inherently designed as assemblies of distinct components. However, most existing methods either model shapes holistically or decompose them without predefined part structures, limiti…

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 22 pages, 14 figures

  8. arXiv:2412.13183  [pdf, ps, other]

    cs.CV

    Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures

    Authors: Guoxing Sun, Rishabh Dabral, Heming Zhu, Pascal Fua, Christian Theobalt, Marc Habermann

    Abstract: Real-time free-view human rendering from sparse-view RGB inputs is a challenging task due to the sensor scarcity and the tight time budget. To ensure efficiency, recent methods leverage 2D CNNs operating in texture space to learn rendering primitives. However, they either jointly learn geometry and appearance, or completely ignore sparse image information for geometry estimation, significantly har…

    Submitted 20 June, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: Accepted at CVPR 2025, Project page: https://vcai.mpi-inf.mpg.de/projects/DUT/

  9. arXiv:2412.02589  [pdf, other]

    cs.CV

    MedTet: An Online Motion Model for 4D Heart Reconstruction

    Authors: Yihong Chen, Jiancheng Yang, Deniz Sayin Mercadier, Hieu Le, Pascal Fua

    Abstract: We present a novel approach to reconstruction of 3D cardiac motion from sparse intraoperative data. While existing methods can accurately reconstruct 3D organ geometries from full 3D volumetric imaging, they cannot be used during surgical interventions where usually limited observed data, such as a few 2D frames or 1D signals, is available in real-time. We propose a versatile framework for reconst…

    Submitted 3 December, 2024; originally announced December 2024.

  10. arXiv:2411.19149  [pdf, other]

    cs.CV

    Counting Stacked Objects

    Authors: Corentin Dumery, Noa Etté, Aoxiang Fan, Ren Li, Jingyi Xu, Hieu Le, Pascal Fua

    Abstract: Visual object counting is a fundamental computer vision task underpinning numerous real-world applications, from cell counting in biomedicine to traffic and wildlife monitoring. However, existing methods struggle to handle the challenge of stacked 3D objects in which most objects are hidden by those above them. To address this important yet underexplored problem, we propose a novel 3D counting app…

    Submitted 18 March, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: 13 pages

  11. arXiv:2411.16466  [pdf, other]

    cs.CV cs.LG

    No Identity, no problem: Motion through detection for people tracking

    Authors: Martin Engilberge, F. Wilke Grosche, Pascal Fua

    Abstract: Tracking-by-detection has become the de facto standard approach to people tracking. To increase robustness, some approaches incorporate re-identification using appearance models and regressing motion offset, which requires costly identity annotations. In this paper, we propose exploiting motion clues while providing supervision only for the detections, which is much easier to do. Our algorithm pre…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: Accepted in TMLR November 2024

  12. arXiv:2410.23910  [pdf, other]

    cs.CV

    Uncertainty Estimation for 3D Object Detection via Evidential Learning

    Authors: Nikita Durasov, Rafid Mahmood, Jiwoong Choi, Marc T. Law, James Lucas, Pascal Fua, Jose M. Alvarez

    Abstract: 3D object detection is an essential task for computer vision applications in autonomous vehicles and robotics. However, models often struggle to quantify detection reliability, leading to poor performance on unfamiliar scenes. We introduce a framework for quantifying uncertainty in 3D object detection by leveraging an evidential learning loss on Bird's Eye View representations in the 3D detector.…

    Submitted 31 October, 2024; originally announced October 2024.

  13. arXiv:2410.22422  [pdf, other]

    cs.CV

    Gradient Distance Function

    Authors: Hieu Le, Federico Stella, Benoit Guillard, Pascal Fua

    Abstract: Unsigned Distance Functions (UDFs) can be used to represent non-watertight surfaces in a deep learning framework. However, UDFs tend to be brittle and difficult to learn, in part because the surface is located exactly where the UDF is non-differentiable. In this work, we show that Gradient Distance Functions (GDFs) can remedy this by being differentiable at the surface while still being able to re…

    Submitted 29 October, 2024; originally announced October 2024.

    Comments: We developed this concurrently with 'Neural Vector Field,' and there are similarities between the two works so please pay them a visit as well. Here, we demonstrate how directly learning the gradient vector is much easier than learning the UDF

  14. arXiv:2410.04201  [pdf, other]

    cs.CV

    IT$^3$: Idempotent Test-Time Training

    Authors: Nikita Durasov, Assaf Shocher, Doruk Oner, Gal Chechik, Alexei A. Efros, Pascal Fua

    Abstract: Deep learning models often struggle when deployed in real-world settings due to distribution shifts between training and test data. While existing approaches like domain adaptation and test-time training (TTT) offer partial solutions, they typically require additional data or domain-specific auxiliary tasks. We present Idempotent Test-Time Training (IT$^3$), a novel approach that enables on-the-fl…

    Submitted 25 May, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted at ICML 2025

  15. arXiv:2409.06231  [pdf, other]

    cs.CV

    A Latent Implicit 3D Shape Model for Multiple Levels of Detail

    Authors: Benoit Guillard, Marc Habermann, Christian Theobalt, Pascal Fua

    Abstract: Implicit neural representations map a shape-specific latent code and a 3D coordinate to its corresponding signed distance (SDF) value. However, this approach only offers a single level of detail. Emulating low levels of detail can be achieved with shallow networks, but the generated shapes are typically not smooth. Alternatively, some network designs offer multiple levels of detail, but are limite…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: Published in GCPR 2024 proceedings

  16. arXiv:2408.09928  [pdf, other]

    cs.CV cs.GR

    Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields

    Authors: Corentin Dumery, Aoxiang Fan, Ren Li, Nicolas Talabot, Pascal Fua

    Abstract: Radiance Fields have become a powerful tool for modeling 3D scenes from multiple images. However, they remain difficult to segment into semantically meaningful regions. Some methods work well using 2D semantic masks, but they generalize poorly to class-agnostic segmentations. More recent methods circumvent this issue by using contrastive learning to optimize a high-dimensional 3D feature field ins…

    Submitted 3 April, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: CVPRW 2025, presented at the 4th Workshop on Open-World 3D Scene Understanding with Foundation Models. Project page: https://corentindumery.github.io/projects/disconerf.html

  17. arXiv:2407.18381  [pdf, other]

    cs.CV

    Neural Surface Detection for Unsigned Distance Fields

    Authors: Federico Stella, Nicolas Talabot, Hieu Le, Pascal Fua

    Abstract: Extracting surfaces from Signed Distance Fields (SDFs) can be accomplished using traditional algorithms, such as Marching Cubes. However, since they rely on sign flips across the surface, these algorithms cannot be used directly on Unsigned Distance Fields (UDFs). In this work, we introduce a deep-learning approach to taking a UDF and turning it locally into an SDF, so that it can be effectively t…

    Submitted 27 October, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  18. arXiv:2407.14352  [pdf, ps, other]

    cs.CV

    Vision-Based Power Line Cables and Pylons Detection for Low Flying Aircraft

    Authors: Jakub Gwizdała, Doruk Oner, Soumava Kumar Roy, Mian Akbar Shah, Ad Eberhard, Ivan Egorov, Philipp Krüsi, Grigory Yakushev, Pascal Fua

    Abstract: Power lines are dangerous for low-flying aircraft, especially in low-visibility conditions. Thus, a vision-based system able to analyze the aircraft's surroundings and to provide the pilots with a "second pair of eyes" can contribute to enhancing their safety. To this end, we have developed a deep learning approach to jointly detect power line cables and pylons from images captured at distances of…

    Submitted 30 July, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Added several declarations at the end of the publication

  19. arXiv:2406.09250  [pdf, other]

    cs.CV cs.AI cs.LG

    MirrorCheck: Efficient Adversarial Defense for Vision-Language Models

    Authors: Samar Fares, Klea Ziu, Toluwani Aremu, Nikita Durasov, Martin Takáč, Pascal Fua, Karthik Nandakumar, Ivan Laptev

    Abstract: Vision-Language Models (VLMs) are becoming increasingly vulnerable to adversarial attacks as various novel attack strategies are being proposed against these models. While existing defenses excel in unimodal contexts, they currently fall short in safeguarding VLMs against adversarial threats. To mitigate this vulnerability, we propose a novel, yet elegantly simple approach for detecting adversaria…

    Submitted 17 October, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  20. arXiv:2405.13781  [pdf, other]

    cs.CV

    Addressing the Elephant in the Room: Robust Animal Re-Identification with Unsupervised Part-Based Feature Alignment

    Authors: Yingxue Yu, Vidit Vidit, Andrey Davydov, Martin Engilberge, Pascal Fua

    Abstract: Animal Re-ID is crucial for wildlife conservation, yet it faces unique challenges compared to person Re-ID. First, the scarcity and lack of diversity in datasets lead to background-biased models. Second, animal Re-ID depends on subtle, species-specific cues, further complicated by variations in pose, background, and lighting. This study addresses background biases by proposing a method to systemat…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR workshop CV4Animals 2024

  21. arXiv:2405.10934  [pdf, other]

    cs.CV

    Reconstruction of Manipulated Garment with Guided Deformation Prior

    Authors: Ren Li, Corentin Dumery, Zhantao Deng, Pascal Fua

    Abstract: Modeling the shape of garments has received much attention, but most existing approaches assume the garments to be worn by someone, which constrains the range of shapes they can assume. In this work, we address shape recovery when garments are being manipulated instead of worn, which gives rise to an even larger range of possible shapes. To this end, we leverage the implicit sewing patterns (ISP)…

    Submitted 13 October, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024

  22. arXiv:2403.18820  [pdf, other]

    cs.CV

    MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

    Authors: Guoxing Sun, Rishabh Dabral, Pascal Fua, Christian Theobalt, Marc Habermann

    Abstract: Faithful human performance capture and free-view rendering from sparse RGB observations is a long-standing problem in Vision and Graphics. The main challenges are the lack of observations and the inherent ambiguities of the setting, e.g. occlusions and depth ambiguity. As a result, radiance fields, which have shown great promise in capturing high-frequency appearance and geometry details in dense…

    Submitted 24 July, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Project page: https://vcai.mpi-inf.mpg.de/projects/MetaCap/

  23. arXiv:2403.17755  [pdf, other]

    cs.AI cs.CR cs.CV

    DataCook: Crafting Anti-Adversarial Examples for Healthcare Data Copyright Protection

    Authors: Sihan Shang, Jiancheng Yang, Zhenglong Sun, Pascal Fua

    Abstract: In the realm of healthcare, the challenges of copyright protection and unauthorized third-party misuse are increasingly significant. Traditional methods for data copyright protection are applied prior to data distribution, implying that models trained on these data become uncontrollable. This paper introduces a novel approach, named DataCook, designed to safeguard the copyright of healthcare data…

    Submitted 26 March, 2024; originally announced March 2024.

  24. arXiv:2403.16732  [pdf, other]

    cs.AI

    Enabling Uncertainty Estimation in Iterative Neural Networks

    Authors: Nikita Durasov, Doruk Oner, Jonathan Donier, Hieu Le, Pascal Fua

    Abstract: Turning pass-through network architectures into iterative ones, which use their own output as input, is a well-known approach for boosting performance. In this paper, we argue that such architectures offer an additional benefit: The convergence rate of their successive outputs is highly correlated with the accuracy of the value to which they converge. Thus, we can use the convergence rate as a use…

    Submitted 25 May, 2025; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted at ICML 2024

  25. arXiv:2403.14066  [pdf, other]

    eess.IV cs.CV

    LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models

    Authors: Hantao Zhang, Yuhe Liu, Jiancheng Yang, Shouhong Wan, Xinyuan Wang, Wei Peng, Pascal Fua

    Abstract: Patient data from real-world clinical practice often suffers from data scarcity and long-tail imbalances, leading to biased outcomes or algorithmic unfairness. This study addresses these challenges by generating lesion-containing image-segmentation pairs from lesion-free images. Previous efforts in medical imaging synthesis have struggled with separating lesion information from background, resulti…

    Submitted 4 October, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 19 pages

  26. arXiv:2403.09050  [pdf, other]

    cs.CV

    CLOAF: CoLlisiOn-Aware Human Flow

    Authors: Andrey Davydov, Martin Engilberge, Mathieu Salzmann, Pascal Fua

    Abstract: Even the best current algorithms for estimating body 3D shape and pose yield results that include body self-intersections. In this paper, we present CLOAF, which exploits the diffeomorphic nature of Ordinary Differential Equations to eliminate such self-intersections while still imposing body shape constraints. We show that, unlike earlier approaches to addressing this issue, ours completely elimi…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: CVPR 2024, 13 pages

  27. arXiv:2402.11036  [pdf, other]

    cs.CV cs.LG

    Occlusion Resilient 3D Human Pose Estimation

    Authors: Soumava Kumar Roy, Ilia Badanin, Sina Honari, Pascal Fua

    Abstract: Occlusions remain one of the key challenges in 3D body pose estimation from single-camera video sequences. Temporal consistency has been extensively used to mitigate their impact but the existing algorithms in the literature do not explicitly model them. Here, we apply this by representing the deforming body as a spatio-temporal graph. We then introduce a refinement network that performs graph c…

    Submitted 16 February, 2024; originally announced February 2024.

  28. arXiv:2402.02736  [pdf, other]

    cs.CV cs.LG

    Using Motion Cues to Supervise Single-Frame Body Pose and Shape Estimation in Low Data Regimes

    Authors: Andrey Davydov, Alexey Sidnev, Artsiom Sanakoyeu, Yuhua Chen, Mathieu Salzmann, Pascal Fua

    Abstract: When enough annotated training data is available, supervised deep-learning algorithms excel at estimating human body pose and shape using a single camera. The effects of too little such data being available can be mitigated by using other information sources, such as databases of body shapes, to learn priors. Unfortunately, such sources are not always available either. We show that, in such cases,…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 21 pages; TMLR

  29. arXiv:2311.10356  [pdf, other]

    cs.CV

    Garment Recovery with Shape and Deformation Priors

    Authors: Ren Li, Corentin Dumery, Benoît Guillard, Pascal Fua

    Abstract: While modeling people wearing tight-fitting clothing has made great strides in recent years, loose-fitting clothing remains a challenge. We propose a method that delivers realistic garment models from real-world images, regardless of garment shape or deformation. To this end, we introduce a fitting approach that utilizes shape and deformation priors learned from synthetic data to accurately captur…

    Submitted 11 March, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  30. arXiv:2309.17329  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG eess.IV

    Efficient Anatomical Labeling of Pulmonary Tree Structures via Deep Point-Graph Representation-based Implicit Fields

    Authors: Kangxian Xie, Jiancheng Yang, Donglai Wei, Ziqiao Weng, Pascal Fua

    Abstract: Pulmonary diseases rank prominently among the principal causes of death worldwide. Curing them will require, among other things, a better understanding of the complex 3D tree-shaped structures within the pulmonary system, such as airways, arteries, and veins. Traditional approaches using high-resolution image stacks and standard CNNs on dense voxel grids face challenges in computational efficiency…

    Submitted 17 October, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted by Medical Image Analysis

    MSC Class: 68T45; 62P10; 68U10; 68U05; 05C90

  31. LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline

    Authors: Víctor M. Batlle, José M. M. Montiel, Pascal Fua, Juan D. Tardós

    Abstract: We propose a new approach to 3D reconstruction from sequences of images acquired by monocular endoscopes. It is based on two key insights. First, endoluminal cavities are watertight, a property naturally enforced by modeling them in terms of a signed distance function. Second, the scene illumination is variable. It comes from the endoscope's light sources and decays with the inverse of the squared…

    Submitted 19 May, 2025; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 13 pages, 7 figures, 1 table

    Journal ref: MICCAI 2023. Lecture Notes in Computer Science, vol 14229 (2023) pp 502-512

  32. arXiv:2308.16139  [pdf, other]

    cs.CV cs.DB cs.LG

    MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision

    Authors: Jianning Li, Zongwei Zhou, Jiancheng Yang, Antonio Pepe, Christina Gsaxner, Gijs Luijten, Chongyu Qu, Tiezheng Zhang, Xiaoxi Chen, Wenxuan Li, Marek Wodzinski, Paul Friedrich, Kangxian Xie, Yuan Jin, Narmada Ambigapathy, Enrico Nasca, Naida Solak, Gian Marco Melito, Viet Duc Vu, Afaque R. Memon, Christopher Schlachta, Sandrine De Ribaupierre, Rajnikant Patel, Roy Eagleson, Xiaojun Chen, et al. (132 additional authors not shown)

    Abstract: Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of Shape…

    Submitted 12 December, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 16 pages

    MSC Class: 68T01

  33. LightDepth: Single-View Depth Self-Supervision from Illumination Decline

    Authors: Javier Rodríguez-Puigvert, Víctor M. Batlle, J. M. M. Montiel, Ruben Martinez-Cantin, Pascal Fua, Juan D. Tardós, Javier Civera

    Abstract: Single-view depth estimation can be remarkably effective if there is enough ground-truth depth data for supervised training. However, there are scenarios, especially in medicine in the case of endoscopies, where such data cannot be obtained. In such cases, multi-view self-supervision and synthetic-to-real transfer serve as alternative approaches, however, with a considerable performance reduction…

    Submitted 19 September, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  34. arXiv:2307.08716  [pdf, other]

    cs.CV

    Pairwise-Constrained Implicit Functions for 3D Human Heart Modelling

    Authors: Hieu Le, Jingyi Xu, Nicolas Talabot, Jiancheng Yang, Pascal Fua

    Abstract: Accurate 3D models of the human heart require not only correct outer surfaces but also realistic inner structures, such as the ventricles, atria, and myocardial layers. Approaches relying on implicit surfaces, such as signed distance functions (SDFs), are primarily designed for single watertight surfaces, making them ill-suited for multi-layered anatomical structures. They often produce gaps or ov…

    Submitted 2 April, 2025; v1 submitted 16 July, 2023; originally announced July 2023.

  35. arXiv:2305.14100  [pdf, other]

    cs.CV

    ISP: Multi-Layered Garment Draping with Implicit Sewing Patterns

    Authors: Ren Li, Benoît Guillard, Pascal Fua

    Abstract: Many approaches to draping individual garments on human body models are realistic, fast, and yield outputs that are differentiable with respect to the body shape on which they are draped. However, they are either unable to handle multi-layered clothing, which is prevalent in everyday dress, or restricted to bodies in T-pose. In this paper, we introduce a parametric garment representation model tha…

    Submitted 14 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  36. arXiv:2305.02116  [pdf, other]

    cs.CV physics.flu-dyn

    Automatic Parameterization for Aerodynamic Shape Optimization via Deep Geometric Learning

    Authors: Zhen Wei, Pascal Fua, Michaël Bauerheim

    Abstract: We propose two deep learning models that fully automate shape parameterization for aerodynamic shape optimization. Both models are optimized to parameterize via deep geometric learning to embed human prior knowledge into learned geometric patterns, eliminating the need for further handcrafting. The Latent Space Model (LSM) learns a low-dimensional latent representation of an object from a dataset…

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: 15 pages, to appear at AIAA Aviation Forum 2023

  37. arXiv:2303.05916  [pdf, other]

    cs.CV

    GECCO: Geometrically-Conditioned Point Diffusion Models

    Authors: Michał J. Tyszkiewicz, Pascal Fua, Eduard Trulls

    Abstract: Diffusion models generating images conditionally on text, such as Dall-E 2 and Stable Diffusion, have recently made a splash far beyond the computer vision community. Here, we tackle the related problem of generating point clouds, both unconditionally, and conditionally with images. For the latter, we introduce a novel geometrically-motivated conditioning scheme based on projecting sparse image fe…

    Submitted 25 September, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  38. arXiv:2212.14397  [pdf, other]

    cs.CV

    AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains

    Authors: Krzysztof Lis, Matthias Rottmann, Annika Mütze, Sina Honari, Pascal Fua, Mathieu Salzmann

    Abstract: In addition to impressive performance, vision transformers have demonstrated remarkable abilities to encode information they were not trained to extract. For example, this information can be used to perform segmentation or single-view depth estimation even though the networks were only trained for image recognition. We show that a similar phenomenon occurs when explicitly training transformers for…

    Submitted 29 December, 2024; v1 submitted 29 December, 2022; originally announced December 2022.

    ACM Class: I.4.6; I.4.8; I.5.4

    Journal ref: 35th British Machine Vision Conference 2024, BMVC 2024, Glasgow, UK, November 25-28, 2024

  39. arXiv:2211.12829  [pdf, other]

    cs.CV cs.LG

    Unsupervised 3D Keypoint Discovery with Multi-View Geometry

    Authors: Sina Honari, Chen Zhao, Mathieu Salzmann, Pascal Fua

    Abstract: Analyzing and training 3D body posture models depend heavily on the availability of joint labels that are commonly acquired through laborious manual annotation of body joints or via marker-based joint localization using carefully curated markers and capturing systems. However, such annotations are not always available, especially for people performing unusual activities. In this paper, we propose…

    Submitted 7 February, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted in "3DV 2024"

  40. arXiv:2211.11546  [pdf, other]

    cs.CV

    PartAL: Efficient Partial Active Learning in Multi-Task Visual Settings

    Authors: Nikita Durasov, Nik Dorndorf, Pascal Fua

    Abstract: Multi-task learning is central to many real-world applications. Unfortunately, obtaining labelled data for all tasks is time-consuming, challenging, and expensive. Active Learning (AL) can be used to reduce this burden. Existing techniques typically involve picking images to be annotated and providing annotations for all tasks. In this paper, we show that it is more effective to select not only…

    Submitted 21 November, 2022; originally announced November 2022.

  41. arXiv:2211.11435  [pdf, other]

    cs.LG cs.CV

    ZigZag: Universal Sampling-free Uncertainty Estimation Through Two-Step Inference

    Authors: Nikita Durasov, Nik Dorndorf, Hieu Le, Pascal Fua

    Abstract: Whereas the ability of deep networks to produce useful predictions has been amply demonstrated, estimating the reliability of these predictions remains challenging. Sampling approaches such as MC-Dropout and Deep Ensembles have emerged as the most popular ones for this purpose. Unfortunately, they require many forward passes at inference time, which slows them down. Sampling-free approaches can be…

    Submitted 26 May, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted to Transactions on Machine Learning Research (TMLR), ICML ABBI 2024

  42. arXiv:2211.11277  [pdf, other]

    cs.CV

    DrapeNet: Garment Generation and Self-Supervised Draping

    Authors: Luca De Luigi, Ren Li, Benoît Guillard, Mathieu Salzmann, Pascal Fua

    Abstract: Recent approaches to drape garments quickly over arbitrary human bodies leverage self-supervision to eliminate the need for large training sets. However, they are designed to train one network per clothing item, which severely limits their generalization abilities. In our work, we rely on self-supervision to train a single network to drape multiple garments. This is achieved by predicting a 3D def…

    Submitted 22 March, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  43. arXiv:2210.15664  [pdf, other]

    cs.CV cs.GR

    State of the Art in Dense Monocular Non-Rigid 3D Reconstruction

    Authors: Edith Tretschk, Navami Kairanda, Mallikarjun B R, Rishabh Dabral, Adam Kortylewski, Bernhard Egger, Marc Habermann, Pascal Fua, Christian Theobalt, Vladislav Golyanik

    Abstract: 3D reconstruction of deformable (or non-rigid) scenes from a set of monocular 2D image observations is a long-standing and actively researched area of computer vision and graphics. It is an ill-posed inverse problem, since -- without additional prior assumptions -- it permits infinitely many solutions leading to accurate projection to the input 2D images. Non-rigid reconstruction is a foundational…

    Submitted 24 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 36 pages, 18 figures, 3 tables; State-of-the-Art Report at EUROGRAPHICS 2023

    Journal ref: Computer Graphics Forum, 2023

  44. arXiv:2210.10771  [pdf, other]

    cs.CV cs.LG

    Multi-view Tracking Using Weakly Supervised Human Motion Prediction

    Authors: Martin Engilberge, Weizhe Liu, Pascal Fua

    Abstract: Multi-view approaches to people-tracking have the potential to better handle occlusions than single-view ones in crowded scenes. They often rely on the tracking-by-detection paradigm, which involves detecting people first and then connecting the detections. In this paper, we argue that an even more effective approach is to predict people's motion over time and infer people's presence in individual f…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023

  45. arXiv:2210.10756  [pdf, other]

    cs.CV cs.LG

    Two-level Data Augmentation for Calibrated Multi-view Detection

    Authors: Martin Engilberge, Haixin Shi, Zhiye Wang, Pascal Fua

    Abstract: Data augmentation has proven its usefulness in improving model generalization and performance. While it is commonly applied in computer vision applications, it is rarely used in multi-view systems. Indeed, geometric data augmentation can break the alignment among views. This is problematic since multi-view data tend to be scarce and are expensive to annotate. In this work we propose to…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023

  46. Perspective Aware Road Obstacle Detection

    Authors: Krzysztof Lis, Sina Honari, Pascal Fua, Mathieu Salzmann

    Abstract: While road obstacle detection techniques have become increasingly effective, they typically ignore the fact that, in practice, the apparent size of the obstacles decreases as their distance to the vehicle increases. In this paper, we account for this by computing a scale map encoding the apparent size of a hypothetical object at every image location. We then leverage this perspective map to (i) ge…

    Submitted 19 June, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    ACM Class: I.4.6; I.4.8; I.5.4

    Journal ref: IEEE Robotics and Automation Letters (Volume: 8, Issue: 4, April 2023, Pages: 2150-2157)

  47. arXiv:2209.10986  [pdf, other]

    cs.RO cs.CV

    Learning to Simulate Realistic LiDARs

    Authors: Benoit Guillard, Sai Vemprala, Jayesh K. Gupta, Ondrej Miksik, Vibhav Vineet, Pascal Fua, Ashish Kapoor

    Abstract: Simulating realistic sensors is a challenging part in data generation for autonomous systems, often involving carefully handcrafted sensor design, scene properties, and physics modeling. To alleviate this, we introduce a pipeline for data-driven simulation of a realistic LiDAR sensor. We propose a model that learns a mapping between RGB images and corresponding LiDAR features such as raydrop or pe…

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: IROS 2022 paper

  48. arXiv:2209.10845  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    DIG: Draping Implicit Garment over the Human Body

    Authors: Ren Li, Benoît Guillard, Edoardo Remelli, Pascal Fua

    Abstract: Existing data-driven methods for draping garments over human bodies, despite being effective, cannot handle garments of arbitrary topology and are typically not end-to-end differentiable. To address these limitations, we propose an end-to-end differentiable pipeline that represents garments using implicit surfaces and learns a skinning field conditioned on shape and pose parameters of an articulat…

    Submitted 24 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 16 pages, 9 figures, 5 tables, ACCV 2022

  49. arXiv:2208.03257  [pdf, other]

    cs.CV

    3D Pose Based Feedback for Physical Exercises

    Authors: Ziyi Zhao, Sena Kiciroglu, Hugues Vinzant, Yuan Cheng, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua

    Abstract: Unsupervised self-rehabilitation exercises and physical training can cause serious injuries if performed incorrectly. We introduce a learning-based framework that identifies the mistakes made by a user and proposes corrective measures for easier and safer individual training. Our framework does not rely on hard-coded, heuristic rules. Instead, it learns them from data, which facilitates its adapta…

    Submitted 5 August, 2022; originally announced August 2022.

    Comments: Video: https://youtu.be/W3kyyeHe0SI

  50. Enforcing connectivity of 3D linear structures using their 2D projections

    Authors: Doruk Oner, Hussein Osman, Mateusz Kozinski, Pascal Fua

    Abstract: Many biological and medical tasks require the delineation of 3D curvilinear structures such as blood vessels and neurites from image volumes. This is typically done using neural networks trained by minimizing voxel-wise loss functions that do not capture the topological properties of these structures. As a result, the connectivity of the recovered structures is often wrong, which lessens their use…

    Submitted 24 December, 2022; v1 submitted 14 July, 2022; originally announced July 2022.