Skip to main content

Showing 1–11 of 11 results for author: Van Gansbeke, W

.
  1. arXiv:2408.16504  [pdf, other

    cs.CV

    A Simple and Generalist Approach for Panoptic Segmentation

    Authors: Nedyalko Prisadnikov, Wouter Van Gansbeke, Danda Pani Paudel, Luc Van Gool

    Abstract: Panoptic segmentation is an important computer vision task, where the current state-of-the-art solutions require specialized components to perform well. We propose a simple generalist framework based on a deep encoder - shallow decoder architecture with per-pixel prediction. Essentially fine-tuning a massively pretrained image model with minimal additional components. Naively this method does not… ▽ More

    Submitted 7 March, 2025; v1 submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2404.05519  [pdf, other

    cs.CV cs.LG

    Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models

    Authors: Saman Motamed, Wouter Van Gansbeke, Luc Van Gool

    Abstract: With recent advances in image and video diffusion models for content creation, a plethora of techniques have been proposed for customizing their generated content. In particular, manipulating the cross-attention layers of Text-to-Image (T2I) diffusion models has shown great promise in controlling the shape and location of objects in the scene. Transferring image-editing techniques to the video dom… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Generative Models for Computer Vision Generative Models for Computer Vision CVPR 2024 Workshop

  3. arXiv:2401.10227  [pdf, other

    cs.CV cs.LG

    A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting

    Authors: Wouter Van Gansbeke, Bert De Brabandere

    Abstract: Panoptic and instance segmentation networks are often trained with specialized object detection modules, complex loss functions, and ad-hoc post-processing steps to manage the permutation-invariance of the instance masks. This work builds upon Stable Diffusion and proposes a latent diffusion approach for panoptic segmentation, resulting in a simple architecture that omits these complexities. Our t… ▽ More

    Submitted 16 July, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted at ECCV 2024, Code: https://github.com/segments-ai/latent-diffusion-segmentation

  4. arXiv:2206.06363  [pdf, other

    cs.CV cs.LG

    Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation

    Authors: Wouter Van Gansbeke, Simon Vandenhende, Luc Van Gool

    Abstract: The task of unsupervised semantic segmentation aims to cluster pixels into semantically meaningful groups. Specifically, pixels assigned to the same cluster should share high-level semantic properties like their object or part category. This paper presents MaskDistill: a novel framework for unsupervised semantic segmentation based on three key ideas. First, we advocate a data-driven strategy to ge… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: Code: https://github.com/wvangansbeke/MaskDistill

  5. arXiv:2106.05967  [pdf, other

    cs.CV cs.LG

    Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

    Authors: Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool

    Abstract: Contrastive self-supervised learning has outperformed supervised pretraining on many downstream tasks like segmentation and object detection. However, current methods are still primarily applied to curated datasets like ImageNet. In this paper, we first study how biases in the dataset affect existing methods. Our results show that current contrastive approaches work surprisingly well across: (i) o… ▽ More

    Submitted 14 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021. Code: https://github.com/wvangansbeke/Revisiting-Contrastive-SSL

  6. arXiv:2102.06191  [pdf, other

    cs.CV cs.LG

    Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals

    Authors: Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool

    Abstract: Being able to learn dense semantic representations of images without supervision is an important problem in computer vision. However, despite its significance, this problem remains rather unexplored, with a few exceptions that considered unsupervised semantic segmentation on small-scale datasets with a narrow visual domain. In this paper, we make a first attempt to tackle the problem on datasets t… ▽ More

    Submitted 3 August, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

    Comments: ICCV 2021 - Code: https://github.com/wvangansbeke/Unsupervised-Semantic-Segmentation

  7. arXiv:2005.12320  [pdf, other

    cs.CV cs.LG

    SCAN: Learning to Classify Images without Labels

    Authors: Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Marc Proesmans, Luc Van Gool

    Abstract: Can we automatically group images into semantically meaningful clusters when ground-truth annotations are absent? The task of unsupervised image classification remains an important, and open challenge in computer vision. Several recent approaches have tried to tackle this problem in an end-to-end fashion. In this paper, we deviate from recent works, and advocate a two-step approach where feature l… ▽ More

    Submitted 3 July, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: Accepted at ECCV 2020. Includes supplementary. Code and pretrained models at https://github.com/wvangansbeke/Unsupervised-Classification

  8. Multi-Task Learning for Dense Prediction Tasks: A Survey

    Authors: Simon Vandenhende, Stamatios Georgoulis, Wouter Van Gansbeke, Marc Proesmans, Dengxin Dai, Luc Van Gool

    Abstract: With the advent of deep learning, many dense prediction tasks, i.e. tasks that produce pixel-level predictions, have seen significant performance improvements. The typical approach is to learn these tasks in isolation, that is, a separate neural network is trained for each individual task. Yet, recent multi-task learning (MTL) techniques have shown promising results w.r.t. performance, computation… ▽ More

    Submitted 24 January, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: Accepted to T-PAMI. Code + Suppl. Mat. can be found here: https://github.com/SimonVandenhende/Multi-Task-Learning-PyTorch IEEE Copyright Notice

  9. arXiv:2001.02613  [pdf, other

    cs.CV cs.LG cs.RO eess.IV

    Don't Forget The Past: Recurrent Depth Estimation from Monocular Video

    Authors: Vaishakh Patil, Wouter Van Gansbeke, Dengxin Dai, Luc Van Gool

    Abstract: Autonomous cars need continuously updated depth information. Thus far, depth is mostly estimated independently for a single frame at a time, even if the method starts from video input. Our method produces a time series of depth maps, which makes it an ideal candidate for online learning approaches. In particular, we put three different types of depth estimation (supervised depth prediction, self-s… ▽ More

    Submitted 28 July, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: Please refer to our webpage for details https://www.trace.ethz.ch/publications/2020/rec_depth_estimation/

  10. arXiv:1902.05356  [pdf, other

    cs.CV

    Sparse and noisy LiDAR completion with RGB guidance and uncertainty

    Authors: Wouter Van Gansbeke, Davy Neven, Bert De Brabandere, Luc Van Gool

    Abstract: This work proposes a new method to accurately complete sparse LiDAR maps guided by RGB images. For autonomous vehicles and robotics the use of LiDAR is indispensable in order to achieve precise depth predictions. A multitude of applications depend on the awareness of their surroundings, and use depth cues to reason and react accordingly. On the one hand, monocular depth prediction methods fail to… ▽ More

    Submitted 14 February, 2019; originally announced February 2019.

    Comments: 7 pages, 3 figures

  11. arXiv:1902.00293  [pdf, other

    cs.CV

    End-to-end Lane Detection through Differentiable Least-Squares Fitting

    Authors: Wouter Van Gansbeke, Bert De Brabandere, Davy Neven, Marc Proesmans, Luc Van Gool

    Abstract: Lane detection is typically tackled with a two-step pipeline in which a segmentation mask of the lane markings is predicted first, and a lane line model (like a parabola or spline) is fitted to the post-processed mask next. The problem with such a two-step approach is that the parameters of the network are not optimized for the true task of interest (estimating the lane curvature parameters) but f… ▽ More

    Submitted 5 September, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: Accepted at ICCVW 2019 (CVRSUAD-Road Scene Understanding and Autonomous Driving)