
Showing 1–50 of 133 results for author: Van de Weijer, J

  1. arXiv:2506.00037  [pdf, ps, other]

    cs.IR cs.LG

    Query Drift Compensation: Enabling Compatibility in Continual Learning of Retrieval Embedding Models

    Authors: Dipam Goswami, Liying Wang, Bartłomiej Twardowski, Joost van de Weijer

    Abstract: Text embedding models enable semantic search, powering several NLP applications like Retrieval Augmented Generation by efficient information retrieval (IR). However, text embedding models are commonly studied in scenarios where the training data is static, thus limiting its applications to dynamic scenarios where new training data emerges over time. IR methods generally encode a huge corpus of doc…

    Submitted 27 May, 2025; originally announced June 2025.

    Comments: Accepted at CoLLAs 2025

  2. arXiv:2505.21960  [pdf, ps, other]

    cs.CV

    One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models

    Authors: Senmao Li, Lei Wang, Kai Wang, Tao Liu, Jiehang Xie, Joost van de Weijer, Fahad Shahbaz Khan, Shiqi Yang, Yaxing Wang, Jian Yang

    Abstract: Text-to-Image (T2I) diffusion models have made remarkable advancements in generative modeling; however, they face a trade-off between inference speed and image quality, posing challenges for efficient deployment. Existing distilled T2I models can generate high-fidelity images with fewer sampling steps, but often struggle with diversity and quality, especially in one-step models. From our analysis,…

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Accepted at CVPR2025, Code: https://github.com/sen-mao/Loopfree

  3. arXiv:2503.14275  [pdf, other]

    cs.CV

    Free-Lunch Color-Texture Disentanglement for Stylized Image Generation

    Authors: Jiang Qin, Senmao Li, Alexandra Gomez-Villa, Shiqi Yang, Yaxing Wang, Kai Wang, Joost van de Weijer

    Abstract: Recent advances in Text-to-Image (T2I) diffusion models have transformed image generation, enabling significant progress in stylized generation using only a few style reference images. However, current diffusion-based methods struggle with fine-grained style customization due to challenges in controlling multiple style attributes, such as color and texture. This paper introduces the first tuning-f…

    Submitted 21 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

    Comments: Code will be available at https://deepffff.github.io/sadis.github.io/

  4. arXiv:2503.10439  [pdf, other]

    cs.CV

    EFC++: Elastic Feature Consolidation with Prototype Re-balancing for Cold Start Exemplar-free Incremental Learning

    Authors: Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais, Joost van de Weijer, Andrew D. Bagdanov

    Abstract: Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, resulting in feature drift which is…

    Submitted 15 March, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: Under Review since July 2024. Extension of our previous conference paper https://openreview.net/forum?id=7D9X2cFnt1

  5. arXiv:2503.09864  [pdf, other]

    cs.GR cs.CV cs.LG

    Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models

    Authors: Héctor Laria, Alexandra Gomez-Villa, Jiang Qin, Muhammad Atif Butt, Bogdan Raducanu, Javier Vazquez-Corral, Joost van de Weijer, Kai Wang

    Abstract: Recent advances in text-to-image (T2I) diffusion models have enabled remarkable control over various attributes, yet precise color specification remains a fundamental challenge. Existing approaches, such as ColorPeel, rely on model personalization, requiring additional optimization and limiting flexibility in specifying arbitrary colors. In this work, we introduce ColorWave, a novel training-free…

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: Project page: https://hecoding.github.io/colorwave-page

  6. arXiv:2502.09140  [pdf, ps, other]

    cs.LG cs.CV

    Replay-free Online Continual Learning with Self-Supervised MultiPatches

    Authors: Giacomo Cignoni, Andrea Cossu, Alex Gomez-Villa, Joost van de Weijer, Antonio Carta

    Abstract: Online Continual Learning (OCL) methods train a model on a non-stationary data stream where only a few examples are available at a time, often leveraging replay strategies. However, usage of replay is sometimes forbidden, especially in applications with strict privacy regulations. Therefore, we propose Continual MultiPatches (CMP), an effective plug-in for existing OCL self-supervised learning str…

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: Accepted at ESANN 2025

    ACM Class: I.2.6

  7. arXiv:2502.04959  [pdf, ps, other]

    cs.LG

    No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces

    Authors: Daniel Marczak, Simone Magistri, Sebastian Cygert, Bartłomiej Twardowski, Andrew D. Bagdanov, Joost van de Weijer

    Abstract: Model merging integrates the weights of multiple task-specific models into a single multi-task model. Despite recent interest in the problem, a significant performance gap between the combined and single-task models remains. In this paper, we investigate the key characteristics of task matrices -- weight update matrices applied to a pre-trained model -- that enable effective merging. We show that…

    Submitted 11 June, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted at ICML 2025

  8. arXiv:2502.04469  [pdf, other]

    cs.CV cs.AI

    No Images, No Problem: Retaining Knowledge in Continual VQA with Questions-Only Memory

    Authors: Imad Eddine Marouf, Enzo Tartaglione, Stephane Lathuiliere, Joost van de Weijer

    Abstract: Continual Learning in Visual Question Answering (VQACL) requires models to learn new visual-linguistic tasks (plasticity) while retaining knowledge from previous tasks (stability). The multimodal nature of VQACL presents unique challenges, requiring models to balance stability across visual and textual domains while maintaining plasticity to adapt to novel objects and reasoning tasks. Existing met…

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 8 pages, in-review

  9. arXiv:2502.02215  [pdf, other]

    cs.CV

    InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration

    Authors: Senmao Li, Kai Wang, Joost van de Weijer, Fahad Shahbaz Khan, Chun-Le Guo, Shiqi Yang, Yaxing Wang, Jian Yang, Ming-Ming Cheng

    Abstract: Diffusion priors have been used for blind face restoration (BFR) by fine-tuning diffusion models (DMs) on restoration datasets to recover low-quality images. However, the naive application of DMs presents several key limitations. (i) The diffusion prior has inferior semantic consistency (e.g., ID, structure and color), increasing the difficulty of optimizing the BFR model; (ii) reliance on hundre…

    Submitted 21 March, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: Accepted at ICLR2025

  10. arXiv:2501.13554  [pdf, other]

    cs.CV cs.AI cs.LG

    One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

    Authors: Tao Liu, Kai Wang, Senmao Li, Joost van de Weijer, Fahad Shahbaz Khan, Shiqi Yang, Yaxing Wang, Jian Yang, Ming-Ming Cheng

    Abstract: Text-to-image generation models can create high-quality images from input prompts. However, they struggle to support the consistent generation of identity-preserving requirements for storytelling. Existing approaches to this problem typically require extensive training in large datasets or additional modifications to the original model architectures. This limits their applicability across differen…

    Submitted 5 February, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: 28 pages, 22 figures, ICLR2025 conference

  11. arXiv:2412.14326  [pdf, other]

    cs.LG cs.CV

    Covariances for Free: Exploiting Mean Distributions for Federated Learning with Pre-Trained Models

    Authors: Dipam Goswami, Simone Magistri, Kai Wang, Bartłomiej Twardowski, Andrew D. Bagdanov, Joost van de Weijer

    Abstract: Using pre-trained models has been found to reduce the effect of data heterogeneity and speed up federated learning algorithms. Recent works have investigated the use of first-order statistics and second-order statistics to aggregate local client data distributions at the server and achieve very high performance without any training. In this work we propose a training-free method based on an unbias…

    Submitted 4 February, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

  12. arXiv:2412.10122  [pdf, other]

    cs.CV

    The Art of Deception: Color Visual Illusions and Diffusion Models

    Authors: Alex Gomez-Villa, Kai Wang, Alejandro C. Parraga, Bartlomiej Twardowski, Jesus Malo, Javier Vazquez-Corral, Joost van de Weijer

    Abstract: Visual illusions in humans arise when interpreting out-of-distribution stimuli: if the observer is adapted to certain statistics, perception of outliers deviates from reality. Recent studies have shown that artificial neural networks (ANNs) can also be deceived by visual illusions. This revelation raises profound questions about the nature of visual information. Why are two independent systems, bo…

    Submitted 13 December, 2024; originally announced December 2024.

  13. arXiv:2411.07132  [pdf, other]

    cs.CV cs.AI

    Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis

    Authors: Taihang Hu, Linxuan Li, Joost van de Weijer, Hongcheng Gao, Fahad Shahbaz Khan, Jian Yang, Ming-Ming Cheng, Kai Wang, Yaxing Wang

    Abstract: Although text-to-image (T2I) models exhibit remarkable generation capabilities, they frequently fail to accurately bind semantically related objects or attributes in the input prompts; a challenge termed semantic binding. Previous approaches either involve intensive fine-tuning of the entire T2I model or require users or large language models to specify generation layouts, adding complexity. In th…

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: Accepted by NeurIPS 2024

  14. arXiv:2410.22317  [pdf, other]

    cs.CV

    Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier

    Authors: Kai Wang, Fei Yang, Bogdan Raducanu, Joost van de Weijer

    Abstract: With the advent of large pre-trained vision-language models such as CLIP, prompt learning methods aim to enhance the transferability of the CLIP model. They learn the prompt given few samples from the downstream task given the specific class names as prior knowledge, which we term as semantic-aware classification. However, in many realistic scenarios, we only have access to few samples and knowled…

    Submitted 29 October, 2024; originally announced October 2024.

    Comments: Accepted in WACV 2025. Code link: https://github.com/wangkai930418/mc_ti

  15. arXiv:2410.14159  [pdf, other]

    cs.CV cs.GR cs.LG

    Assessing Open-world Forgetting in Generative Image Model Customization

    Authors: Héctor Laria, Alex Gomez-Villa, Kai Wang, Bogdan Raducanu, Joost van de Weijer

    Abstract: Recent advances in diffusion models have significantly enhanced image generation capabilities. However, customizing these models with new classes often leads to unintended consequences that compromise their reliability. We introduce the concept of open-world forgetting to characterize the vast scope of these unintended alterations. Our work presents the first systematic investigation into open-wor…

    Submitted 5 February, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: Update: Added feedback; Project page: https://hecoding.github.io/open-world-forgetting/

  16. arXiv:2408.01076  [pdf, ps, other]

    cs.CV

    Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning

    Authors: Lu Yu, Zhe Tao, Dipam Goswami, Hantao Yao, Bartłomiej Twardowski, Joost Van de Weijer, Changsheng Xu

    Abstract: Deep neural networks (DNNs) excel on fixed datasets but struggle with incremental and shifting data in real-world scenarios. Continual learning addresses this challenge by allowing models to learn from new data while retaining previously learned knowledge. Existing methods mainly rely on visual features, often neglecting the rich semantic information encoded in text. The semantic knowledge availab…

    Submitted 9 June, 2025; v1 submitted 2 August, 2024; originally announced August 2024.

  17. arXiv:2407.08536  [pdf, other]

    cs.CV

    Exemplar-free Continual Representation Learning via Learnable Drift Compensation

    Authors: Alex Gomez-Villa, Dipam Goswami, Kai Wang, Andrew D. Bagdanov, Bartlomiej Twardowski, Joost van de Weijer

    Abstract: Exemplar-free class-incremental learning using a backbone trained from scratch and starting from a small first task presents a significant challenge for continual representation learning. Prototype-based approaches, when continually updated, face the critical issue of semantic drift due to which the old class prototypes drift to different positions in the new feature space. Through an analysis of…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  18. arXiv:2407.07197  [pdf, other]

    cs.CV

    ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement

    Authors: Muhammad Atif Butt, Kai Wang, Javier Vazquez-Corral, Joost van de Weijer

    Abstract: Text-to-Image (T2I) generation has made significant advancements with the advent of diffusion models. These models exhibit remarkable abilities to produce images based on textual prompts. Current T2I models allow users to specify object colors using linguistic color names. However, these labels encompass broad color ranges, making it difficult to achieve precise color matching. To tackle this chal…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted in ECCV 2024

  19. arXiv:2406.05114  [pdf, other]

    cs.LG cs.CV

    The Expanding Scope of the Stability Gap: Unveiling its Presence in Joint Incremental Learning of Homogeneous Tasks

    Authors: Sandesh Kamath, Albin Soutif-Cormerais, Joost van de Weijer, Bogdan Raducanu

    Abstract: Recent research identified a temporary performance drop on previously learned tasks when transitioning to a new one. This drop is called the stability gap and has great consequences for continual learning: it complicates the direct employment of continual learning since the worst-case performance at task boundaries is dramatic, it limits its potential as an energy-efficient training paradigm, an…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024 Workshop on Continual Learning in Computer Vision (CLVision)

  20. arXiv:2405.19074  [pdf, other]

    cs.CV cs.AI

    Resurrecting Old Classes with New Data for Exemplar-Free Continual Learning

    Authors: Dipam Goswami, Albin Soutif-Cormerais, Yuyang Liu, Sandesh Kamath, Bartłomiej Twardowski, Joost van de Weijer

    Abstract: Continual learning methods are known to suffer from catastrophic forgetting, a phenomenon that is particularly hard to counter for methods that do not store exemplars of previous tasks. Therefore, to reduce potential drift in the feature extractor, existing exemplar-free methods are typically evaluated in settings where the first task is significantly larger than subsequent tasks. Their performanc…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024

  21. arXiv:2405.18069  [pdf, other]

    cs.LG

    An Empirical Analysis of Forgetting in Pre-trained Models with Incremental Low-Rank Updates

    Authors: Albin Soutif-Cormerais, Simone Magistri, Joost van de Weijer, Andrew D. Bagdanov

    Abstract: Broad, open source availability of large pretrained foundation models on the internet through platforms such as HuggingFace has taken the world of practical deep learning by storm. A classical pipeline for neural network training now typically consists of finetuning these pretrained networks on a small target dataset instead of training from scratch. In the case of large models this can be done eve…

    Submitted 19 May, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: CoLLAs 2024 accepted paper, PMLR 274:996-1012

  22. arXiv:2405.01496  [pdf, other]

    cs.CV

    LocInv: Localization-aware Inversion for Text-Guided Image Editing

    Authors: Chuanming Tang, Kai Wang, Fei Yang, Joost van de Weijer

    Abstract: Large-scale Text-to-Image (T2I) diffusion models demonstrate significant generation capabilities based on textual prompts. Based on the T2I diffusion models, text-guided image editing research aims to empower users to manipulate generated images by altering the text prompts. However, existing image editing techniques are prone to editing over unintentional regions that are beyond the intended targ…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024 Workshop AI4CC

  23. arXiv:2404.06622  [pdf, other]

    cs.CV

    Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-trained Vision Transformers

    Authors: Dipam Goswami, Bartłomiej Twardowski, Joost van de Weijer

    Abstract: Few-shot class-incremental learning (FSCIL) aims to adapt the model to new classes from very few data (5 samples) without forgetting the previously learned classes. Recent works in many-shot CIL (MSCIL) (using all available training data) exploited pre-trained models to reduce forgetting and achieve better plasticity. In a similar fashion, we use ViT models pre-trained on large-scale datasets for…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at CLVision workshop (CVPR 2024)

  24. arXiv:2403.07404  [pdf, ps, other]

    cs.LG cs.AI

    Improving Continual Learning Performance and Efficiency with Auxiliary Classifiers

    Authors: Filip Szatkowski, Yaoyue Zheng, Fei Yang, Bartłomiej Twardowski, Tomasz Trzciński, Joost van de Weijer

    Abstract: Continual learning is crucial for applying machine learning in challenging, dynamic, and often resource-constrained environments. However, catastrophic forgetting - overwriting previously learned knowledge when new information is acquired - remains a major challenge. In this work, we examine the intermediate representations in neural network layers during continual learning and find that such repr…

    Submitted 29 May, 2025; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: ICML 2025 (main track poster)

  25. arXiv:2402.05375  [pdf, other]

    cs.CV

    Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models

    Authors: Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang

    Abstract: The success of recent text-to-image diffusion models is largely due to their capacity to be guided by a complex text prompt, which enables users to precisely describe the desired content. However, these models struggle to effectively suppress the generation of undesired content, which is explicitly requested to be omitted from the generated image in the prompt. In this paper, we analyze how to man…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: ICLR 2024. Our code is available in https://github.com/sen-mao/SuppressEOT

  26. arXiv:2402.03917  [pdf, other]

    cs.CV cs.LG

    Elastic Feature Consolidation for Cold Start Exemplar-Free Incremental Learning

    Authors: Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais, Joost van de Weijer, Andrew D. Bagdanov

    Abstract: Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, which results in feature drift which…

    Submitted 30 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted at Twelfth International Conference on Learning Representations (ICLR 2024)

  27. arXiv:2312.09608  [pdf, other]

    cs.CV

    Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

    Authors: Senmao Li, Taihang Hu, Joost van de Weijer, Fahad Shahbaz Khan, Tao Liu, Linxuan Li, Shiqi Yang, Yaxing Wang, Ming-Ming Cheng, Jian Yang

    Abstract: One of the main drawbacks of diffusion models is the slow inference time for image generation. Among the most successful approaches to addressing this problem are distillation methods. However, these methods require considerable computational resources. In this paper, we take another approach to diffusion model acceleration. We conduct a comprehensive study of the UNet encoder and empirically analy…

    Submitted 15 October, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2024

  28. arXiv:2311.15908  [pdf, other]

    cs.CV

    Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

    Authors: Claudio Rota, Marco Buzzelli, Joost van de Weijer

    Abstract: In this paper, we address the problem of enhancing perceptual quality in video super-resolution (VSR) using Diffusion Models (DMs) while ensuring temporal consistency among frames. We present StableVSR, a VSR method based on DMs that can significantly enhance the perceptual quality of upscaled videos by synthesizing realistic and temporally-consistent details. We introduce the Temporal Conditionin…

    Submitted 16 July, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted to ECCV 2024

  29. arXiv:2311.11908  [pdf, other]

    cs.LG cs.AI cs.CV

    Continual Learning: Applications and the Road Forward

    Authors: Eli Verwimp, Rahaf Aljundi, Shai Ben-David, Matthias Bethge, Andrea Cossu, Alexander Gepperth, Tyler L. Hayes, Eyke Hüllermeier, Christopher Kanan, Dhireesha Kudithipudi, Christoph H. Lampert, Martin Mundt, Razvan Pascanu, Adrian Popescu, Andreas S. Tolias, Joost van de Weijer, Bing Liu, Vincenzo Lomonaco, Tinne Tuytelaars, Gido M. van de Ven

    Abstract: Continual learning is a subfield of machine learning, which aims to allow machine learning models to continuously learn on new data, by accumulating knowledge without forgetting what was learned in the past. In this work, we take a step back, and ask: "Why should one care about continual learning in the first place?". We set the stage by examining recent continual learning papers published at four…

    Submitted 28 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Transactions on Machine Learning Research (TMLR), 2024

  30. AViTMP: A Tracking-Specific Transformer for Single-Branch Visual Tracking

    Authors: Chuanming Tang, Kai Wang, Joost van de Weijer, Jianlin Zhang, Yongmei Huang

    Abstract: Visual object tracking is a fundamental component of transportation systems, especially for intelligent driving. Despite achieving state-of-the-art performance in visual tracking, recent single-branch trackers tend to overlook the weak prior assumptions associated with the Vision Transformer (ViT) encoder and inference pipeline in visual tracking. Moreover, the effectiveness of discriminative trac…

    Submitted 3 July, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: IEEE Transactions on Intelligent Vehicles

  31. arXiv:2310.19540  [pdf, other]

    cs.CV cs.GR

    IterInv: Iterative Inversion for Pixel-Level T2I Models

    Authors: Chuanming Tang, Kai Wang, Joost van de Weijer

    Abstract: Large-scale text-to-image diffusion models have been a ground-breaking development in generating convincing images following an input text prompt. The goal of image editing research is to give users control over the generated images by modifying the text prompt. Current image editing techniques predominantly hinge on DDIM inversion as a prevalent practice rooted in Latent Diffusion Models (LDM). H…

    Submitted 21 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted paper at ICME 2024

  32. arXiv:2310.13533  [pdf, other]

    cs.CV cs.AI cs.LG

    Technical Report for ICCV 2023 Visual Continual Learning Challenge: Continuous Test-time Adaptation for Semantic Segmentation

    Authors: Damian Sójka, Yuyang Liu, Dipam Goswami, Sebastian Cygert, Bartłomiej Twardowski, Joost van de Weijer

    Abstract: The goal of the challenge is to develop a test-time adaptation (TTA) method, which could adapt the model to gradually changing domains in video sequences for semantic segmentation task. It is based on a synthetic driving video dataset - SHIFT. The source model is trained on images taken during daytime in clear weather. Domain changes at test-time are mainly caused by varying weather conditions and…

    Submitted 20 October, 2023; originally announced October 2023.

  33. arXiv:2309.15664  [pdf, other]

    cs.CV

    Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing

    Authors: Kai Wang, Fei Yang, Shiqi Yang, Muhammad Atif Butt, Joost van de Weijer

    Abstract: Large-scale text-to-image generative models have been a ground-breaking development in generative AI, with diffusion models showing their astounding ability to synthesize convincing images following an input text prompt. The goal of image editing research is to give users control over the generated images by modifying the text prompt. Current image editing techniques are susceptible to unintended…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023. The code page: https://github.com/wangkai930418/DPL

  34. arXiv:2309.14062  [pdf, other]

    cs.CV cs.LG

    FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free Continual Learning

    Authors: Dipam Goswami, Yuyang Liu, Bartłomiej Twardowski, Joost van de Weijer

    Abstract: Exemplar-free class-incremental learning (CIL) poses several challenges since it prohibits the rehearsal of data from previous tasks and thus suffers from catastrophic forgetting. Recent approaches to incrementally learning the classifier by freezing the feature extractor after the first task have gained much attention. In this paper, we explore prototypical networks for CIL, which generate new cl…

    Submitted 12 January, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at NeurIPS 2023

  35. arXiv:2309.06086  [pdf, other]

    cs.LG cs.CV

    Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning

    Authors: Alex Gomez-Villa, Bartlomiej Twardowski, Kai Wang, Joost van de Weijer

    Abstract: Continuous unsupervised representation learning (CURL) research has greatly benefited from improvements in self-supervised learning (SSL) techniques. As a result, existing CURL methods using SSL can learn high-quality representations without any labels, but with a notable performance drop when learning on a many-tasks data stream. We hypothesize that this is caused by the regularization losses tha…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted at WACV2024

  36. arXiv:2309.02995  [pdf, other]

    cs.CV

    Continual Evidential Deep Learning for Out-of-Distribution Detection

    Authors: Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost Van de Weijer

    Abstract: Uncertainty-based deep learning models have attracted a great deal of interest for their ability to provide accurate and reliable predictions. Evidential deep learning stands out achieving remarkable performance in detecting out-of-distribution (OOD) data with a single deterministic neural network. Motivated by this fact, in this paper we propose the integration of an evidential deep learning meth…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at Visual Continual Learning workshop (ICCV2023)

  37. arXiv:2309.00528  [pdf, other]

    cs.CV

    Trust your Good Friends: Source-free Domain Adaptation by Reciprocal Neighborhood Clustering

    Authors: Shiqi Yang, Yaxing Wang, Joost van de Weijer, Luis Herranz, Shangling Jui, Jian Yang

    Abstract: Domain adaptation (DA) aims to alleviate the domain shift between source domain and target domain. Most DA methods require access to the source data, but often that is not possible (e.g. due to data privacy or intellectual property). In this paper, we address the challenging source-free domain adaptation (SFDA) problem, where the source pretrained model is adapted to the target domain in the absen…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE TPAMI, extended version of conference paper arXiv:2110.04202

  38. arXiv:2308.16567  [pdf, other]

    cs.CV

    ScrollNet: Dynamic Weight Importance for Continual Learning

    Authors: Fei Yang, Kai Wang, Joost van de Weijer

    Abstract: The principle underlying most existing continual learning (CL) methods is to prioritize stability by penalizing changes in parameters crucial to old tasks, while allowing for plasticity in other parameters. The importance of weights for each task can be determined either explicitly through learning a task-specific mask during training (e.g., parameter isolation-based approaches) or implicitly by i…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted at Visual Continual Learning workshop (ICCV2023)

  39. arXiv:2308.10328  [pdf, other]

    cs.LG

    A Comprehensive Empirical Evaluation on Online Continual Learning

    Authors: Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Hamed Hemati, Vincenzo Lomonaco, Joost Van de Weijer

    Abstract: Online continual learning aims to get closer to a live learning experience by learning directly on a stream of data with temporally shifting distribution and by storing a minimum amount of data from that stream. In this empirical evaluation, we evaluate various methods from the literature that tackle online continual learning. More specifically, we focus on the class-incremental setting in the con…

    Submitted 23 September, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: ICCV Visual Continual Learning Workshop 2023 accepted paper

  40. arXiv:2307.12427  [pdf, other]

    cs.CV cs.LG

    Augmented Box Replay: Overcoming Foreground Shift for Incremental Object Detection

    Authors: Liu Yuyang, Cong Yang, Goswami Dipam, Liu Xialei, Joost van de Weijer

    Abstract: In incremental learning, replaying stored samples from previous tasks together with current task samples is one of the most efficient approaches to address catastrophic forgetting. However, unlike incremental classification, image replay has not been successfully applied to incremental object detection (IOD). In this paper, we identify the overlooked problem of foreground shift as the main reason… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

    Journal ref: 2023 International Conference on Computer Vision (ICCV)

  41. arXiv:2306.16817  [pdf, other

    cs.LG cs.AI

    Improving Online Continual Learning Performance and Stability with Temporal Ensembles

    Authors: Albin Soutif--Cormerais, Antonio Carta, Joost Van de Weijer

    Abstract: Neural networks are very effective when trained on large datasets for a large number of iterations. However, when they are trained on non-stationary streams of data and in an online fashion, their performance is reduced (1) by the online setup, which limits the availability of data, and (2) by catastrophic forgetting caused by the non-stationary nature of the data. Furthermore, several recent wor… ▽ More

    Submitted 3 July, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: CoLLAs 2023 accepted paper

  42. arXiv:2304.05255  [pdf, other

    cs.CV

    Density Map Distillation for Incremental Object Counting

    Authors: Chenshen Wu, Joost van de Weijer

    Abstract: We investigate the problem of incremental learning for object counting, where a method must learn to count a variety of object classes from a sequence of datasets. A naïve approach to incremental object counting would suffer from catastrophic forgetting, that is, a dramatic performance drop on previous tasks. In this paper, we propose a new exemplar-free functional regularization… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR2023: Workshop on Continual Learning in Computer Vision

  43. arXiv:2303.15888  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Projected Latent Distillation for Data-Agnostic Consolidation in Distributed Continual Learning

    Authors: Antonio Carta, Andrea Cossu, Vincenzo Lomonaco, Davide Bacciu, Joost van de Weijer

    Abstract: Distributed learning on the edge often comprises self-centered devices (SCDs) which learn local tasks independently and are unwilling to contribute to the performance of other SCDs. How do we achieve forward transfer at zero cost for the single SCDs? We formalize this problem as a Distributed Continual Learning scenario, where SCDs adapt to local tasks and a CL model consolidates the knowledge from… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  44. StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

    Authors: Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang, Ming-Ming Cheng

    Abstract: A significant research effort is focused on exploiting the amazing capacities of pretrained diffusion models for the editing of images. They either finetune the model, or invert the image in the latent space of the pretrained model. However, they suffer from two problems: (1) Unsatisfying results for selected regions and unexpected changes in non-selected regions. (2) They require careful text promp… ▽ More

    Submitted 6 December, 2024; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by Computational Visual Media

  45. arXiv:2303.15012  [pdf, other

    cs.CV

    3D-Aware Multi-Class Image-to-Image Translation with NeRFs

    Authors: Senmao Li, Joost van de Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, Jian Yang

    Abstract: Recent advances in 3D-aware generative models (3D-aware GANs) combined with Neural Radiance Fields (NeRF) have achieved impressive results. However, no prior works investigate 3D-aware GANs for 3D consistent multi-class image-to-image (3D-aware I2I) translation. Naively using 2D-I2I translation methods suffers from unrealistic shape/identity change. To perform 3D-aware multi-class I2I translation,… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023

  46. arXiv:2303.07811  [pdf, other

    cs.LG cs.AI cs.CV

    ICICLE: Interpretable Class Incremental Continual Learning

    Authors: Dawid Rymarczyk, Joost van de Weijer, Bartosz Zieliński, Bartłomiej Twardowski

    Abstract: Continual learning enables incremental learning of new tasks without forgetting those previously learned, resulting in positive knowledge transfer that can enhance performance on both new and old tasks. However, continual learning poses new challenges for interpretability, as the rationale behind model predictions may change over time, leading to interpretability concept drift. We address this pro… ▽ More

    Submitted 31 July, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted to ICCV 2023

  47. arXiv:2302.00353  [pdf, other

    cs.LG cs.CV

    Towards Label-Efficient Incremental Learning: A Survey

    Authors: Mert Kilickaya, Joost van de Weijer, Yuki M. Asano

    Abstract: The current dominant paradigm when building a machine learning model is to iterate over a dataset over and over until convergence. Such an approach is non-incremental, as it assumes access to all images of all categories at once. However, for many applications, non-incremental learning is unrealistic. To that end, researchers study incremental learning, where a learner is required to adapt to an i… ▽ More

    Submitted 11 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

  48. arXiv:2211.12292  [pdf, other

    cs.CV

    Exemplar-free Continual Learning of Vision Transformers via Gated Class-Attention and Cascaded Feature Drift Compensation

    Authors: Marco Cotogni, Fei Yang, Claudio Cusano, Andrew D. Bagdanov, Joost van de Weijer

    Abstract: We propose a new method for exemplar-free class incremental training of ViTs. The main challenge of exemplar-free continual learning is maintaining plasticity of the learner without causing catastrophic forgetting of previously learned tasks. This is often achieved via exemplar replay which can help recalibrate previous task classifiers to the feature drift which occurs when learning new tasks. Ex… ▽ More

    Submitted 27 July, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

  49. arXiv:2210.07207  [pdf, other

    cs.CV

    Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segmentation

    Authors: Dipam Goswami, René Schuster, Joost van de Weijer, Didier Stricker

    Abstract: In class-incremental semantic segmentation (CISS), deep learning architectures suffer from the critical problems of catastrophic forgetting and semantic background shift. Although recent works focused on these issues, existing classifier initialization methods do not address the background shift problem and assign the same initialization weights to both background and new foreground class classifi… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023

  50. arXiv:2210.01600  [pdf, other

    cs.CV

    Positive Pair Distillation Considered Harmful: Continual Meta Metric Learning for Lifelong Object Re-Identification

    Authors: Kai Wang, Chenshen Wu, Andy Bagdanov, Xialei Liu, Shiqi Yang, Shangling Jui, Joost van de Weijer

    Abstract: Lifelong object re-identification incrementally learns from a stream of re-identification tasks. The objective is to learn a representation that can be applied to all tasks and that generalizes to previously unseen re-identification tasks. The main challenge is that at inference time the representation must generalize to previously unseen identities. To address this problem, we apply continual met… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: BMVC 2022