Skip to main content

Showing 1–9 of 9 results for author: van de Ven, G M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.11756  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    On the Computation of the Fisher Information in Continual Learning

    Authors: Gido M. van de Ven

    Abstract: One of the most popular methods for continual learning with deep neural networks is Elastic Weight Consolidation (EWC), which involves computing the Fisher Information. The exact way in which the Fisher Information is computed is however rarely described, and multiple different implementations for it can be found online. This blog post discusses and empirically compares several often-used implemen… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: To appear in the blogpost track at ICLR 2025

  2. arXiv:2403.05175  [pdf, other

    cs.LG cs.AI cs.CV q-bio.NC stat.ML

    Continual Learning and Catastrophic Forgetting

    Authors: Gido M. van de Ven, Nicholas Soures, Dhireesha Kudithipudi

    Abstract: This book chapter delves into the dynamics of continual learning, which is the process of incrementally learning from a non-stationary stream of data. Although continual learning is a natural skill for the human brain, it is very challenging for artificial neural networks. An important reason is that, when learning something new, these networks tend to quickly and drastically forget what they had… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Preprint of a book chapter; 21 pages, 4 figures

  3. arXiv:2311.04898  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Two Complementary Perspectives to Continual Learning: Ask Not Only What to Optimize, But Also How

    Authors: Timm Hess, Tinne Tuytelaars, Gido M. van de Ven

    Abstract: Recent years have seen considerable progress in the continual training of deep neural networks, predominantly thanks to approaches that add replay or regularization terms to the loss function to approximate the joint loss over all tasks so far. However, we show that even with a perfect approximation to the joint loss, these approaches still suffer from temporary but substantial forgetting when sta… ▽ More

    Submitted 21 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Full paper version of pre-registered report accepted at the 1st ContinualAI Unconference. The originally submitted pre-registered proposal can be found at arXiv:2311.04898v1

    Journal ref: Proceedings of the 1st ContinualAI Unconference, 2023, PMLR 249: 37-61

  4. arXiv:2305.18806  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Prediction Error-based Classification for Class-Incremental Learning

    Authors: Michał Zając, Tinne Tuytelaars, Gido M. van de Ven

    Abstract: Class-incremental learning (CIL) is a particularly challenging variant of continual learning, where the goal is to learn to discriminate between all classes presented in an incremental fashion. Existing approaches often suffer from excessive forgetting and imbalance of the scores assigned to classes that have not been seen together during training. In this study, we introduce a novel approach, Pre… ▽ More

    Submitted 9 March, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: ICLR 2024 camera ready

  5. arXiv:2104.10093  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Class-Incremental Learning with Generative Classifiers

    Authors: Gido M. van de Ven, Zhe Li, Andreas S. Tolias

    Abstract: Incrementally training deep neural networks to recognize new classes is a challenging problem. Most existing class-incremental learning methods store data or use generative replay, both of which have drawbacks, while 'rehearsal-free' alternatives such as parameter regularization or bias-correction methods do not consistently achieve high performance. Here, we put forward a new strategy for class-i… ▽ More

    Submitted 28 April, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: To appear in the IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPR-W) on Continual Learning in Computer Vision (CLVision) 2021

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2021, pp. 3611-3620

  6. arXiv:2011.12216  [pdf, other

    cs.LG cs.AI stat.ML

    Energy-Based Models for Continual Learning

    Authors: Shuang Li, Yilun Du, Gido M. van de Ven, Igor Mordatch

    Abstract: We motivate Energy-Based Models (EBMs) as a promising model class for continual learning problems. Instead of tackling continual learning via the use of external memory, growing models, or regularization, EBMs change the underlying training objective to cause less interference with previously learned information. Our proposed version of EBMs for continual learning is simple, efficient, and outperf… ▽ More

    Submitted 18 December, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Project page: https://energy-based-model.github.io/Energy-Based-Models-for-Continual-Learning

    Journal ref: Proceedings of The 1st Conference on Lifelong Learning Agents, PMLR 199: 1-22, 2022

  7. arXiv:2004.12908  [pdf, other

    cs.AI cs.LG stat.ML

    Simple Lifelong Learning Machines

    Authors: Jayanta Dey, Joshua T. Vogelstein, Hayden S. Helm, Will LeVine, Ronak D. Mehta, Tyler M. Tomita, Haoyin Xu, Ali Geisa, Qingyang Wang, Gido M. van de Ven, Chenyu Gao, Bryan Tower, Jonathan Larson, Christopher M. White, Carey E. Priebe

    Abstract: In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain perf… ▽ More

    Submitted 20 April, 2025; v1 submitted 27 April, 2020; originally announced April 2020.

  8. arXiv:1904.07734  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Three scenarios for continual learning

    Authors: Gido M. van de Ven, Andreas S. Tolias

    Abstract: Standard artificial neural networks suffer from the well-known issue of catastrophic forgetting, making continual or lifelong learning difficult for machine learning. In recent years, numerous methods have been proposed for continual learning, but due to differences in evaluation protocols it is difficult to directly compare their performance. To enable more structured comparisons, we describe thr… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Extended version of work presented at the NeurIPS Continual Learning workshop (2018); 18 pages, 5 figures, 6 tables. Related to arXiv:1809.10635

  9. arXiv:1809.10635  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Generative replay with feedback connections as a general strategy for continual learning

    Authors: Gido M. van de Ven, Andreas S. Tolias

    Abstract: A major obstacle to developing artificial intelligence applications capable of true lifelong learning is that artificial neural networks quickly or catastrophically forget previously learned tasks when trained on a new one. Numerous methods for alleviating catastrophic forgetting are currently being proposed, but differences in evaluation protocols make it difficult to directly compare their perfo… ▽ More

    Submitted 17 April, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: 17 pages, 8 figures, 4 tables