-
EFC++: Elastic Feature Consolidation with Prototype Re-balancing for Cold Start Exemplar-free Incremental Learning
Authors:
Simone Magistri,
Tomaso Trinci,
Albin Soutif-Cormerais,
Joost van de Weijer,
Andrew D. Bagdanov
Abstract:
Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, resulting in feature drift which is…
▽ More
Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, resulting in feature drift which is difficult to compensate for in the exemplar-free setting. To address this problem, we propose an effective approach to consolidate feature representations by regularizing drift in directions highly relevant to previous tasks and employs prototypes to reduce task-recency bias. Our approach, which we call Elastic Feature Consolidation++ (EFC++) exploits a tractable second-order approximation of feature drift based on a proposed Empirical Feature Matrix (EFM). The EFM induces a pseudo-metric in feature space which we use to regularize feature drift in important directions and to update Gaussian prototypes. In addition, we introduce a post-training prototype re-balancing phase that updates classifiers to compensate for feature drift. Experimental results on CIFAR-100, Tiny-ImageNet, ImageNet-Subset, ImageNet-1K and DomainNet demonstrate that EFC++ is better able to learn new tasks by maintaining model plasticity and significantly outperform the state-of-the-art.
△ Less
Submitted 15 March, 2025; v1 submitted 13 March, 2025;
originally announced March 2025.
-
How green is continual learning, really? Analyzing the energy consumption in continual training of vision foundation models
Authors:
Tomaso Trinci,
Simone Magistri,
Roberto Verdecchia,
Andrew D. Bagdanov
Abstract:
With the ever-growing adoption of AI, its impact on the environment is no longer negligible. Despite the potential that continual learning could have towards Green AI, its environmental sustainability remains relatively uncharted. In this work we aim to gain a systematic understanding of the energy efficiency of continual learning algorithms. To that end, we conducted an extensive set of empirical…
▽ More
With the ever-growing adoption of AI, its impact on the environment is no longer negligible. Despite the potential that continual learning could have towards Green AI, its environmental sustainability remains relatively uncharted. In this work we aim to gain a systematic understanding of the energy efficiency of continual learning algorithms. To that end, we conducted an extensive set of empirical experiments comparing the energy consumption of recent representation-, prompt-, and exemplar-based continual learning algorithms and two standard baseline (fine tuning and joint training) when used to continually adapt a pre-trained ViT-B/16 foundation model. We performed our experiments on three standard datasets: CIFAR-100, ImageNet-R, and DomainNet. Additionally, we propose a novel metric, the Energy NetScore, which we use measure the algorithm efficiency in terms of energy-accuracy trade-off. Through numerous evaluations varying the number and size of the incremental learning steps, our experiments demonstrate that different types of continual learning algorithms have very different impacts on energy consumption during both training and inference. Although often overlooked in the continual learning literature, we found that the energy consumed during the inference phase is crucial for evaluating the environmental sustainability of continual learning models.
△ Less
Submitted 27 September, 2024;
originally announced September 2024.
-
Elastic Feature Consolidation for Cold Start Exemplar-Free Incremental Learning
Authors:
Simone Magistri,
Tomaso Trinci,
Albin Soutif-Cormerais,
Joost van de Weijer,
Andrew D. Bagdanov
Abstract:
Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, which results in feature drift which…
▽ More
Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, which results in feature drift which is difficult to compensate for in the exemplar-free setting. To address this problem, we propose a simple and effective approach that consolidates feature representations by regularizing drift in directions highly relevant to previous tasks and employs prototypes to reduce task-recency bias. Our method, called Elastic Feature Consolidation (EFC), exploits a tractable second-order approximation of feature drift based on an Empirical Feature Matrix (EFM). The EFM induces a pseudo-metric in feature space which we use to regularize feature drift in important directions and to update Gaussian prototypes used in a novel asymmetric cross entropy loss which effectively balances prototype rehearsal with data from new tasks. Experimental results on CIFAR-100, Tiny-ImageNet, ImageNet-Subset and ImageNet-1K demonstrate that Elastic Feature Consolidation is better able to learn new tasks by maintaining model plasticity and significantly outperform the state-of-the-art.
△ Less
Submitted 30 May, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Zeros of slice functions and polynomials over dual quaternions
Authors:
Graziano Gentili,
Caterina Stoppato,
Tomaso Trinci
Abstract:
This work studies the zeros of slice functions over the algebra of dual quaternions and it comprises applications to the problem of factorizing motion polynomials. The class of slice functions over an alternative $*$-algebra $A$ was defined by Ghiloni and Perotti in 2011, extending the class of slice regular functions introduced by Gentili and Struppa in 2006. Both classes strictly include the pol…
▽ More
This work studies the zeros of slice functions over the algebra of dual quaternions and it comprises applications to the problem of factorizing motion polynomials. The class of slice functions over an alternative $*$-algebra $A$ was defined by Ghiloni and Perotti in 2011, extending the class of slice regular functions introduced by Gentili and Struppa in 2006. Both classes strictly include the polynomials over $A$. We focus on the case when $A$ is the algebra of dual quaternions $\mathbb{DH}$. The specific properties of this algebra allow a full characterization of the zero sets, which is not available over general alternative $*$-algebras. This characterization sheds some light on the study of motion polynomials over $\mathbb{DH}$, introduced by Hegedüs, Schicho, and Schröcker in 2013 for their relevance in mechanism science.
△ Less
Submitted 9 November, 2020; v1 submitted 30 July, 2019;
originally announced July 2019.