Search | arXiv e-print repository

EFC++: Elastic Feature Consolidation with Prototype Re-balancing for Cold Start Exemplar-free Incremental Learning

Authors: Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais, Joost van de Weijer, Andrew D. Bagdanov

Abstract: Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, resulting in feature drift which is… ▽ More Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, resulting in feature drift which is difficult to compensate for in the exemplar-free setting. To address this problem, we propose an effective approach to consolidate feature representations by regularizing drift in directions highly relevant to previous tasks and employs prototypes to reduce task-recency bias. Our approach, which we call Elastic Feature Consolidation++ (EFC++) exploits a tractable second-order approximation of feature drift based on a proposed Empirical Feature Matrix (EFM). The EFM induces a pseudo-metric in feature space which we use to regularize feature drift in important directions and to update Gaussian prototypes. In addition, we introduce a post-training prototype re-balancing phase that updates classifiers to compensate for feature drift. Experimental results on CIFAR-100, Tiny-ImageNet, ImageNet-Subset, ImageNet-1K and DomainNet demonstrate that EFC++ is better able to learn new tasks by maintaining model plasticity and significantly outperform the state-of-the-art. △ Less

Submitted 15 March, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

Comments: Under Review since July 2024. Extension of our previous conference paper https://openreview.net/forum?id=7D9X2cFnt1

arXiv:2409.18664 [pdf, other]

How green is continual learning, really? Analyzing the energy consumption in continual training of vision foundation models

Authors: Tomaso Trinci, Simone Magistri, Roberto Verdecchia, Andrew D. Bagdanov

Abstract: With the ever-growing adoption of AI, its impact on the environment is no longer negligible. Despite the potential that continual learning could have towards Green AI, its environmental sustainability remains relatively uncharted. In this work we aim to gain a systematic understanding of the energy efficiency of continual learning algorithms. To that end, we conducted an extensive set of empirical… ▽ More With the ever-growing adoption of AI, its impact on the environment is no longer negligible. Despite the potential that continual learning could have towards Green AI, its environmental sustainability remains relatively uncharted. In this work we aim to gain a systematic understanding of the energy efficiency of continual learning algorithms. To that end, we conducted an extensive set of empirical experiments comparing the energy consumption of recent representation-, prompt-, and exemplar-based continual learning algorithms and two standard baseline (fine tuning and joint training) when used to continually adapt a pre-trained ViT-B/16 foundation model. We performed our experiments on three standard datasets: CIFAR-100, ImageNet-R, and DomainNet. Additionally, we propose a novel metric, the Energy NetScore, which we use measure the algorithm efficiency in terms of energy-accuracy trade-off. Through numerous evaluations varying the number and size of the incremental learning steps, our experiments demonstrate that different types of continual learning algorithms have very different impacts on energy consumption during both training and inference. Although often overlooked in the continual learning literature, we found that the energy consumed during the inference phase is crucial for evaluating the environmental sustainability of continual learning models. △ Less

Submitted 27 September, 2024; originally announced September 2024.

Comments: This manuscript has been accepted at the Green FOundation MOdels (GreenFOMO) ECCV 2024 Workshop

arXiv:2402.03917 [pdf, other]

Elastic Feature Consolidation for Cold Start Exemplar-Free Incremental Learning

Authors: Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais, Joost van de Weijer, Andrew D. Bagdanov

Abstract: Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, which results in feature drift which… ▽ More Exemplar-Free Class Incremental Learning (EFCIL) aims to learn from a sequence of tasks without having access to previous task data. In this paper, we consider the challenging Cold Start scenario in which insufficient data is available in the first task to learn a high-quality backbone. This is especially challenging for EFCIL since it requires high plasticity, which results in feature drift which is difficult to compensate for in the exemplar-free setting. To address this problem, we propose a simple and effective approach that consolidates feature representations by regularizing drift in directions highly relevant to previous tasks and employs prototypes to reduce task-recency bias. Our method, called Elastic Feature Consolidation (EFC), exploits a tractable second-order approximation of feature drift based on an Empirical Feature Matrix (EFM). The EFM induces a pseudo-metric in feature space which we use to regularize feature drift in important directions and to update Gaussian prototypes used in a novel asymmetric cross entropy loss which effectively balances prototype rehearsal with data from new tasks. Experimental results on CIFAR-100, Tiny-ImageNet, ImageNet-Subset and ImageNet-1K demonstrate that Elastic Feature Consolidation is better able to learn new tasks by maintaining model plasticity and significantly outperform the state-of-the-art. △ Less

Submitted 30 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: Accepted at Twelfth International Conference on Learning Representations (ICLR 2024)

arXiv:1907.13154 [pdf, ps, other]

doi 10.1090/tran/8346

Zeros of slice functions and polynomials over dual quaternions

Authors: Graziano Gentili, Caterina Stoppato, Tomaso Trinci

Abstract: This work studies the zeros of slice functions over the algebra of dual quaternions and it comprises applications to the problem of factorizing motion polynomials. The class of slice functions over an alternative $*$-algebra $A$ was defined by Ghiloni and Perotti in 2011, extending the class of slice regular functions introduced by Gentili and Struppa in 2006. Both classes strictly include the pol… ▽ More This work studies the zeros of slice functions over the algebra of dual quaternions and it comprises applications to the problem of factorizing motion polynomials. The class of slice functions over an alternative $*$-algebra $A$ was defined by Ghiloni and Perotti in 2011, extending the class of slice regular functions introduced by Gentili and Struppa in 2006. Both classes strictly include the polynomials over $A$. We focus on the case when $A$ is the algebra of dual quaternions $\mathbb{DH}$. The specific properties of this algebra allow a full characterization of the zero sets, which is not available over general alternative $*$-algebras. This characterization sheds some light on the study of motion polynomials over $\mathbb{DH}$, introduced by Hegedüs, Schicho, and Schröcker in 2013 for their relevance in mechanism science. △ Less

Submitted 9 November, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

Comments: 37 pages, to appear in Trans. Amer. Math. Soc

MSC Class: 30G35 (Primary); 70B15 (Secondary)

Journal ref: Trans. Amer. Math. Soc., 374(8):5509--5544 (2021)

Showing 1–4 of 4 results for author: Trinci, T