Nonstationary data stream classification with online active learning and siamese neural networks

Malialis, Kleanthis; Panayiotou, Christos G.; Polycarpou, Marios M.

doi:10.1016/j.neucom.2022.09.065

arXiv:2210.01090 (cs)

[Submitted on 3 Oct 2022]

Authors: Kleanthis Malialis, Christos G. Panayiotou, Marios M. Polycarpou

Abstract: Recent years have seen an ever-growing volume of information become available as data streams across many application areas. As a result, there is an emerging need for online learning methods that train predictive models on the fly. A series of open challenges, however, hinders their deployment in practice: learning as data arrive in real time one by one, learning from data with limited ground truth information, learning from nonstationary data, and learning from severely imbalanced data, all while occupying a limited amount of memory for data storage. We propose the ActiSiamese algorithm, which addresses these challenges by combining online active learning, siamese networks, and a multi-queue memory. It introduces a new density-based active learning strategy that considers similarity in the latent (rather than the input) space. We conduct an extensive study that compares different active learning budgets and strategies, performance with and without memory, and performance with and without ensembling, on both synthetic and real-world datasets, under different data nonstationarity characteristics and class imbalance levels. ActiSiamese outperforms baseline and state-of-the-art algorithms, and is effective under severe imbalance, even when only a fraction of the arriving instances' labels is available. We publicly release our code to the community.
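
To make two of the ingredients named in the abstract more concrete, the sketch below illustrates a multi-queue memory (one bounded queue per class, so a minority class keeps its examples no matter how many majority-class instances arrive) and a budget-constrained label request. It is not the authors' implementation: the class names `MultiQueueMemory` and `BudgetedLabeller`, the parameters `capacity_per_class`, `budget`, and `threshold`, and the simple confidence-threshold rule are all assumptions made for illustration. In particular, the confidence rule stands in for the paper's density-based strategy in the latent space, which is not reproduced here.

```python
from collections import deque


class MultiQueueMemory:
    """Hypothetical sketch: one bounded FIFO queue per class label."""

    def __init__(self, capacity_per_class):
        self.capacity = capacity_per_class
        self.queues = {}  # label -> deque of stored instances

    def add(self, x, label):
        # deque(maxlen=...) discards the oldest item once the queue is full,
        # so each class occupies at most `capacity` slots.
        queue = self.queues.setdefault(label, deque(maxlen=self.capacity))
        queue.append(x)

    def size(self):
        return sum(len(q) for q in self.queues.values())


class BudgetedLabeller:
    """Hypothetical sketch: request a label only while under budget.

    Assumption: a plain confidence threshold replaces the paper's
    density-based strategy, purely to keep the example short.
    """

    def __init__(self, budget, threshold):
        self.budget = budget        # maximum fraction of instances to label
        self.threshold = threshold  # query when confidence falls below this
        self.seen = 0
        self.queried = 0

    def should_query(self, confidence):
        self.seen += 1
        spent = self.queried / self.seen  # fraction of the stream labelled so far
        if spent < self.budget and confidence < self.threshold:
            self.queried += 1
            return True
        return False


# Usage: an imbalanced toy stream where class 1 appears only once.
memory = MultiQueueMemory(capacity_per_class=2)
for x, y in [("a", 0), ("b", 0), ("c", 0), ("d", 1)]:
    memory.add(x, y)

labeller = BudgetedLabeller(budget=0.5, threshold=0.9)
decisions = [labeller.should_query(0.1) for _ in range(4)]
```

In this toy run the class-0 queue retains only its two most recent instances while the single class-1 instance is kept, and the labeller stops requesting labels whenever half of the instances seen so far have already been labelled.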

Comments:	Keywords: Incremental learning, Active learning, Data streams, Concept drift, Class imbalance
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2210.01090 [cs.LG]
	(or arXiv:2210.01090v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.01090
Journal reference:	Neurocomputing, Volume 512, Pages 235-252, 2022
Related DOI:	https://doi.org/10.1016/j.neucom.2022.09.065

Submission history

From: Kleanthis Malialis
[v1] Mon, 3 Oct 2022 17:16:03 UTC (13,484 KB)
