Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection

Dinh, Phai Vu; Nguyen, Quang Uy; Dinh, Thai Hoang; Nguyen, Diep N.; Pham, Bao Son; Dutkiewicz, Eryk

Computer Science > Cryptography and Security

arXiv:2403.15509 (cs)

[Submitted on 22 Mar 2024 (v1), last revised 28 Apr 2025 (this version, v2)]

Title:Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection

Authors:Phai Vu Dinh, Quang Uy Nguyen, Thai Hoang Dinh, Diep N. Nguyen, Bao Son Pham, Eryk Dutkiewicz

View PDF HTML (experimental)

Abstract:Representation learning (RL) methods for cyberattack detection face the diversity and sophistication of attack data, leading to the issue of mixed representations of different classes, particularly as the number of classes increases. To address this, the paper proposes a novel deep learning architecture/model called the Twin Auto-Encoder (TAE). TAE first maps the input data into latent space and then deterministically shifts data samples of different classes further apart to create separable data representations, referred to as representation targets. TAE's decoder then projects the input data into these representation targets. After training, TAE's decoder extracts data representations. TAE's representation target serves as a novel dynamic codeword, which refers to the vector that represents a specific class. This vector is updated after each training epoch for every data sample, in contrast to the conventional fixed codeword that does not incorporate information from the input data. We conduct extensive experiments on diverse cybersecurity datasets, including seven IoT botnet datasets, two network IDS datasets, three malware datasets, one cloud DDoS dataset, and ten artificial datasets as the number of classes increases. TAE boosts accuracy and F-score in attack detection by around 2% compared to state-of-the-art models, achieving up to 96.1% average accuracy in IoT attack detection. Additionally, TAE is well-suited for cybersecurity applications and potentially for IoT systems, with a model size of approximately 1 MB and an average running time of around 2.6E-07 seconds for extracting a data sample.

Subjects:	Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2403.15509 [cs.CR]
	(or arXiv:2403.15509v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2403.15509

Submission history

From: Dinh Phai Vu [view email]
[v1] Fri, 22 Mar 2024 03:39:40 UTC (2,617 KB)
[v2] Mon, 28 Apr 2025 22:51:30 UTC (2,713 KB)

Computer Science > Cryptography and Security

Title:Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators