Teach Me to Trick: Exploring Adversarial Transferability via Knowledge Distillation

Pradhan, Siddhartha; Shiwakoti, Shikshya; Bathuri, Neha

Computer Science > Machine Learning

arXiv:2507.21992 (cs)

[Submitted on 29 Jul 2025]

Title:Teach Me to Trick: Exploring Adversarial Transferability via Knowledge Distillation

Authors:Siddhartha Pradhan, Shikshya Shiwakoti, Neha Bathuri

View PDF HTML (experimental)

Abstract:We investigate whether knowledge distillation (KD) from multiple heterogeneous teacher models can enhance the generation of transferable adversarial examples. A lightweight student model is trained using two KD strategies: curriculum-based switching and joint optimization, with ResNet50 and DenseNet-161 as teachers. The trained student is then used to generate adversarial examples using FG, FGS, and PGD attacks, which are evaluated against a black-box target model (GoogLeNet). Our results show that student models distilled from multiple teachers achieve attack success rates comparable to ensemble-based baselines, while reducing adversarial example generation time by up to a factor of six. An ablation study further reveals that lower temperature settings and the inclusion of hard-label supervision significantly enhance transferability. These findings suggest that KD can serve not only as a model compression technique but also as a powerful tool for improving the efficiency and effectiveness of black-box adversarial attacks.

Comments:	10 pages, 4 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2507.21992 [cs.LG]
	(or arXiv:2507.21992v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2507.21992

Submission history

From: Shikshya Shiwakoti [view email]
[v1] Tue, 29 Jul 2025 16:43:54 UTC (649 KB)

Computer Science > Machine Learning

Title:Teach Me to Trick: Exploring Adversarial Transferability via Knowledge Distillation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Teach Me to Trick: Exploring Adversarial Transferability via Knowledge Distillation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators