RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks

Chakraborty, Amit; Ahamed, Sayyed Farid; Roy, Sandip; Banerjee, Soumya; Choi, Kevin; Rahman, Abdul; Hu, Alison; Bowen, Edward; Shetty, Sachin

Abstract:Machine Learning as a Service (MLaaS) enables users to leverage powerful machine learning models through cloud-based APIs, offering scalability and ease of deployment. However, these services are vulnerable to model extraction attacks, where adversaries repeatedly query the application programming interface (API) to reconstruct a functionally similar model, compromising intellectual property and security. Despite various defense strategies being proposed, many suffer from high computational costs, limited adaptability to evolving attack techniques, and a reduction in performance for legitimate users. In this paper, we introduce a Resilient Adaptive Defense Framework for Model Extraction Attack Protection (RADEP), a multifaceted defense framework designed to counteract model extraction attacks through a multi-layered security approach. RADEP employs progressive adversarial training to enhance model resilience against extraction attempts. Malicious query detection is achieved through a combination of uncertainty quantification and behavioral pattern analysis, effectively identifying adversarial queries. Furthermore, we develop an adaptive response mechanism that dynamically modifies query outputs based on their suspicion scores, reducing the utility of stolen models. Finally, ownership verification is enforced through embedded watermarking and backdoor triggers, enabling reliable identification of unauthorized model use. Experimental evaluations demonstrate that RADEP significantly reduces extraction success rates while maintaining high detection accuracy with minimal impact on legitimate queries. Extensive experiments show that RADEP effectively defends against model extraction attacks and remains resilient even against adaptive adversaries, making it a reliable security framework for MLaaS models.

Comments:	Presented at the IEEE International Wireless Communications and Mobile Computing Conference (IWCMC) 2025
Subjects:	Cryptography and Security (cs.CR)
ACM classes:	I.2.6; D.4.6; K.6.5
Cite as:	arXiv:2505.19364 [cs.CR]
	(or arXiv:2505.19364v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2505.19364

Computer Science > Cryptography and Security

Title:RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators