TraceRAG: A LLM-Based Framework for Explainable Android Malware Detection and Behavior Analysis

Zhang, Guangyu; Wang, Xixuan; Sun, Shiyu; Xiao, Peiyan; Sun, Kun; Xiong, Yanhai

Abstract:Sophisticated evasion tactics in malicious Android applications, combined with their intricate behavioral semantics, enable attackers to conceal malicious logic within legitimate functions, underscoring the critical need for robust and in-depth analysis frameworks. However, traditional analysis techniques often fail to recover deeply hidden behaviors or provide human-readable justifications for their decisions. Inspired by advances in large language models (LLMs), we introduce TraceRAG, a retrieval-augmented generation (RAG) framework that bridges natural language queries and Java code to deliver explainable malware detection and analysis. First, TraceRAG generates summaries of method-level code snippets, which are indexed in a vector database. At query time, behavior-focused questions retrieve the most semantically relevant snippets for deeper inspection. Finally, based on the multi-turn analysis results, TraceRAG produces human-readable reports that present the identified malicious behaviors and their corresponding code implementations. Experimental results demonstrate that our method achieves 96\% malware detection accuracy and 83.81\% behavior identification accuracy based on updated VirusTotal (VT) scans and manual verification. Furthermore, expert evaluation confirms the practical utility of the reports generated by TraceRAG.

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2509.08865 [cs.SE]
	(or arXiv:2509.08865v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2509.08865

Computer Science > Software Engineering

Title:TraceRAG: A LLM-Based Framework for Explainable Android Malware Detection and Behavior Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators