Uncovering Gradient Inversion Risks in Practical Language Model Training

Feng, Xinguo; Ma, Zhongkui; Wang, Zihan; Chegne, Eu Joe; Ma, Mengyao; Abuadbba, Alsharif; Bai, Guangdong

doi:10.1145/3658644.3690292

Computer Science > Machine Learning

arXiv:2507.21198 (cs)

[Submitted on 28 Jul 2025]

Title:Uncovering Gradient Inversion Risks in Practical Language Model Training

Authors:Xinguo Feng, Zhongkui Ma, Zihan Wang, Eu Joe Chegne, Mengyao Ma, Alsharif Abuadbba, Guangdong Bai

View PDF

Abstract:The gradient inversion attack has been demonstrated as a significant privacy threat to federated learning (FL), particularly in continuous domains such as vision models. In contrast, it is often considered less effective or highly dependent on impractical training settings when applied to language models, due to the challenges posed by the discrete nature of tokens in text data. As a result, its potential privacy threats remain largely underestimated, despite FL being an emerging training method for language models. In this work, we propose a domain-specific gradient inversion attack named Grab (gradient inversion with hybrid optimization). Grab features two alternating optimization processes to address the challenges caused by practical training settings, including a simultaneous optimization on dropout masks between layers for improved token recovery and a discrete optimization for effective token sequencing. Grab can recover a significant portion (up to 92.9% recovery rate) of the private training data, outperforming the attack strategy of utilizing discrete optimization with an auxiliary model by notable improvements of up to 28.9% recovery rate in benchmark settings and 48.5% recovery rate in practical settings. Grab provides a valuable step forward in understanding this privacy threat in the emerging FL training mode of language models.

Comments:	15 Pages, 5 figures, 10 tables. Accepted by ACM CCS 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2507.21198 [cs.LG]
	(or arXiv:2507.21198v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2507.21198
Journal reference:	ACM CCM 2024
Related DOI:	https://doi.org/10.1145/3658644.3690292

Submission history

From: Xinguo Feng [view email]
[v1] Mon, 28 Jul 2025 06:06:29 UTC (2,446 KB)

Computer Science > Machine Learning

Title:Uncovering Gradient Inversion Risks in Practical Language Model Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Uncovering Gradient Inversion Risks in Practical Language Model Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators