A Hassle-free Algorithm for Private Learning in Practice: Don't Use Tree Aggregation, Use BLTs

McMahan, H. Brendan; Xu, Zheng; Zhang, Yanxiang

Computer Science > Machine Learning

arXiv:2408.08868 (cs)

[Submitted on 16 Aug 2024 (v1), last revised 29 May 2025 (this version, v3)]

Title:A Hassle-free Algorithm for Private Learning in Practice: Don't Use Tree Aggregation, Use BLTs

Authors:H. Brendan McMahan, Zheng Xu, Yanxiang Zhang

View PDF HTML (experimental)

Abstract:The state-of-the-art for training on-device language models for mobile keyboard applications combines federated learning (FL) with differential privacy (DP) via the DP-Follow-the-Regularized-Leader (DP-FTRL) algorithm. Two variants of DP-FTRL are used in practice, tree aggregation and matrix factorization. However, tree aggregation suffers from significantly suboptimal privacy/utility tradeoffs, while matrix mechanisms require expensive optimization parameterized by hard-to-estimate-in-advance constants, and high runtime memory costs. This paper extends the recently introduced Buffered Linear Toeplitz (BLT) mechanism to multi-participation scenarios. Our BLT-DP-FTRL maintains the ease-of-use advantages of tree aggregation, while essentially matching matrix factorization in terms of utility and privacy. We evaluate BLT-DP-FTRL on the StackOverflow dataset, serving as a re-producible simulation benchmark, and across four on-device language model tasks in a production FL system. Our empirical results highlight the advantages of the BLT mechanism and elevate the practicality and effectiveness of DP in real-world scenarios.

Comments:	v2: EMNLP camera ready, minor error and typo fix, updated production model launch info; v3: update production model launch info again to reflect the success of "upgrade all existing FL LMs that have previously been launched without DP to be trained with DP" in arXiv:2306.14793
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2408.08868 [cs.LG]
	(or arXiv:2408.08868v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.08868

Submission history

From: Zheng Xu [view email]
[v1] Fri, 16 Aug 2024 17:52:22 UTC (468 KB)
[v2] Fri, 3 Jan 2025 18:48:38 UTC (488 KB)
[v3] Thu, 29 May 2025 20:44:31 UTC (418 KB)

Computer Science > Machine Learning

Title:A Hassle-free Algorithm for Private Learning in Practice: Don't Use Tree Aggregation, Use BLTs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Hassle-free Algorithm for Private Learning in Practice: Don't Use Tree Aggregation, Use BLTs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators