Bringing Differential Private SGD to Practice: On the Independence of Gaussian Noise and the Number of Training Rounds

van Dijk, Marten; Nguyen, Nhuong V.; Nguyen, Toan N.; Nguyen, Lam M.; Nguyen, Phuong Ha

arXiv:2102.09030v6 (cs)

[Submitted on 17 Feb 2021 (v1), revised 13 Jan 2023 (this version, v6), latest version 4 Jun 2024 (v10)]

Abstract:Unlike existing Differential Privacy (DP) accountants, which keep track of how the privacy budget has been spent, we introduce pro-active DP: a scheme that allows one to select the parameters of DP-SGD a priori, based on a fixed privacy budget (in terms of $\epsilon$ and $\delta$), so as to optimize the anticipated utility (test accuracy). To implement this idea, we show how to convert the classical DP moment accountant into a pro-active DP scheme by exploiting the fact that it has a simple closed form for computing the privacy budget spent for a given interaction round.
The DP moment accountant was introduced in the context of DP-SGD and has the following property, which is the key ingredient for building pro-active DP. In DP-SGD, each round communicates a local SGD update, which leaks some new information about the underlying local data set to the outside world. To provide privacy, Gaussian noise with standard deviation $\sigma$ is added to local SGD updates after performing a clipping operation and normalizing with the clipping constant. We show that, to attain $(\epsilon,\delta)$-differential privacy, $\sigma$ can be chosen equal to $\sqrt{2(\epsilon +\ln(1/\delta))/\epsilon}$ for $\epsilon=\Omega(T/N^2)$, where $T$ is the total number of rounds and $N$ is the size of the local data set. In many existing machine learning problems, $N$ is large and $T=O(N)$. Hence, $\sigma$ becomes "independent" of any choice of $T=O(N)$ with $\epsilon=\Omega(1/N)$. This means that our $\sigma$ depends only on $N$ rather than $T$. We show how this differential privacy characterization allows us to convert the DP moment accountant into a pro-active DP scheme.
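
The noise level stated in the abstract can be illustrated with a minimal Python sketch. The function names `proactive_sigma` and `clip_and_noise` are hypothetical and not taken from the paper. The second function is only one plausible reading of "clipping and normalizing with the clipping constant", applied to a single update vector; the paper's actual algorithm (for example, how mini-batches are aggregated) may differ. The sketch also does not check the regime condition $\epsilon=\Omega(T/N^2)$, because the abstract does not give the hidden constants.

```python
import math
import random


def proactive_sigma(epsilon: float, delta: float) -> float:
    """Return sqrt(2 * (epsilon + ln(1/delta)) / epsilon), the formula in the abstract.

    Note that the number of rounds T does not appear: within the regime
    epsilon = Omega(T / N^2), sigma is chosen from epsilon and delta alone.
    """
    if epsilon <= 0.0 or not (0.0 < delta < 1.0):
        raise ValueError("need epsilon > 0 and 0 < delta < 1")
    return math.sqrt(2.0 * (epsilon + math.log(1.0 / delta)) / epsilon)


def clip_and_noise(update, clip_c, sigma, rng):
    """Assumed reading of one noisy update (hypothetical helper).

    1. Clip the update so its L2 norm is at most clip_c.
    2. Normalize by dividing by clip_c, so the norm is at most 1.
    3. Add independent N(0, sigma^2) noise to each coordinate.
    """
    norm = math.sqrt(sum(x * x for x in update))
    scale = min(1.0, clip_c / norm) if norm > 0.0 else 1.0
    normalized = [x * scale / clip_c for x in update]
    return [x + rng.gauss(0.0, sigma) for x in normalized]


if __name__ == "__main__":
    sigma = proactive_sigma(epsilon=1.0, delta=1e-5)
    rng = random.Random(0)  # fixed seed only to make the demo repeatable
    noisy = clip_and_noise([3.0, 4.0], clip_c=1.0, sigma=sigma, rng=rng)
    print(f"sigma = {sigma:.4f}")
    print("noisy update:", noisy)
```

For $\epsilon = 1$ and $\delta = 10^{-5}$ the formula gives $\sigma \approx 5.0$. This illustrates the pro-active idea under the stated assumptions: the noise level is fixed from the budget before training, and the number of rounds can then be chosen freely within $T=O(N)$.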

Comments:	arXiv admin note: text overlap with arXiv:2007.09208. Change LaTeX template and author order
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2102.09030 [cs.LG]
	(or arXiv:2102.09030v6 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2102.09030

Submission history

From: Ngoc Toan Nguyen
[v1] Wed, 17 Feb 2021 21:19:39 UTC (18,148 KB)
[v2] Mon, 14 Jun 2021 21:05:08 UTC (1,727 KB)
[v3] Tue, 19 Oct 2021 23:07:08 UTC (1,755 KB)
[v4] Tue, 1 Feb 2022 00:27:02 UTC (18,773 KB)
[v5] Fri, 6 Jan 2023 17:40:58 UTC (2,174 KB)
[v6] Fri, 13 Jan 2023 08:13:13 UTC (18,908 KB)
[v7] Sun, 5 Mar 2023 18:44:52 UTC (18,791 KB)
[v8] Tue, 30 May 2023 04:19:24 UTC (18,957 KB)
[v9] Fri, 24 Nov 2023 11:25:40 UTC (18,934 KB)
[v10] Tue, 4 Jun 2024 07:41:47 UTC (18,780 KB)
