A Certified Unlearning Approach without Access to Source Data

Basaran, Umit Yigit; Ahmed, Sk Miraj; Roy-Chowdhury, Amit; Guler, Basak

Computer Science > Machine Learning

arXiv:2506.06486 (cs)

[Submitted on 6 Jun 2025]

Title:A Certified Unlearning Approach without Access to Source Data

Authors:Umit Yigit Basaran, Sk Miraj Ahmed, Amit Roy-Chowdhury, Basak Guler

View PDF HTML (experimental)

Abstract:With the growing adoption of data privacy regulations, the ability to erase private or copyrighted information from trained models has become a crucial requirement. Traditional unlearning methods often assume access to the complete training dataset, which is unrealistic in scenarios where the source data is no longer available. To address this challenge, we propose a certified unlearning framework that enables effective data removal \final{without access to the original training data samples}. Our approach utilizes a surrogate dataset that approximates the statistical properties of the source data, allowing for controlled noise scaling based on the statistical distance between the two. \updated{While our theoretical guarantees assume knowledge of the exact statistical distance, practical implementations typically approximate this distance, resulting in potentially weaker but still meaningful privacy guarantees.} This ensures strong guarantees on the model's behavior post-unlearning while maintaining its overall utility. We establish theoretical bounds, introduce practical noise calibration techniques, and validate our method through extensive experiments on both synthetic and real-world datasets. The results demonstrate the effectiveness and reliability of our approach in privacy-sensitive settings.

Comments:	Accepted by ICML 2025
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2506.06486 [cs.LG]
	(or arXiv:2506.06486v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2506.06486

Submission history

From: Umit Basaran [view email]
[v1] Fri, 6 Jun 2025 19:22:47 UTC (1,372 KB)

Computer Science > Machine Learning

Title:A Certified Unlearning Approach without Access to Source Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Certified Unlearning Approach without Access to Source Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators