Efficient Sketches for Training Data Attribution and Studying the Loss Landscape

Schioppa, Andrea

arXiv:2402.03994 (cs)

[Submitted on 6 Feb 2024 (v1), last revised 23 Oct 2024 (this version, v2)]

Abstract: The study of modern machine learning models often necessitates storing vast quantities of gradients or Hessian vector products (HVPs). Traditional sketching methods struggle to scale under these memory constraints. We present a novel framework for scalable gradient and HVP sketching, tailored for modern hardware. We provide theoretical guarantees and demonstrate the power of our methods in applications like training data attribution, Hessian spectrum analysis, and intrinsic dimension computation for pre-trained language models. Our work sheds new light on the behavior of pre-trained language models, challenging assumptions about their intrinsic dimensionality and Hessian properties.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2402.03994 [cs.LG]
	(or arXiv:2402.03994v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.03994
Journal reference:	NeurIPS 2024

Submission history

From: Andrea Schioppa
[v1] Tue, 6 Feb 2024 13:47:12 UTC (354 KB)
[v2] Wed, 23 Oct 2024 13:44:08 UTC (816 KB)
