Projected Compression: Trainable Projection for Efficient Transformer Compression

Stefaniak, Maciej; Krutul, Michał; Małaśnicki, Jan; Pióro, Maciej; Krajewski, Jakub; Jaszczur, Sebastian; Cygan, Marek; Adamczewski, Kamil; Ludziejewski, Jan

Computer Science > Machine Learning

arXiv:2506.22255 (cs)

[Submitted on 27 Jun 2025]

Title:Projected Compression: Trainable Projection for Efficient Transformer Compression

Authors:Maciej Stefaniak, Michał Krutul, Jan Małaśnicki, Maciej Pióro, Jakub Krajewski, Sebastian Jaszczur, Marek Cygan, Kamil Adamczewski, Jan Ludziejewski

View PDF HTML (experimental)

Abstract:Large language models have steadily increased in size to achieve improved performance; however, this growth has also led to greater inference time and computational demands. Consequently, there is rising interest in model size reduction methods. To address this issue, we propose Projected Compression, a novel model compression technique, that reduces model weights by utilizing projection modules. Specifically, we first train additional trainable projections weights and preserve access to all the original model parameters. Subsequently, these projections are merged into a lower-dimensional product matrix, resulting in a reduced-size standard Transformer-based model. Unlike alternative approaches that require additional computational overhead, our method matches the base model's per-token computation step in FLOPs. Experimental results show that Projected Compression outperforms the comparable hard pruning and retraining approach on higher quality models. Moreover, the performance margin scales well with the number of tokens.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2506.22255 [cs.LG]
	(or arXiv:2506.22255v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2506.22255

Submission history

From: Michał Krutul [view email]
[v1] Fri, 27 Jun 2025 14:24:01 UTC (128 KB)

Computer Science > Machine Learning

Title:Projected Compression: Trainable Projection for Efficient Transformer Compression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Projected Compression: Trainable Projection for Efficient Transformer Compression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators