Adacc: Adaptive Compression and Activation Checkpointing for LLM Memory Management

Chen, Ping; Deng, Zhuohong; Li, Ping; He, Shuibing; Zhu, Hongzi; Zheng, Yi; Wang, Zhefeng; Huai, Baoxing; Guo, Minyi

arXiv:2508.00806v1 (cs)

[Submitted on 1 Aug 2025 (this version), latest version 8 Aug 2025 (v2)]

Abstract: Training large language models often relies on recomputation to relieve memory pressure, which can introduce up to 30% overhead in real-world scenarios. In this paper, we propose Adacc, a novel memory management framework that combines adaptive compression and activation checkpointing to reduce the GPU memory footprint. It comprises three modules: (1) We design layer-specific compression algorithms that account for outliers in LLM tensors, instead of directly quantizing floats from FP16 to INT4, to preserve model accuracy. (2) We propose an optimal scheduling policy that uses mixed-integer linear programming (MILP) to determine the best memory optimization for each tensor. (3) To accommodate changes in training tensors, we introduce an adaptive policy evolution mechanism that adjusts the policy during training to enhance throughput. Experimental results show that Adacc accelerates LLM training by 1.01x to 1.37x compared to state-of-the-art frameworks, while maintaining model accuracy comparable to the baseline.
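
The abstract's point about outliers can be made concrete with a small sketch. The code below is not the Adacc compression algorithm, which the abstract does not specify; it is a generic illustration, under assumed names (`quantize_int4`, `compress_with_outliers`, and the toy `activations` list are all hypothetical), of why quantizing a tensor directly to INT4 loses precision when a few values are much larger than the rest, and how storing those values separately avoids it.

```python
# Illustrative sketch only: NOT the Adacc algorithm. All names are hypothetical.

def quantize_int4(values):
    """Symmetric per-tensor INT4 quantization; returns (codes, scale)."""
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    # Clamp to the signed 4-bit range [-8, 7].
    codes = [max(-8, min(7, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map INT4 codes back to floats."""
    return [c * scale for c in codes]


def compress_with_outliers(values, num_outliers):
    """Keep the largest-magnitude entries exactly; quantize the rest to INT4."""
    order = sorted(range(len(values)), key=lambda i: abs(values[i]), reverse=True)
    outliers = {i: values[i] for i in order[:num_outliers]}
    # Zero out the outlier positions so they do not inflate the scale.
    dense = [0.0 if i in outliers else v for i, v in enumerate(values)]
    codes, scale = quantize_int4(dense)
    return codes, scale, outliers


def decompress(codes, scale, outliers):
    """Dequantize, then write the exactly stored outliers back in place."""
    restored = dequantize(codes, scale)
    for i, v in outliers.items():
        restored[i] = v
    return restored


def max_abs_error(original, restored):
    return max(abs(a - b) for a, b in zip(original, restored))


# Toy activation vector with one large outlier (assumed data).
activations = [0.1, -0.2, 0.05, 0.3, -0.15, 40.0]

naive = dequantize(*quantize_int4(activations))
aware = decompress(*compress_with_outliers(activations, num_outliers=1))

naive_error = max_abs_error(activations, naive)
aware_error = max_abs_error(activations, aware)
print(f"naive INT4 max error:    {naive_error:.4f}")
print(f"outlier-aware max error: {aware_error:.4f}")
```

In the naive path the single outlier sets the quantization scale, so the small values all collapse to zero; keeping the outlier in full precision lets the scale fit the remaining values. A real system would also need to account for the memory cost of storing outlier indices and values.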

Comments:	8 pages
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2508.00806 [cs.LG]
	(or arXiv:2508.00806v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2508.00806

Submission history

From: Ping Chen
[v1] Fri, 1 Aug 2025 17:39:25 UTC (3,252 KB)
[v2] Fri, 8 Aug 2025 09:49:52 UTC (1,118 KB)
