CAMAL: Optimizing LSM-trees via Active Learning

Yu, Weiping; Luo, Siqiang; Yu, Zihao; Cong, Gao

arXiv:2409.15130 (cs)

[Submitted on 23 Sep 2024]

Abstract: We use machine learning to optimize the LSM-tree structure, aiming to reduce the cost of processing various read/write operations. We introduce a new approach, Camal, with the following features: (1) ML-Aided: Camal is the first attempt to apply active learning to tune LSM-tree based key-value stores. The learning process is coupled with traditional cost models to improve training; (2) Decoupled Active Learning: backed by rigorous analysis, Camal adopts an active learning paradigm based on decoupled tuning of each parameter, which further accelerates the learning process; (3) Easy Extrapolation: Camal adopts an effective mechanism to incrementally update the model as the data size grows; (4) Dynamic Mode: Camal can tune the LSM-tree online under dynamically changing workloads; (5) Significant System Improvement: integrating Camal into a full system, RocksDB, improves system performance by 28% on average and up to 8x compared to a state-of-the-art RocksDB design.

Comments:	SIGMOD 2025
Subjects:	Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2409.15130 [cs.DB]
	(or arXiv:2409.15130v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2409.15130

Submission history

From: Weiping Yu Mr.
[v1] Mon, 23 Sep 2024 15:35:23 UTC (2,035 KB)
