Free Lunch in the Forest: Functionally-Identical Pruning of Boosted Tree Ensembles

Emine, Youssouf; Forel, Alexandre; Malek, Idriss; Vidal, Thibaut

Computer Science > Machine Learning

arXiv:2408.16167 (cs)

[Submitted on 28 Aug 2024 (v1), last revised 20 Jan 2025 (this version, v2)]

Title:Free Lunch in the Forest: Functionally-Identical Pruning of Boosted Tree Ensembles

Authors:Youssouf Emine, Alexandre Forel, Idriss Malek, Thibaut Vidal

View PDF

Abstract:Tree ensembles, including boosting methods, are highly effective and widely used for tabular data. However, large ensembles lack interpretability and require longer inference times. We introduce a method to prune a tree ensemble into a reduced version that is "functionally identical" to the original model. In other words, our method guarantees that the prediction function stays unchanged for any possible input. As a consequence, this pruning algorithm is lossless for any aggregated metric. We formalize the problem of functionally identical pruning on ensembles, introduce an exact optimization model, and provide a fast yet highly effective method to prune large ensembles. Our algorithm iteratively prunes considering a finite set of points, which is incrementally augmented using an adversarial model. In multiple computational experiments, we show that our approach is a "free lunch", significantly reducing the ensemble size without altering the model's behavior. Thus, we can preserve state-of-the-art performance at a fraction of the original model's size.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2408.16167 [cs.LG]
	(or arXiv:2408.16167v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.16167

Submission history

From: Youssouf Emine [view email]
[v1] Wed, 28 Aug 2024 23:15:46 UTC (182 KB)
[v2] Mon, 20 Jan 2025 19:09:12 UTC (626 KB)

Computer Science > Machine Learning

Title:Free Lunch in the Forest: Functionally-Identical Pruning of Boosted Tree Ensembles

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Free Lunch in the Forest: Functionally-Identical Pruning of Boosted Tree Ensembles

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators