A Fast Sampling Gradient Tree Boosting Framework

Zhou, Daniel Chao; Jin, Zhongming; Zhang, Tong


arXiv:1911.08820 (cs)

[Submitted on 20 Nov 2019]


Authors: Daniel Chao Zhou, Zhongming Jin, Tong Zhang


Abstract: As an adaptive, interpretable, robust, and accurate meta-algorithm for arbitrary differentiable loss functions, gradient tree boosting is one of the most popular machine learning techniques, though its computational cost severely limits its usage. Stochastic gradient boosting can accelerate gradient boosting by uniformly sampling training instances, but its estimator can introduce high variance. This motivates us to optimize gradient tree boosting. We combine gradient tree boosting with importance sampling, which achieves better performance by reducing the stochastic variance. Furthermore, we use a regularizer to improve the diagonal approximation in the Newton step of gradient boosting. Theoretical analysis shows that our strategies achieve a linear convergence rate on logistic loss. Empirical results show that our algorithm achieves a 2.5x to 18x acceleration on two different gradient boosting algorithms (LogitBoost and LambdaMART) without appreciable performance loss.
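
To make the variance argument in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It compares two unbiased estimators of the summed logistic-loss gradient: one from uniformly sampled instances, and one from instances sampled with probability proportional to the gradient magnitude. That sampling distribution, the synthetic margins, and all function names are assumptions made for illustration; the abstract does not specify the distribution the paper uses.

```python
import numpy as np


def logistic_gradients(y, f):
    """Gradient of log(1 + exp(-y * f)) with respect to f, for labels y in {-1, +1}."""
    return -y / (1.0 + np.exp(y * f))


def uniform_estimate(g, m, rng):
    """Unbiased estimate of sum(g) from m instances drawn uniformly with replacement."""
    idx = rng.integers(0, g.size, size=m)
    return g.size * g[idx].mean()


def importance_estimate(g, m, rng):
    """Unbiased estimate of sum(g) from m instances drawn with probability
    proportional to |g| (an assumed choice, not taken from the paper)."""
    p = np.abs(g) / np.abs(g).sum()
    idx = rng.choice(g.size, size=m, p=p)
    # Dividing by p[idx] reweights each draw so the expectation equals sum(g).
    return (g[idx] / p[idx]).mean()


def compare_variances(n=10_000, m=200, trials=500, seed=0):
    """Return (uniform variance, importance variance) of the two estimators."""
    rng = np.random.default_rng(seed)
    y = rng.choice([-1.0, 1.0], size=n)
    # Synthetic margins y * f: most instances are already well classified,
    # so a few instances carry most of the gradient mass.
    margin = rng.normal(3.0, 2.0, size=n)
    g = logistic_gradients(y, y * margin)
    uni = [uniform_estimate(g, m, rng) for _ in range(trials)]
    imp = [importance_estimate(g, m, rng) for _ in range(trials)]
    return float(np.var(uni)), float(np.var(imp))
```

Calling `compare_variances()` returns the two empirical variances. Under these assumptions the importance-sampled estimator should have the smaller one, because by the Cauchy-Schwarz inequality (sum_i |g_i|)^2 <= n * sum_i g_i^2, which bounds its per-sample variance by that of the uniform estimator.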

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1911.08820 [cs.LG]
	(or arXiv:1911.08820v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.08820

Submission history

From: Daniel Chao Zhou
[v1] Wed, 20 Nov 2019 11:00:53 UTC (141 KB)
