Consistent Individualized Feature Attribution for Tree Ensembles

Lundberg, Scott M.; Erion, Gabriel G.; Lee, Su-In

arXiv:1802.03888 (cs)

[Submitted on 12 Feb 2018 (v1), last revised 7 Mar 2019 (this version, v3)]

Abstract: Interpreting predictions from tree ensemble methods such as gradient boosting machines and random forests is important, yet feature attribution for trees is often heuristic and not individualized for each prediction. Here we show that popular feature attribution methods are inconsistent, meaning they can lower a feature's assigned importance when the true impact of that feature actually increases. This is a fundamental problem that casts doubt on any comparison between features. To address it we turn to recent applications of game theory and develop fast exact tree solutions for SHAP (SHapley Additive exPlanation) values, which are the unique consistent and locally accurate attribution values. We then extend SHAP values to interaction effects and define SHAP interaction values. We propose a rich visualization of individualized feature attributions that improves over classic attribution summaries and partial dependence plots, and a unique "supervised" clustering (clustering based on feature attributions). We demonstrate better agreement with human intuition through a user study, exponential improvements in run time, improved clustering performance, and better identification of influential features. An implementation of our algorithm has also been merged into XGBoost and LightGBM; see this http URL for details.
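
As a rough illustration of the "locally accurate" property named in the abstract, the sketch below computes Shapley values for a single prediction by brute-force enumeration of feature subsets. It is not the paper's fast tree algorithm: it is exponential in the number of features and is only meant to show what the attribution values are. The model `toy_model`, the uniform `background` data, and the choice to handle missing features by averaging over background rows are all assumptions made for this example.

```python
from itertools import combinations
from math import factorial


def toy_model(x):
    """Hypothetical two-feature model: outputs 80 only when both features equal 1."""
    fever, cough = x
    return 80.0 if (fever == 1 and cough == 1) else 0.0


def expected_value(model, x, subset, background):
    """Mean model output with features in `subset` fixed to x's values
    and all other features taken from each background row (an assumption)."""
    total = 0.0
    for row in background:
        z = [x[i] if i in subset else row[i] for i in range(len(x))]
        total += model(z)
    return total / len(background)


def shapley_values(model, x, background):
    """Brute-force Shapley values: weighted average of each feature's
    marginal contribution over all subsets of the other features."""
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Classic Shapley weight |S|! (n - |S| - 1)! / n!
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                gain = (expected_value(model, x, s | {i}, background)
                        - expected_value(model, x, s, background))
                phi[i] += weight * gain
    return phi


# Assumed background data: both binary features independent and equally likely.
background = [(0, 0), (0, 1), (1, 0), (1, 1)]
x = (1, 1)

phi = shapley_values(toy_model, x, background)
base = expected_value(toy_model, x, set(), background)
print(phi, base)  # [30.0, 30.0] 20.0
```

Here the two attributions sum to 60, which equals the prediction (80) minus the average output over the background (20). That additive identity is the local accuracy property; the symmetric split reflects that the two features play identical roles in this toy model.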

Comments:	Follow-up to 2017 ICML Workshop arXiv:1706.06060
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1802.03888 [cs.LG]
	(or arXiv:1802.03888v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1802.03888

Submission history

From: Scott Lundberg
[v1] Mon, 12 Feb 2018 04:23:03 UTC (4,076 KB)
[v2] Mon, 18 Jun 2018 15:40:29 UTC (4,076 KB)
[v3] Thu, 7 Mar 2019 00:06:09 UTC (4,076 KB)
