当前位置: X-MOL 学术ACM Trans. Knowl. Discov. Data › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Unbiased Measurement of Feature Importance in Tree-Based Methods
ACM Transactions on Knowledge Discovery from Data ( IF 4.0 ) Pub Date : 2021-01-04 , DOI: 10.1145/3429445
Zhengze Zhou 1 , Giles Hooker 1
Affiliation  

We propose a modification that corrects for split-improvement variable importance measures in Random Forests and other tree-based methods. These methods have been shown to be biased towards increasing the importance of features with more potential splits. We show that by appropriately incorporating split-improvement as measured on out of sample data, this bias can be corrected yielding better summaries and screening tools.

中文翻译:

基于树的方法中特征重要性的无偏测量

我们提出了一种修改,以纠正随机森林和其他基于树的方法中的拆分改进变量重要性度量。这些方法已被证明偏向于增加具有更多潜在分裂的特征的重要性。我们表明,通过适当地结合对样本外数据进行测量的拆分改进,可以纠正这种偏差,从而产生更好的摘要和筛选工具。
更新日期:2021-01-04
down
wechat
bug