A survey and benchmarking study of multitreatment uplift modeling,Data Mining and Knowledge Discovery

当前位置： X-MOL 学术 › Data Min. Knowl. Discov. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

A survey and benchmarking study of multitreatment uplift modeling
Data Mining and Knowledge Discovery ( IF 2.8 ) Pub Date : 2020-01-13 , DOI: 10.1007/s10618-019-00670-y
Diego Olaya , Kristof Coussement , Wouter Verbeke

Uplift modeling is an instrument used to estimate the change in outcome due to a treatment at the individual entity level. Uplift models assist decision-makers in optimally allocating scarce resources. This allows the selection of the subset of entities for which the effect of a treatment will be largest and, as such, the maximization of the overall returns. The literature on uplift modeling mostly focuses on queries concerning the effect of a single treatment and rarely considers situations where more than one treatment alternative is utilized. This article surveys the current literature on multitreatment uplift modeling and proposes two novel techniques: the naive uplift approach and the multitreatment modified outcome approach. Moreover, a benchmarking experiment is performed to contrast the performances of different multitreatment uplift modeling techniques across eight data sets from various domains. We verify and, if needed, correct the imbalance among the pretreatment characteristics of the treatment groups by means of optimal propensity score matching, which ensures a correct interpretation of the estimated uplift. Conventional and recently proposed evaluation metrics are adapted to the multitreatment scenario to assess performance. None of the evaluated techniques consistently outperforms other techniques. Hence, it is concluded that performance largely depends on the context and problem characteristics. The newly proposed techniques are found to offer similar performances compared to state-of-the-art approaches.

中文翻译：

多处理隆升模型的调查和基准研究

隆起模型是一种用于估计由于个体实体级别的治疗而导致的结果变化的工具。提升模型可帮助决策者最佳地分配稀缺资源。这允许选择将对其产生最大影响的实体子集，从而使整体收益最大化。关于隆起模型的文献主要集中于关于单一治疗效果的查询，很少考虑使用多个治疗替代方案的情况。本文概述了有关多治疗隆起模型的最新文献，并提出了两种新颖的技术：朴素的隆起方法和多治疗改良结局方法。此外，进行基准测试，以对比来自不同领域的八个数据集的不同多处理隆升建模技术的性能。我们通过最佳倾向评分匹配来验证并根据需要纠正治疗组的预处理特征之间的不平衡，从而确保对估计的隆起进行正确的解释。常规和最近提出的评估指标适用于多处理方案以评估性能。所评估的技术均无法始终胜过其他技术。因此，可以得出结论，绩效很大程度上取决于环境和问题特征。发现最新提出的技术与最新技术相比具有类似的性能。我们通过最佳倾向评分匹配来验证并根据需要纠正治疗组的预处理特征之间的不平衡，从而确保对估计的隆起进行正确的解释。常规和最近提出的评估指标适用于多处理方案以评估性能。所评估的技术均无法始终胜过其他技术。因此，可以得出结论，绩效很大程度上取决于环境和问题特征。发现最新提出的技术与最新技术相比具有类似的性能。我们通过最佳倾向评分匹配来验证并根据需要纠正治疗组的预处理特征之间的不平衡，从而确保对估计的隆起进行正确的解释。常规的和最近提出的评估指标适用于多处理方案以评估性能。所评估的技术均无法始终胜过其他技术。因此，可以得出结论，绩效很大程度上取决于环境和问题特征。发现最新提出的技术与最新技术相比具有类似的性能。常规和最近提出的评估指标适用于多处理方案以评估性能。所评估的技术均无法始终胜过其他技术。因此，可以得出结论，绩效很大程度上取决于环境和问题特征。发现最新提出的技术与最新技术相比具有类似的性能。常规和最近提出的评估指标适用于多处理方案以评估性能。所评估的技术均无法始终胜过其他技术。因此，可以得出结论，绩效很大程度上取决于环境和问题特征。发现最新提出的技术与最新技术相比具有类似的性能。

更新日期：2020-01-13

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11