当前位置: X-MOL 学术IEEE Trans. Pattern Anal. Mach. Intell. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Meta-Wrapper: Differentiable Wrapping Operator for User Interest Selection in CTR Prediction.
IEEE Transactions on Pattern Analysis and Machine Intelligence ( IF 23.6 ) Pub Date : 2022-10-04 , DOI: 10.1109/tpami.2021.3103741
Tianwei Cao , Qianqian Xu , Zhiyong Yang , Qingming Huang

Click-through rate (CTR) prediction, whose goal is to predict the probability of the user to click on an item, has become increasingly significant in the recommender systems. Recently, some deep learning models with the ability to automatically extract the user interest from his/her behaviors have achieved great success. In these work, the attention mechanism is used to select the user interested items in historical behaviors, improving the performance of the CTR predictor. Normally, these attentive modules can be jointly trained with the base predictor by using gradient descents. In this paper, we regard user interest modeling as a feature selection problem, which we call user interest selection. For such a problem, we propose a novel approach under the framework of the wrapper method, which is named Meta-Wrapper. More specifically, we use a differentiable module as our wrapping operator and then recast its learning problem as a continuous bilevel optimization. Moreover, we use a meta-learning algorithm to solve the optimization and theoretically prove its convergence. Meanwhile, we also provide theoretical analysis to show that our proposed method 1) efficiencies the wrapper-based feature selection, and 2) achieves better resistance to overfitting. Finally, extensive experiments on three public datasets manifest the superiority of our method in boosting the performance of CTR prediction.

中文翻译:

Meta-Wrapper:用于 CTR 预测中用户兴趣选择的可区分包装运算符。

点击率 (CTR) 预测的目标是预测用户点击某个项目的概率,在推荐系统中变得越来越重要。最近,一些能够自动从用户行为中提取用户兴趣的深度学习模型取得了巨大成功。在这些工作中,注意机制用于选择用户历史行为中感兴趣的项目,提高了 CTR 预测器的性能。通常,这些注意力模块可以通过使用梯度下降与基础预测器联合训练。在本文中,我们将用户兴趣建模视为一个特征选择问题,我们称之为用户兴趣选择。针对这样的问题,我们在wrapper方法的框架下提出了一种新的方法,命名为Meta-Wrapper。进一步来说,我们使用可微模块作为我们的包装运算符,然后将其学习问题重铸为连续的双层优化。此外,我们使用元学习算法来解决优化并从理论上证明其收敛性。同时,我们还提供了理论分析,表明我们提出的方法 1) 提高了基于包装器的特征选择的效率,以及 2) 实现了更好的抗过度拟合能力。最后,在三个公共数据集上进行的大量实验证明了我们的方法在提高 CTR 预测性能方面的优越性。我们还提供了理论分析,以表明我们提出的方法 1) 提高了基于包装器的特征选择的效率,以及 2) 更好地抵抗过度拟合。最后,在三个公共数据集上进行的大量实验证明了我们的方法在提高 CTR 预测性能方面的优越性。我们还提供了理论分析,以表明我们提出的方法 1) 提高了基于包装器的特征选择的效率,以及 2) 更好地抵抗过度拟合。最后,在三个公共数据集上进行的大量实验证明了我们的方法在提高 CTR 预测性能方面的优越性。
更新日期:2021-08-10
down
wechat
bug