Boosting Fraud Detection in Mobile Payment with Prior Knowledge,Applied Sciences

当前位置： X-MOL 学术 › Appl. Sci. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Boosting Fraud Detection in Mobile Payment with Prior Knowledge
Applied Sciences ( IF 2.838 ) Pub Date : 2021-05-11 , DOI: 10.3390/app11104347
Quan Sun , Tao Tang , Hongfeng Chai , Jie Wu , Yang Chen

With the prevalence of mobile e-commerce, fraudulent transactions conducted by robots are becoming increasingly common in mobile payments, which is severely undermining market fairness and resulting in financial losses. It has become a difficult problem for mobile applications to identify robotic automation accurately and efficiently from a massive number of transactions. The current research does not propose any effective method or engineering implementation. In this article, an extension to boost algorithms is presented that permits the incorporation of prior human knowledge as a means of compensating for a training data shortage and improving prediction results. Prior human knowledge is accumulated from historical fraud transactions or transferred from different domains in the form of expert rules and blacklists. The knowledge is applied to extract risk features from transaction data, risk features together with normal features are input into the boosting algorithm to perform training, and therefore we incorporate boosting algorithm with prior human knowledge to improve the performance of the model. For the first time we verified the effectiveness of the method via a widely deployed mobile APP with 150+ million users, and by taking experiments on a certain dataset, the extended boosting model shows an accuracy increase from 0.9825 to 0.9871 and a recall rate increase from 0.888 to 0.948. We also investigated feature differences between robots and normal users and we discovered the behavior patterns of robotic automation that include less spatial motion detected by device sensors (1/10 of normal user pattern), higher IP group-clustering ratio (60% in robots vs. 15% in normal users), higher jailbroken device rate (92.47% vs. 4.64%), more irregular device names and fewer IP address changes. The quantitative analysis result is helpful for APP developers and service providers to understand and prevent fraudulent transactions from robotic automation.This article proposed an optimized boosting model, which has better use in the field of robotic automation detection of mobile phones. By combining prior knowledge and feature importance analysis, the model is more robust when the actual dataset is unbalanced or with few-short samples. The model is also more explainable as feature analysis is available which can be used for generating disposal rules in the actual fake mobile user blocking systems.

中文翻译：

借助先验知识促进移动支付中的欺诈检测

随着移动电子商务的普及，由机器人进行的欺诈性交易在移动支付中变得越来越普遍，这严重破坏了市场公平性并造成了财务损失。对于移动应用程序而言，从大量事务中准确有效地识别机械手自动化已成为一个难题。当前的研究没有提出任何有效的方法或工程实施。在本文中，提出了对Boost算法的扩展，该算法允许将先前的人类知识作为补偿训练数据短缺和改善预测结果的一种手段。先前的人类知识是从历史欺诈交易中积累的，或者是从不同领域以专家规则和黑名单的形式转移的。该知识用于从交易数据中提取风险特征，将风险特征和正常特征一起输入到Boosting算法中以进行训练，因此我们将Boosting算法与先前的人类知识相结合以改善模型的性能。我们首次通过广泛部署的150亿以上用户的移动APP验证了该方法的有效性，并且通过对特定数据集进行实验，扩展的Boosting模型显示出准确度从0.9825提高到0.9871，召回率增加了0.888至0.948。我们还研究了机器人与正常用户之间的功能差异，并发现了机器人自动化的行为模式，其中包括设备传感器检测到的空间运动较少（正常用户模式的1/10），IP组聚集率更高（机器人为60％，普通用户为15％），越狱设备率（92.47％对4.64％），更多不规则设备名称和更少IP地址更改。定量分析的结果有助于APP开发人员和服务提供商了解并防止机器人自动化进行欺诈交易。本文提出了一种优化的增强模型，该模型在手机机器人自动化检测领域具有较好的应用价值。通过将先验知识与特征重要性分析相结合，当实际数据集不平衡或样本较少时，该模型将更加健壮。该模型还具有更多的可解释性，因为可以使用特征分析，该特征分析可用于在实际的伪造的移动用户阻止系统中生成处置规则。47％和4.64％），更多不规则的设备名称和更少的IP地址更改。定量分析的结果有助于APP开发人员和服务提供商了解并防止机器人自动化进行欺诈交易。本文提出了一种优化的增强模型，该模型在手机机器人自动化检测领域具有较好的应用价值。通过将先验知识与特征重要性分析相结合，当实际数据集不平衡或样本较少时，该模型将更加健壮。该模型还具有更多的可解释性，因为可以使用特征分析，该特征分析可用于在实际的伪造的移动用户阻止系统中生成处置规则。47％和4.64％），更多不规则的设备名称和更少的IP地址更改。定量分析的结果有助于APP开发人员和服务提供商了解并防止机器人自动化进行欺诈交易。本文提出了一种优化的增强模型，该模型在手机机器人自动化检测领域具有较好的应用价值。通过将先验知识与特征重要性分析相结合，当实际数据集不平衡或样本较少时，该模型将更加健壮。该模型还具有更多的可解释性，因为可以使用特征分析，该特征分析可用于在实际的伪造的移动用户阻止系统中生成处置规则。定量分析的结果有助于APP开发人员和服务提供商了解并防止机器人自动化进行欺诈交易。本文提出了一种优化的增强模型，该模型在手机机器人自动化检测领域具有较好的应用价值。通过将先验知识与特征重要性分析相结合，当实际数据集不平衡或样本较少时，该模型将更加健壮。该模型还具有更多的可解释性，因为可以使用特征分析，该特征分析可用于在实际的伪造的移动用户阻止系统中生成处置规则。定量分析的结果有助于APP开发人员和服务提供商了解并防止机器人自动化进行欺诈交易。本文提出了一种优化的增强模型，该模型在手机机器人自动化检测领域具有较好的应用价值。通过将先验知识与特征重要性分析相结合，当实际数据集不平衡或样本较少时，该模型将更加健壮。该模型还具有更多的可解释性，因为可以使用特征分析，该特征分析可用于在实际的伪造的移动用户阻止系统中生成处置规则。当实际数据集不平衡或样本较少时，该模型会更健壮。该模型还具有更多的可解释性，因为可以使用特征分析，该特征分析可用于在实际的伪造的移动用户阻止系统中生成处置规则。当实际数据集不平衡或样本较少时，该模型会更健壮。该模型还具有更多的可解释性，因为可以使用特征分析，该特征分析可用于在实际的伪造的移动用户阻止系统中生成处置规则。

更新日期：2021-05-11

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文

全部期刊列表>>