Machine Learning and Nature Inspired Based Phishing Detection: A Literature Survey,International Journal on Artificial Intelligence Tools

当前位置： X-MOL 学术 › Int. J. Artif. Intell. Tools › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Machine Learning and Nature Inspired Based Phishing Detection: A Literature Survey
International Journal on Artificial Intelligence Tools ( IF 1.1 ) Pub Date : 2019-08-30 , DOI: 10.1142/s0218213019300023
Andronicus A. Akinyelu ₁

Affiliation

Phishing detection remains one of the most challenging problems currently faced by cyber users as sophisticated phishing schemes evolves rapidly. This paper presents a comprehensive survey of different phishing detection techniques with a focus of identifying gaps and proffering solutions to the current phishing detection problems. As shown in the survey, numerous techniques have been proposed to tackle phishing, including Machine Learning (ML) based, Nature Inspired (NI) based, heuristic-based, blacklist-based and whitelist-based techniques. However, ML-based techniques outperform other techniques in terms of classification accuracy due to their ability to analyze data contents, extract unknown or concealed patterns from the data and consequently turn the patterns into meaningful information for decision making. Unfortunately, ML algorithms cannot effectively handle big datasets, hence they can be combined with NI algorithms to build fast and improved models for phishing detection. Although, some surveys on phishing detection techniques exist, very few focused on ML-based and NI-based techniques. Therefore, this study presents a survey on ML-based and NI-based phishing detection techniques. The survey reveals the various shortcomings of phishing detection techniques, including limited dataset, use of third-party services (age of domain, search engine query, etc.), use of small feature set, use of classification rules, use of blacklist and whitelist, etc. There is an obvious need for efficient and reliable solutions for phishing detection. The recent success and prevalence of deep learning provides a rare opportunity for researchers seeking to design improved and effective phishing detection techniques. This study aims to empower the research community with suitable solutions and motivating insights that can be used to effectively tackle the phishing detection problem. It provides a good picture of the current state-of-the-art on phishing detection and serves as a reliable springboard for further studies.

中文翻译：

基于机器学习和自然启发的网络钓鱼检测：文献调查

随着复杂的网络钓鱼方案迅速发展，网络钓鱼检测仍然是网络用户当前面临的最具挑战性的问题之一。本文对不同的网络钓鱼检测技术进行了全面调查，重点是识别差距并为当前的网络钓鱼检测问题提供解决方案。如调查所示，已经提出了许多技术来应对网络钓鱼，包括基于机器学习 (ML)、基于自然启发 (NI)、基于启发式、基于黑名单和基于白名单的技术。然而，基于 ML 的技术在分类准确性方面优于其他技术，因为它们能够分析数据内容、从数据中提取未知或隐藏的模式，从而将模式转化为有意义的决策信息。很遗憾，ML 算法不能有效地处理大数据集，因此它们可以与 NI 算法相结合，以构建快速和改进的网络钓鱼检测模型。尽管存在一些关于网络钓鱼检测技术的调查，但很少有人关注基于 ML 和基于 NI 的技术。因此，本研究对基于 ML 和基于 NI 的网络钓鱼检测技术进行了调查。调查揭示了网络钓鱼检测技术的各种缺点，包括数据集有限、使用第三方服务（域年龄、搜索引擎查询等）、使用小特征集、使用分类规则、使用黑名单和白名单等。显然需要有效和可靠的网络钓鱼检测解决方案。深度学习最近的成功和流行为寻求设计改进和有效的网络钓鱼检测技术的研究人员提供了难得的机会。本研究旨在为研究界提供合适的解决方案和启发性见解，可用于有效解决网络钓鱼检测问题。它很好地展示了当前网络钓鱼检测的最新技术，并作为进一步研究的可靠跳板。

更新日期：2019-08-30

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>