当前位置: X-MOL 学术Comput. Secur. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
An effective genetic algorithm-based feature selection method for intrusion detection systems
Computers & Security ( IF 4.8 ) Pub Date : 2021-08-26 , DOI: 10.1016/j.cose.2021.102448
Zahid Halim 1 , Muhammad Nadeem Yousaf 1 , Muhammad Waqas 2, 3 , Muhammad Sulaiman 1, 4 , Ghulam Abbas 2 , Masroor Hussain 1 , Iftekhar Ahmad 5 , Muhammad Hanif 1
Affiliation  

Availability of suitable and validated data is a key issue in multiple domains for implementing machine learning methods. Higher data dimensionality has adverse effects on the learning algorithm's performance. This work aims to design a method that preserves most of the unique information related to the data with minimum number of features. Addressing the feature selection problem in the domain of network security and intrusion detection, this work contributes an enhanced Genetic Algorithm (GA)-based feature selection method, named as GA-based Feature Selection (GbFS), to increase the classifiers’ accuracy. Securing a network from the cyber-attacks is a critical task and needs to be strengthened. Machine learning, due to its proven results, is widely used in developing firewalls and Intrusion Detection Systems (IDSs) to identify new kinds of attacks. Utilizing machine learning algorithms, IDSs are able to detect the intruder by analyzing the network traffic passing through it. This work presents parameter tuning for the GA-based feature selection along with a novel fitness function. The present work develops an enhanced GA-based feature selection method which is tested over three benchmark network traffic datasets, namely, CIRA-CIC-DOHBrw-2020, UNSW-NB15, and Bot-IoT. A comparison is also performed with the standard feature selection methods. Results show that the accuracies improve using GbFS by achieving a maximum accuracy of 99.80%.



中文翻译:

一种有效的基于遗传算法的入侵检测系统特征选择方法

合适且经过验证的数据的可用性是实施机器学习方法的多个领域中的关键问题。较高的数据维数对学习算法的性能有不利影响。这项工作旨在设计一种方法,以最少的特征数保留与数据相关的大部分独特信息。针对网络安全和入侵检测领域的特征选择问题,这项工作贡献了一种基于遗传算法(GA)的特征选择方法,称为基于遗传算法的特征选择(GbFS),以提高分类器的准确性。保护网络免受网络攻击是一项关键任务,需要加强。机器学习,由于其经过验证的结果,广泛用于开发防火墙和入侵检测系统 (IDS) 以识别新型攻击。利用机器学习算法,IDS 能​​够通过分析通过它的网络流量来检测入侵者。这项工作提出了基于 GA 的特征选择的参数调整以及一种新颖的适应度函数。目前的工作开发了一种基于 GA 的增强特征选择方法,该方法在三个基准网络流量数据集上进行了测试,即 CIRA-CIC-DOHBrw-2020、UNSW-NB15 和 Bot-IoT。还与标准特征选择方法进行了比较。结果表明,通过实现 99.80% 的最大准确度,使用 GbFS 提高了准确度。这项工作提出了基于 GA 的特征选择的参数调整以及一种新颖的适应度函数。目前的工作开发了一种基于 GA 的增强特征选择方法,该方法在三个基准网络流量数据集上进行了测试,即 CIRA-CIC-DOHBrw-2020、UNSW-NB15 和 Bot-IoT。还与标准特征选择方法进行了比较。结果表明,通过实现 99.80% 的最大准确度,使用 GbFS 提高了准确度。这项工作提出了基于 GA 的特征选择的参数调整以及一种新颖的适应度函数。目前的工作开发了一种基于 GA 的增强特征选择方法,该方法在三个基准网络流量数据集上进行了测试,即 CIRA-CIC-DOHBrw-2020、UNSW-NB15 和 Bot-IoT。还与标准特征选择方法进行了比较。结果表明,通过实现 99.80% 的最大准确度,使用 GbFS 提高了准确度。

更新日期:2021-09-04
down
wechat
bug