当前位置: X-MOL 学术Knowl. Based Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A filter-based feature construction and feature selection approach for classification using Genetic Programming
Knowledge-Based Systems ( IF 8.8 ) Pub Date : 2020-03-26 , DOI: 10.1016/j.knosys.2020.105806
Jianbin Ma , Xiaoying Gao

Feature construction and feature selection are two common pre-processing methods for classification. Genetic Programming (GP) can be used to solve feature construction and feature selection tasks due to its flexible representation. In this paper, a filter-based multiple feature construction approach using GP named FCM that stores top individuals is proposed, and a filter-based feature selection approach using GP named FS that uses correlation-based evaluation method is employed. A hybrid feature construction and feature selection approach named FCMFS that first constructs multiple features using FCM then selects effective features using FS is proposed. Experiments on nine datasets show that features selected by FS or constructed by FCM are all effective to improve the classification performance comparing with original features, and our proposed FCMFS can maintain the classification performance with smaller number of features comparing with FCM, and can obtain better classification performance with smaller number of features than FS on the majority of the nine datasets. Compared with another feature construction and feature selection approach named FSFCM that first selects features using FS then constructs features using FCM, FCMFS achieves better performance in terms of classification and the smaller number of features. The comparisons with three state-of-art techniques show that our proposed FCMFS approach can achieve better experimental results in most cases.



中文翻译:

基于遗传规划的基于过滤器的特征构造和特征选择方法

特征构造和特征选择是用于分类的两种常见预处理方法。遗传编程(GP)由于其灵活的表示方式,可用于解决特征构建和特征选择任务。在本文中,提出了一种使用基于FCM的GP的基于过滤器的多特征构建方法,该方法存储顶级个人,并且采用基于FS的GP的基于过滤器的多特征构建方法,该方法使用了基于相关性的评估方法。提出了一种名为FCMFS的混合特征构造和特征选择方法,该方法首先使用FCM构造多个特征,然后使用FS选择有效特征。在9个数据集上进行的实验表明,与原始特征相比,由FS选择或由FCM构造的特征均能有效提高分类性能,并且我们提出的FCMFS与FCM相比,可以以较少的特征数保持分类性能,并且在大多数9个数据集中,与FS相比,可以以较少的特征数获得更好的分类性能。与另一种称为FSFCM的特征构造和特征选择方法相比,该方法首先使用FS选择特征,然后使用FCM构造特征,FCMFS在分类和较少数量的特征方面实现了更好的性能。与三种最新技术的比较表明,我们提出的FCMFS方法可以在大多数情况下获得更好的实验结果。并且在9个数据集中的大多数数据上,与FS相比,可以以较少的特征获得更好的分类性能。与另一种称为FSFCM的特征构造和特征选择方法相比,该方法首先使用FS选择特征,然后使用FCM构造特征,FCMFS在分类和较少数量的特征方面实现了更好的性能。与三种最新技术的比较表明,我们提出的FCMFS方法可以在大多数情况下获得更好的实验结果。并且在9个数据集中的大多数数据上,与FS相比,可以以较少的特征获得更好的分类性能。与另一种称为FSFCM的特征构造和特征选择方法相比,该方法首先使用FS选择特征,然后使用FCM构造特征,FCMFS在分类和较少数量的特征方面实现了更好的性能。与三种最新技术的比较表明,我们提出的FCMFS方法可以在大多数情况下获得更好的实验结果。

更新日期:2020-03-27
down
wechat
bug