当前位置: X-MOL 学术Int. J. Mach. Learn. & Cyber. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Feature selection schema based on game theory and biology migration algorithm for regression problems
International Journal of Machine Learning and Cybernetics ( IF 5.6 ) Pub Date : 2020-08-12 , DOI: 10.1007/s13042-020-01174-8
Mohammad Masoud Javidi

Many real-world datasets nowadays are of regression type, while only a few dimensionality reduction methods have been developed for regression problems. On the other hand, most existing regression methods are based on the computation of the covariance matrix, rendering them inefficient in the reduction process. Therefore, a BMA-based multi-objective feature selection method, GBMA, is introduced by incorporating the Nash equilibrium approach. GBMA is intended to maximize model accuracy and minimize the number of features through a less complex procedure. The proposed method is composed of four steps. The first step involves defining three players, each of which is trying to improve its objective function (i.e., model error, number of features, and precision adjustment). The second step includes clustering features based on the correlation therebetween and detecting the most appropriate ordering of features to enhance cluster efficiency. The third step comprises extracting a new feature from each cluster based on various weighting methods (i.e., moderate, strict, and hybrid). Finally, the fourth step encompasses updating players based on stochastic search operators. The proposed GBMA strategy explores the search space and finds optimal solutions in an acceptable amount of time without examining every possible solution. The experimental results and statistical tests based on ten well-known datasets from the UCI repository proved the high performance of GBMA in selecting features for solving regression problems.



中文翻译:

基于博弈论和生物学迁移算法的回归问题特征选择方案

如今,许多现实世界的数据集都是回归类型的,而针对回归问题仅开发了少数降维方法。另一方面,大多数现有的回归方法都基于协方差矩阵的计算,这使得它们在约简过程中效率低下。因此,通过结合纳什均衡方法,引入了基于BMA的多目标特征选择方法GBMA。GBMA旨在通过不太复杂的过程来最大化模型准确性并最小化特征数量。所提出的方法包括四个步骤。第一步涉及定义三个参与者,每个参与者都在尝试改善其目标功能(即模型误差,特征数量和精度调整)。第二步包括基于特征之间的相关性对特征进行聚类,并检测特征的最适当顺序以增强聚类效率。第三步包括基于各种加权方法(即中等,严格和混合)从每个群集中提取新特征。最后,第四步包括基于随机搜索运算符更新播放器。提出的GBMA策略可探索搜索空间并在可接受的时间内找到最佳解决方案,而无需检查所有可能的解决方案。基于UCI储存库中十个著名数据集的实验结果和统计测试证明了GBMA在选择用于解决回归问题的特征方面的高性能。第三步包括基于各种加权方法(即中等,严格和混合)从每个群集中提取新特征。最后,第四步包括基于随机搜索运算符更新播放器。提出的GBMA策略可探索搜索空间并在可接受的时间内找到最佳解决方案,而无需检查所有可能的解决方案。基于UCI储存库中十个著名数据集的实验结果和统计测试证明了GBMA在选择用于解决回归问题的特征方面的高性能。第三步包括基于各种加权方法(即中等,严格和混合)从每个群集中提取新特征。最后,第四步包括基于随机搜索运算符更新播放器。提出的GBMA策略可探索搜索空间并在可接受的时间内找到最佳解决方案,而无需检查所有可能的解决方案。基于UCI储存库中十个著名数据集的实验结果和统计测试证明了GBMA在选择用于解决回归问题的特征方面的高性能。提出的GBMA策略可探索搜索空间并在可接受的时间内找到最佳解决方案,而无需检查所有可能的解决方案。基于UCI储存库中十个著名数据集的实验结果和统计测试证明了GBMA在选择用于解决回归问题的特征方面的高性能。提出的GBMA策略可探索搜索空间并在可接受的时间内找到最佳解决方案,而无需检查所有可能的解决方案。基于UCI储存库中十个著名数据集的实验结果和统计测试证明了GBMA在选择用于解决回归问题的特征方面的高性能。

更新日期:2020-08-12
down
wechat
bug