当前位置: X-MOL 学术J. Comput. Graph. Stat. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
MIP-BOOST: Efficient and Effective L0 Feature Selection for Linear Regression
Journal of Computational and Graphical Statistics ( IF 1.4 ) Pub Date : 2021-01-04 , DOI: 10.1080/10618600.2020.1845184
Ana Kenney 1 , Francesca Chiaromonte 2 , Giovanni Felici 3
Affiliation  

Abstract

Recent advances in mathematical programming have made mixed integer optimization a competitive alternative to popular regularization methods for selecting features in regression problems. The approach exhibits unquestionable foundational appeal and versatility, but also poses important challenges. Here, we propose MIP-BOOST, a revision of standard mixed integer programming feature selection that reduces the computational burden of tuning the critical sparsity bound parameter and improves performance in the presence of feature collinearity and of signals that vary in nature and strength. The final outcome is a more efficient and effective L0 feature selection method for applications of realistic size and complexity, grounded on rigorous cross-validation tuning and exact optimization of the associated mixed integer program. Computational viability and improved performance in realistic scenarios is achieved through three independent but synergistic proposals. Supplementary materials including additional results, pseudocode, and computer code are available online.



中文翻译:

MIP-BOOST:用于线性回归的高效且有效的 L0 特征选择

摘要

数学规划的最新进展使混合整数优化成为流行的正则化方法的竞争替代方案,用于在回归问题中选择特征。该方法展示了无可置疑的基础吸引力和多功能性,但也带来了重大挑战。在这里,我们提出了 MIP-BOOST,这是对标准混合整数规划特征选择的修订版,它减少了调整临界稀疏边界参数的计算负担,并在存在特征共线性以及性质和强度不同的信号的情况下提高了性能。最终结果是更高效和有效的L 0基于严格的交叉验证调整和相关混合整数程序的精确优化,用于实际大小和复杂性应用程序的特征选择方法。通过三个独立但协同的提案实现了现实场景中的计算可行性和改进的性能。在线提供包括附加结果、伪代码和计算机代码在内的补充材料。

更新日期:2021-01-04
down
wechat
bug