Robust model selection in linear regression models using information complexity,Journal of Computational and Applied Mathematics

当前位置： X-MOL 学术 › J. Comput. Appl. Math. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Robust model selection in linear regression models using information complexity
Journal of Computational and Applied Mathematics ( IF 2.1 ) Pub Date : 2021-05-28 , DOI: 10.1016/j.cam.2021.113679
Yeşim Güney , Hamparsum Bozdogan , Olcay Arslan

In recent years, in the literature of linear regression models, robust model selection methods have received increasing attention when the datasets contain even a small fraction of outliers. Outliers can have a serious impact on statistical inference and the choice of models using model selection criteria. Most of the existing robust information-based model selection methods are confined to robust AIC and robust AIC-type criteria. However, the penalty function used in AIC-type criteria happens to be insufficient to measure the overall model complexity in the presence of outliers and when the model is misspecified. Furthermore, in the literature, there is not much attention paid to the overall model complexity despite its importance. To overcome these problems, this paper proposes a robust version of the information-theoretic measure of complexity (ICOMP) criterion due to Bozdogan based on robust alternatives to the maximum likelihood (ML) method. Unlike the AIC criterion, ICOMP penalizes not only the number of free parameters but also the profusion of the model complexity to reduce the effect of outliers. The proposed robust ICOMP criteria are based on robust M, S, and MM estimation methods. Numerical examples are shown using a large-scale Monte Carlo simulation to study the performance of the proposed criteria. Also, a real numerical example is provided to detect outliers and to choose the best subset of predictors. Our results show the flexibility and the versatility of the proposed new approach.

中文翻译：

使用信息复杂度的线性回归模型中的稳健模型选择

近年来，在线性回归模型的文献中，当数据集包含甚至一小部分异常值时，稳健的模型选择方法受到越来越多的关注。异常值会对统计推断和使用模型选择标准的模型选择产生严重影响。大多数现有的基于信息的稳健模型选择方法仅限于稳健 AIC 和稳健 AIC 类型标准。然而，在存在异常值和错误指定模型的情况下，AIC 类型标准中使用的惩罚函数恰好不足以衡量整体模型的复杂性。此外，在文献中，尽管整体模型的复杂性很重要，但并没有太多关注。为了克服这些问题，本文基于最大似然 (ML) 方法的稳健替代方案，提出了基于 Bozdogan 的信息理论复杂性度量 (ICOMP) 标准的稳健版本。与 AIC 标准不同，ICOMP 不仅惩罚自由参数的数量，还惩罚模型复杂性的丰富程度，以减少异常值的影响。建议的稳健 ICOMP 标准基于稳健的 M、S 和 MM 估计方法。使用大规模蒙特卡罗模拟显示了数值示例，以研究所提出标准的性能。此外，还提供了一个真实的数值示例来检测异常值并选择预测变量的最佳子集。我们的结果显示了所提出的新方法的灵活性和多功能性。与 AIC 标准不同，ICOMP 不仅惩罚自由参数的数量，还惩罚模型复杂性的丰富程度，以减少异常值的影响。建议的稳健 ICOMP 标准基于稳健的 M、S 和 MM 估计方法。使用大规模蒙特卡罗模拟显示了数值示例，以研究所提出标准的性能。此外，还提供了一个真实的数值示例来检测异常值并选择预测变量的最佳子集。我们的结果显示了所提出的新方法的灵活性和多功能性。与 AIC 标准不同，ICOMP 不仅惩罚自由参数的数量，还惩罚模型复杂性的丰富程度，以减少异常值的影响。建议的稳健 ICOMP 标准基于稳健的 M、S 和 MM 估计方法。使用大规模蒙特卡罗模拟显示了数值示例，以研究所提出标准的性能。此外，还提供了一个真实的数值示例来检测异常值并选择预测变量的最佳子集。我们的结果显示了所提出的新方法的灵活性和多功能性。使用大规模蒙特卡罗模拟显示了数值示例，以研究所提出标准的性能。此外，还提供了一个真实的数值示例来检测异常值并选择预测变量的最佳子集。我们的结果显示了所提出的新方法的灵活性和多功能性。使用大规模蒙特卡罗模拟显示了数值示例，以研究所提出标准的性能。此外，还提供了一个真实的数值示例来检测异常值并选择预测变量的最佳子集。我们的结果显示了所提出的新方法的灵活性和多功能性。

更新日期：2021-06-13

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南11