Use and performance of machine learning models for type 2 diabetes prediction in community settings: A systematic review and meta-analysis.,International Journal of Medical Informatics

当前位置： X-MOL 学术 › Int. J. Med. Inform. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Use and performance of machine learning models for type 2 diabetes prediction in community settings: A systematic review and meta-analysis.
International Journal of Medical Informatics ( IF 4.9 ) Pub Date : 2020-09-07 , DOI: 10.1016/j.ijmedinf.2020.104268
Kushan De Silva ₁ , Wai Kit Lee ₁ , Andrew Forbes ₂ , Ryan T Demmer ₃ , Christopher Barton ₄ , Joanne Enticott ₁

Affiliation

Objective

We aimed to identify machine learning (ML) models for type 2 diabetes (T2DM) prediction in community settings and determine their predictive performance.

Method

Systematic review of ML predictive modelling studies in 13 databases since 2009 was conducted. Primary outcomes included metrics of discrimination, calibration, and classification. Secondary outcomes included important variables, level of validation, and intended use of models. Meta-analysis of c-indices, subgroup analyses, meta-regression, publication bias assessments and sensitivity analyses were conducted.

Results

Twenty-three studies (40 prediction models) were included. Studies with high-, moderate-, and low- risk of bias were 3, 14, and 6 respectively. All studies conducted internal validation whereas none conducted external validation of their models. Twenty studies provided classification metrics to varying extents whereas only 7 studies performed model calibration. Eighteen studies reported information on both the variables used for model development and the feature importance. Twelve studies highlighted potential applicability of their models for T2DM screening. Meta-analysis produced a good pooled c-index (0.812). Sources of heterogeneity were identified through subgroup analyses and meta-regression. Issues pertaining to methodological quality and reporting were observed.

Conclusions

We found evidence of good performance of ML models for T2DM prediction in the community. Improvements to methodology, reporting and validation are needed before they can be used at scale.

中文翻译：

机器学习模型在社区环境中对2型糖尿病的预测和使用：系统的回顾和荟萃分析。

目的

我们旨在确定社区环境中2型糖尿病（T2DM）预测的机器学习（ML）模型，并确定其预测性能。

方法

自2009年以来在13个数据库中进行了ML预测建模研究的系统评价。主要结果包括判别，校准和分类指标。次要结果包括重要变量，验证级别和模型的预期用途。进行了c指标的荟萃分析，亚组分析，荟萃回归，出版物偏倚评估和敏感性分析。

结果

包括23个研究（40个预测模型）。高，中度和低度偏倚风险的研究分别为3、14和6。所有研究均进行内部验证，而没有研究对其模型进行外部验证。二十项研究提供了不同程度的分类指标，而只有七项研究进行了模型校准。十八项研究报告了有关用于模型开发的变量和特征重要性的信息。十二项研究强调了其模型在T2DM筛查中的潜在适用性。荟萃分析得出良好的合并c指数（0.812）。通过亚组分析和荟萃回归确定了异质性的来源。观察到与方法学质量和报告有关的问题。