Best variable identification by means of data-mining and cooperative game theory,Journal of Biomedical informatics

当前位置： X-MOL 学术 › J. Biomed. Inform. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Best variable identification by means of data-mining and cooperative game theory
Journal of Biomedical informatics ( IF 4.5 ) Pub Date : 2020-11-19 , DOI: 10.1016/j.jbi.2020.103625
Stefan Bruch ₁ , Lisa Ernst ₁ , Mareike Schulz ₁ , Leonie Zieglowski ₁ , René H Tolba ₁

Affiliation

Objective:

To develop and evaluate methods to assess single and grouped variables impact on measuring intervention severities and support a search for most expressive variables.

Methods:

Datasets of cohort studies are analyzed automatically based on algorithms. For this, a metric is developed to compare measured variables in different cohorts in a data-mining process. Variables are measured in all possible combinations to detect possible synergies of certain variable constellations and allow for a ranking of the combinations’ expressiveness. Such ranking serves as a basis for a wide range of algorithmic data analysis. In an exemplary application, every group member’s impact on the total result is determined based on the principle of the cooperative game theory besides to the total expressiveness of the variable groups.

Results:

For different types of interventions, the method is applied to experimental data containing multiple recorded medical lab values. The expressiveness of variable combinations to indicate severity is ranked by means of a metric. Within each combination, any variable’s contribution to the total effect is determined and accumulated over whole datasets to yield local and global variable importance measures. The computed results have been successfully matched with clinical expectations to prove their plausibility.

Conclusion:

Algorithmic evaluation shows to be a promising approach in automatized quantification of variable expressiveness. It can assess descriptive power of measurements, help to improve future study designs and expose worthwhile research issues.

中文翻译：

通过数据挖掘和合作博弈理论进行最佳变量识别

目的：

开发和评估评估单个变量和分组变量对测量干预严重程度的影响并支持搜索最具表现力的变量的方法。

方法：

队列研究的数据集会根据算法自动进行分析。为此，开发了一种度量标准以比较数据挖掘过程中不同队列中的测量变量。在所有可能的组合中测量变量，以检测某些变量星座的可能协同作用，并允许对组合的表现力进行排名。这样的排名是各种算法数据分析的基础。在示例性应用中，除了变量组的总表达能力之外，还基于合作博弈原理确定每个组成员对总结果的影响。