当前位置: X-MOL 学术IEEE Open J. Eng. Med. Biol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Bayesian Inference-Based Gaussian Mixture Models With Optimal Components Estimation Towards Large-Scale Synthetic Data Generation for In Silico Clinical Trials
IEEE Open Journal of Engineering in Medicine and Biology Pub Date : 2022-06-10 , DOI: 10.1109/ojemb.2022.3181796
Vasileios C Pezoulas 1 , Nikolaos S Tachos 1 , George Gkois 1 , Iacopo Olivotto 2 , Fausto Barlocco 2 , Dimitrios I Fotiadis 1, 3
Affiliation  

Goal: To develop a computationally efficient and unbiased synthetic data generator for large-scale in silico clinical trials (CTs). Methods: We propose the BGMM-OCE, an extension of the conventional BGMM (Bayesian Gaussian Mixture Models) algorithm to provide unbiased estimations regarding the optimal number of Gaussian components and yield high-quality, large-scale synthetic data at reduced computational complexity. Spectral clustering with efficient eigenvalue decomposition is applied to estimate the hyperparameters of the generator. A case study is conducted to compare the performance of BGMM-OCE against four straightforward synthetic data generators for in silico CTs in hypertrophic cardiomyopathy (HCM). Results: The BGMM-OCE generated 30000 virtual patient profiles having the lowest coefficient-of-variation (0.046), inter- and intra-correlation differences (0.017, and 0.016, respectively) with the real ones in reduced execution time. Conclusions: BGMM-OCE overcomes the lack of population size in HCM which obscures the development of targeted therapies and robust risk stratification models.

中文翻译:

基于贝叶斯推理的高斯混合模型和最佳成分估计,用于计算机模拟临床试验的大规模合成数据生成

目标:为大规模计算机临床试验 (CT) 开发一种计算高效且无偏见的合成数据生成器。方法:我们提出了 BGMM-OCE,它是传统 BGMM(贝叶斯高斯混合模型)算法的扩展,以提供关于最佳高斯分量数量的无偏估计,并在降低计算复杂性的情况下产生高质量、大规模的合成数据。应用具有有效特征值分解的谱聚类来估计生成器的超参数。进行了一项案例研究,以比较 BGMM-OCE 与用于肥厚型心肌病 (HCM) 计算机 CT的四种直接合成数据生成器的性能。结果:BGMM-OCE 生成了 30000 个虚拟患者档案,这些档案具有最低的变异系数 (0.046)、内部和内部相关差异(分别为 0.017 和 0.016),并且执行时间更短。结论: BGMM-OCE 克服了 HCM 人群规模不足的问题,后者阻碍了靶向治疗和稳健的风险分层模型的发展。
更新日期:2022-06-10
down
wechat
bug