DIVERSE: Bayesian Data IntegratiVE Learning for Precise Drug ResponSE Prediction
IEEE/ACM Transactions on Computational Biology and Bioinformatics (IF 3.6), Pub Date: 2021-03-11, DOI: 10.1109/tcbb.2021.3065535
Betul Guvenc Paltun 1 , Samuel Kaski 1 , Hiroshi Mamitsuka 2

Detecting predictive biomarkers from multi-omics data is important for precision medicine, both to improve the diagnosis of complex diseases and to enable better treatments. This requires substantial experimental effort, which is made difficult by the heterogeneity of cell lines and by the huge cost. An effective solution is to build a computational model over the diverse omics data, including genomic, molecular, and environmental information. However, choosing informative and reliable data sources from among the different types of data is a challenging problem. We propose DIVERSE, a framework of Bayesian importance-weighted tri- and bi-matrix factorization (DIVERSE3 or DIVERSE2) to predict drug responses from data on cell lines, drugs, and gene interactions. DIVERSE integrates the data sources systematically, in a step-wise manner, examining the importance of each added data set in turn. More specifically, we sequentially integrate five different data sets, which have not all been combined in earlier bioinformatic methods for predicting drug responses. Empirical experiments show that DIVERSE clearly outperformed five other methods, including three state-of-the-art approaches, under cross-validation. The advantage was particularly clear in out-of-matrix prediction, which is closer to real use cases and more challenging than the simpler in-matrix prediction. Additionally, case studies on discovering new drugs further confirmed the performance advantage of DIVERSE.
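
The distinction between in-matrix and out-of-matrix prediction can be illustrated with a minimal sketch of how the held-out cells of a response matrix might be chosen in each setting. This is not the authors' evaluation code: the function names, the `test_fraction` parameter, and the choice to treat rows as the held-out entities (for example drugs) are assumptions made here for illustration, and the abstract does not specify the exact protocol.

```python
import random


def in_matrix_split(n_rows, n_cols, test_fraction, seed=0):
    """Hold out individual (row, col) cells at random.

    Every row and column can still have observed entries in the
    training set, so the model has seen each entity before.
    """
    rng = random.Random(seed)
    cells = [(i, j) for i in range(n_rows) for j in range(n_cols)]
    rng.shuffle(cells)
    n_test = int(len(cells) * test_fraction)
    return cells[n_test:], cells[:n_test]  # (train, test)


def out_of_matrix_split(n_rows, n_cols, test_fraction, seed=0):
    """Hold out entire rows (hypothetically, whole drugs).

    Held-out rows contribute no cells to the training set, so
    predictions for them must rely on side information alone.
    """
    rng = random.Random(seed)
    rows = list(range(n_rows))
    rng.shuffle(rows)
    n_test = max(1, int(n_rows * test_fraction))
    test_rows = set(rows[:n_test])
    train = [(i, j) for i in range(n_rows) if i not in test_rows
             for j in range(n_cols)]
    test = [(i, j) for i in sorted(test_rows) for j in range(n_cols)]
    return train, test
```

In the second split no response value for a held-out row is ever seen during training, which is why that setting resembles predicting responses for a previously unseen entity and is the harder of the two.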

Updated: 2021-03-11