Biomedical Data and Deep Learning Computational Models for Predicting Compound-Protein Relations,IEEE/ACM Transactions on Computational Biology and Bioinformatics

当前位置： X-MOL 学术 › IEEE/ACM Trans. Comput. Biol. Bioinform. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Biomedical Data and Deep Learning Computational Models for Predicting Compound-Protein Relations
IEEE/ACM Transactions on Computational Biology and Bioinformatics ( IF 4.5 ) Pub Date : 2021-03-26 , DOI: 10.1109/tcbb.2021.3069040
Qichang Zhao ₁ , Mengyun Yang ₁ , Zhongjian Cheng ₁ , Yaohang Li ₂ , Jianxin Wang ₁

Affiliation

The identification of compound-protein relations (CPRs), which includes compound-protein interactions (CPIs) and compound-protein affinities (CPAs), is critical to drug development. A common method for compound-protein relation identification is the use of in vitro screening experiments. However, the number of compounds and proteins is massive, and in vitro screening experiments are labor-intensive, expensive, and time-consuming with high failure rates. Researchers have developed a computational field called virtual screening (VS) to aid experimental drug development. These methods utilize experimentally validated biological interaction information to generate datasets and use the physicochemical and structural properties of compounds and target proteins as input information to train computational prediction models. At present, deep learning has been widely used in computer vision and natural language processing and has experienced epoch-making progress. At the same time, deep learning has also been used in the field of biomedicine widely, and the prediction of CPRs based on deep learning has developed rapidly and has achieved good results. The purpose of this study is to investigate and discuss the latest applications of deep learning techniques in CPR prediction. First, we describe the datasets and feature engineering (i.e., compound and protein representations and descriptors) commonly used in CPR prediction methods. Then, we review and classify recent deep learning approaches in CPR prediction. Next, a comprehensive comparison is performed to demonstrate the prediction performance of representative methods on classical datasets. Finally, we discuss the current state of the field, including the existing challenges and our proposed future directions. We believe that this investigation will provide sufficient references and insight for researchers to understand and develop new deep learning methods to enhance CPR predictions.

中文翻译：

用于预测化合物-蛋白质关系的生物医学数据和深度学习计算模型

化合物-蛋白质关系 (CPR) 的鉴定，包括化合物-蛋白质相互作用 (CPI) 和化合物-蛋白质亲和力 (CPA)，对药物开发至关重要。化合物-蛋白质关系鉴定的常用方法是使用体外筛选实验。然而，化合物和蛋白质的数量是巨大的，而且在体外筛选实验是劳动密集型、昂贵且耗时且失败率高的。研究人员开发了一种称为虚拟筛选 (VS) 的计算领域，以帮助开发实验药物。这些方法利用经过实验验证的生物相互作用信息来生成数据集，并使用化合物和靶蛋白的物理化学和结构特性作为输入信息来训练计算预测模型。目前，深度学习已广泛应用于计算机视觉和自然语言处理，并取得了划时代的进步。同时，深度学习在生物医学领域也得到了广泛应用，基于深度学习的心肺复苏预测发展迅速并取得了良好的效果。本研究的目的是调查和讨论深度学习技术在 CPR 预测中的最新应用。首先，我们描述了 CPR 预测方法中常用的数据集和特征工程（即化合物和蛋白质表示和描述符）。然后，我们回顾和分类最近在 CPR 预测中的深度学习方法。接下来，进行综合比较以证明代表性方法在经典数据集上的预测性能。最后，我们讨论了该领域的现状，包括现有的挑战和我们提出的未来方向。我们相信，这项调查将为研究人员理解和开发新的深度学习方法以增强 CPR 预测提供足够的参考和见解。

更新日期：2021-03-26

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>