Enhancing Cancer Driver Gene Prediction by Protein-Protein Interaction Network
IEEE/ACM Transactions on Computational Biology and Bioinformatics (IF 4.5), Pub Date: 2021-03-03, DOI: 10.1109/tcbb.2021.3063532
Chuang Liu 1 , Yao Dai 1 , Keping Yu 2 , Zi-Ke Zhang 1

With advances in gene sequencing technologies, millions of somatic mutations have been reported over the past decades, but mining cancer driver genes with oncogenic mutations from these data remains a critical and challenging area of research. In this study, we propose a network-based classification method that identifies cancer driver genes by merging multiple types of biological information. The method constructs a cancer-specific genetic network from the human protein-protein interactome (PPI) to mine network structure attributes, and combines them with biological information such as mutation frequency and differential gene expression to predict cancer driver genes accurately. Across seven different cancer types, the proposed algorithm consistently achieves high prediction accuracy and outperforms existing advanced methods. In the analysis of the predicted results, about 40 percent of the top 10 candidate genes overlap with the Cancer Gene Census database. Interestingly, the feature comparison indicates that the network-based features are more important than the biological features, namely mutation frequency and differential gene expression. Further analyses also show that integrating network structure attributes with biological information is valuable for predicting new cancer driver genes.
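The feature-integration idea in the abstract can be illustrated with a minimal sketch. The abstract does not say which network structure attributes or which classifier the authors use, so everything below is an assumption made for illustration: the toy edge list, the mutation frequencies, the log2 fold-change values, the choice of degree and clustering coefficient as network features, and the helper names `build_adjacency`, `clustering_coefficient`, and `gene_features`. The sketch only assembles a per-gene feature vector; it does not reproduce the cancer-specific network construction or the classification step.

```python
from itertools import combinations

# Hypothetical toy data; gene pairs and numeric values are illustrative only.
PPI_EDGES = [
    ("TP53", "MDM2"), ("TP53", "ATM"), ("MDM2", "ATM"),
    ("ATM", "CHEK2"), ("KRAS", "BRAF"),
]
MUTATION_FREQ = {"TP53": 0.30, "MDM2": 0.02, "ATM": 0.05,
                 "CHEK2": 0.01, "KRAS": 0.20, "BRAF": 0.08}
LOG2_FOLD_CHANGE = {"TP53": -1.2, "MDM2": 0.9, "ATM": -0.3,
                    "CHEK2": 0.1, "KRAS": 1.5, "BRAF": 0.7}


def build_adjacency(edges):
    """Turn an undirected edge list into a dict mapping gene -> set of neighbors."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def clustering_coefficient(adj, node):
    """Fraction of a node's neighbor pairs that are themselves connected."""
    neighbors = adj[node]
    k = len(neighbors)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(neighbors, 2) if b in adj[a])
    return 2.0 * links / (k * (k - 1))


def gene_features(adj, mutation_freq, log2_fc):
    """Combine network and biological features into one vector per gene.

    Vector layout (an assumption): [degree, clustering coefficient,
    mutation frequency, absolute log2 fold change].
    """
    return {
        gene: [
            float(len(adj[gene])),
            clustering_coefficient(adj, gene),
            mutation_freq.get(gene, 0.0),
            abs(log2_fc.get(gene, 0.0)),
        ]
        for gene in sorted(adj)
    }


adjacency = build_adjacency(PPI_EDGES)
features = gene_features(adjacency, MUTATION_FREQ, LOG2_FOLD_CHANGE)
```

Vectors of this kind, paired with known driver and non-driver labels, could then be passed to any standard classifier; which model and which labels the authors used is not stated in the abstract.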

Updated: 2021-03-03