当前位置: X-MOL 学术Genome Biol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Comprehensive assessment of computational algorithms in predicting cancer driver mutations
Genome Biology ( IF 12.3 ) Pub Date : 2020-02-20 , DOI: 10.1186/s13059-020-01954-z
Hu Chen 1, 2 , Jun Li 2 , Yumeng Wang 2 , Patrick Kwok-Shing Ng 3 , Yiu Huen Tsang 4 , Kenna R Shaw 3 , Gordon B Mills 4 , Han Liang 2, 5
Affiliation  

Background The initiation and subsequent evolution of cancer are largely driven by a relatively small number of somatic mutations with critical functional impacts, so-called driver mutations. Identifying driver mutations in a patient’s tumor cells is a central task in the era of precision cancer medicine. Over the decade, many computational algorithms have been developed to predict the effects of missense single-nucleotide variants, and they are frequently employed to prioritize mutation candidates. These algorithms employ diverse molecular features to build predictive models, and while some algorithms are cancer-specific, others are not. However, the relative performance of these algorithms has not been rigorously assessed. Results We construct five complementary benchmark datasets: mutation clustering patterns in the protein 3D structures, literature annotation based on OncoKB, TP53 mutations based on their effects on target-gene transactivation, effects of cancer mutations on tumor formation in xenograft experiments, and functional annotation based on in vitro cell viability assays we developed including a new dataset of ~ 200 mutations. We evaluate the performance of 33 algorithms and found that CHASM, CTAT-cancer, DEOGEN2, and PrimateAI show consistently better performance than the other algorithms. Moreover, cancer-specific algorithms show much better performance than those designed for a general purpose. Conclusions Our study is a comprehensive assessment of the performance of different algorithms in predicting cancer driver mutations and provides deep insights into the best practice of computationally prioritizing cancer mutation candidates for end-users and for the future development of new algorithms.

中文翻译:

预测癌症驱动突变的计算算法的综合评估

背景癌症的发生和随后的演变很大程度上是由相对少量的具有关键功能影响的体细胞突变驱动的,即所谓的驱动突变。在精准癌症医学时代,识别患者肿瘤细胞中的驱动突变是一项核心任务。十年来,已经开发了许多计算算法来预测错义单核苷酸变体的影响,并且它们经常被用于优先考虑突变候选者。这些算法采用不同的分子特征来构建预测模型,虽然有些算法是针对癌症的,但有些则不是。然而,这些算法的相对性能尚未得到严格评估。结果 我们构建了五个互补的基准数据集:蛋白质 3D 结构中的突变聚类模式,基于 OncoKB 的文献注释、基于对靶基因反式激活的影响的 TP53 突变、异种移植实验中癌症突变对肿瘤形成的影响,以及基于我们开发的体外细胞活力测定的功能注释,包括约 200 个突变的新数据集。我们评估了 33 种算法的性能,发现 CHASM、CTAT-cancer、DEOGEN2 和 PrimateAI 始终比其他算法表现出更好的性能。此外,特定于癌症的算法表现出比为通用目的设计的算法要好得多的性能。
更新日期:2020-02-20
down
wechat
bug