当前位置:
X-MOL 学术
›
Nat. Methods
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
A synthetic-diploid benchmark for accurate variant-calling evaluation
Nature Methods ( IF 36.1 ) Pub Date : 2018-07-16 , DOI: 10.1038/s41592-018-0054-7 Heng Li 1 , Jonathan M Bloom 1 , Yossi Farjoun 1 , Mark Fleharty 1 , Laura Gauthier 1 , Benjamin Neale 1, 2 , Daniel MacArthur 1, 2
中文翻译:
用于准确变异识别评估的合成二倍体基准
更新日期:2018-07-18
Nature Methods ( IF 36.1 ) Pub Date : 2018-07-16 , DOI: 10.1038/s41592-018-0054-7 Heng Li 1 , Jonathan M Bloom 1 , Yossi Farjoun 1 , Mark Fleharty 1 , Laura Gauthier 1 , Benjamin Neale 1, 2 , Daniel MacArthur 1, 2
Affiliation
Existing benchmark datasets for use in evaluating variant-calling accuracy are constructed from a consensus of known short-variant callers, and they are thus biased toward easy regions that are accessible by these algorithms. We derived a new benchmark dataset from the de novo PacBio assemblies of two fully homozygous human cell lines, which provides a relatively more accurate and less biased estimate of small-variant-calling error rates in a realistic context.
中文翻译:
用于准确变异识别评估的合成二倍体基准
用于评估变体调用准确性的现有基准数据集是根据已知短变体调用者的共识构建的,因此它们偏向于这些算法可访问的简单区域。我们从两个完全纯合的人类细胞系的 de novo PacBio 组装中得出了一个新的基准数据集,它为现实环境中的小变异调用错误率提供了相对更准确且偏差较小的估计。