当前位置: X-MOL 学术Methods › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Three invariant Hi-C interaction patterns: applications to genome assembly
Methods ( IF 4.2 ) Pub Date : 2018-06-01 , DOI: 10.1016/j.ymeth.2018.04.013
Sivan Oddes , Aviv Zelig , Noam Kaplan

Assembly of reference-quality genomes from next-generation sequencing data is a key challenge in genomics. Recently, we and others have shown that Hi-C data can be used to address several outstanding challenges in the field of genome assembly. This principle has since been developed in academia and industry, and has been used in the assembly of several major genomes. In this paper, we explore the central principles underlying Hi-C-based assembly approaches, by quantitatively defining and characterizing three invariant Hi-C interaction patterns on which these approaches can build: Intrachromosomal interaction enrichment, distance-dependent interaction decay and local interaction smoothness. Specifically, we evaluate to what degree each invariant pattern holds on a single locus level in different species, cell types and Hi-C map resolutions. We find that these patterns are generally consistent across species and cell types but are affected by sequencing depth, and that matrix balancing improves consistency of loci with all three invariant patterns. Finally, we overview current Hi-C-based assembly approaches in light of these invariant patterns and demonstrate how local interaction smoothness can be used to easily detect scaffolding errors in extremely sparse Hi-C maps. We suggest that simultaneously considering all three invariant patterns may lead to better Hi-C-based genome assembly methods.

中文翻译:

三种不变的 Hi-C 相互作用模式:在基因组组装中的应用

从下一代测序数据组装参考质量的基因组是基因组学中的一个关键挑战。最近,我们和其他人表明 Hi-C 数据可用于解决基因组组装领域的几个突出挑战。此原则已在学术界和工业界发展起来,并已用于几个主要基因组的组装。在本文中,我们通过定量定义和表征这些方法可以建立的三种不变的 Hi-C 相互作用模式来探索基于 Hi-C 的组装方法的核心原则:染色体内相互作用富集、距离相关相互作用衰减和局部相互作用平滑. 具体来说,我们评估每个不变模式在不同物种、细胞类型和 Hi-C 图分辨率中的单个基因座水平上的程度。我们发现这些模式在物种和细胞类型之间通常是一致的,但受测序深度的影响,并且矩阵平衡提高了基因座与所有三种不变模式的一致性。最后,我们根据这些不变模式概述了当前基于 Hi-C 的组装方法,并展示了如何使用局部交互平滑性来轻松检测极其稀疏的 Hi-C 地图中的脚手架错误。我们建议同时考虑所有三种不变模式可能会导致更好的基于 Hi-C 的基因组组装方法。我们根据这些不变模式概述了当前基于 Hi-C 的组装方法,并展示了如何使用局部交互平滑性来轻松检测极其稀疏的 Hi-C 地图中的脚手架错误。我们建议同时考虑所有三种不变模式可能会导致更好的基于 Hi-C 的基因组组装方法。我们根据这些不变模式概述了当前基于 Hi-C 的组装方法,并展示了如何使用局部交互平滑性来轻松检测极其稀疏的 Hi-C 地图中的脚手架错误。我们建议同时考虑所有三种不变模式可能会导致更好的基于 Hi-C 的基因组组装方法。
更新日期:2018-06-01
down
wechat
bug