当前位置: X-MOL 学术Genome Biol. Evol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A Chromosome-Level Genome Assembly of Dendrobium Huoshanense Using Long Reads and Hi-C Data
Genome Biology and Evolution ( IF 3.2 ) Pub Date : 2020-10-12 , DOI: 10.1093/gbe/evaa215
Bangxing Han 1 , Yi Jing 2 , Jun Dai 1 , Tao Zheng 2, 3 , Fangli Gu 1 , Qun Zhao 1 , Fucheng Zhu 1 , Xiangwen Song 1 , Hui Deng 1 , Peipei Wei 1 , Cheng Song 1 , Dong Liu 1 , Xueping Jiang 1 , Fang Wang 1 , Yanjun Chen 1 , Chuanbo Sun 1 , Houjun Yao 1 , Li Zhang 1 , Naidong Chen 1 , Shaotong Chen 1 , Xiaoli Li 1 , Yuan Wei 4 , Zhen Ouyang 4 , Hui Yan 5 , Jiangjie Lu 6 , Huizhong Wang 6 , Lanping Guo 7 , Lingdong Kong 8 , Jing Zhao 9 , Shaoping Li 9 , Lifen Luo 10 , Karsten Kristiansen 3 , Zhan Feng 2 , Silong Sun 2 , Cunwu Chen 1 , Zhen Yue 2 , Naifu Chen 1
Affiliation  

Dendrobium huoshanense is used to treat various diseases in traditional Chinese medicine. Recent studies have identified active components. However, the lack of genomic data limits research on the biosynthesis and application of these therapeutic ingredients. To address this issue, we generated the first chromosome-level genome assembly and annotation of D. huoshanense. We integrated PacBio sequencing data, Illumina paired-end sequencing data, and Hi-C sequencing data to assemble a 1.285 Gb genome, with contig and scaffold N50 lengths of 598 kb and 71.79 Mb, respectively. We annotated 21,070 protein-coding genes and 0.96 Gb transposable elements, constituting 74.92% of the whole assembly. In addition, we identified 252 genes responsible for polysaccharide biosynthesis by Kyoto Encyclopedia of Genes and Genomes functional annotation. Our data provide a basis for further functional studies, particularly those focused on genes related to glycan biosynthesis and metabolism, and have implications for both conservation and medicine.

中文翻译:

使用长读取和 Hi-C 数据的霍山石斛的染色体水平基因组组装

火山石斛在中医中用于治疗各种疾病。最近的研究已经确定了活性成分。然而,基因组数据的缺乏限制了对这些治疗成分的生物合成和应用的研究。为了解决这个问题,我们生成了霍山果蝇的第一个染色体水平基因组组装和注释. 我们整合了 PacBio 测序数据、Illumina 双端测序数据和 Hi-C 测序数据,以组装 1.285 Gb 基因组,重叠群和支架 N50 长度分别为 598 kb 和 71.79 Mb。我们注释了 21,070 个蛋白质编码基因和 0.96 Gb 转座元件,占整个程序集的 74.92%。此外,我们通过京都基因和基因组百科全书功能注释确定了负责多糖生物合成的 252 个基因。我们的数据为进一步的功能研究提供了基础,特别是那些关注与聚糖生物合成和代谢相关的基因的研究,并且对保护和医学都有影响。
更新日期:2020-12-06
down
wechat
bug