当前位置: X-MOL 学术Int. J. Genom. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Comprehensive Stress-Based De Novo Transcriptome Assembly and Annotation of Guar (Cyamopsis tetragonoloba (L.) Taub.): An Important Industrial and Forage Crop.
International Journal of Genomics ( IF 2.6 ) Pub Date : 2019-10-08 , DOI: 10.1155/2019/7295859
Fahad Al-Qurainy 1 , Aref Alshameri 1 , Abdel-Rhman Gaafar 1 , Salim Khan 1 , Mohammad Nadeem 1 , Abdulhafed Abdullah Alameri 1 , Mohamed Tarroum 1 , Muhammad Ashraf 1
Affiliation  

The forage crop Guar (Cyamopsis tetragonoloba (L.) Taub.) has the ability to endure heat, drought, and mild salinity. A complete image on its genic architecture will promote our understanding about gene expression networks and different tolerance mechanisms at the molecular level. Therefore, whole mRNA sequence approach on the Guar plant was conducted to provide a snapshot of the mRNA information in the cell under salinity, heat, and drought stresses to be integrated with previous transcriptomic studies. RNA-Seq technology was employed to perform a paired-end sequencing using an Illumina HiSeq 2500 platform for the transcriptome of leaves of C. tetragonoloba under normal, heat, drought, and salinity conditions. Trinity was used to achieve a de novo assembly followed by gene annotation, functional classification, metabolic pathway analysis, and identification of SSR markers. A total of 218.2 million paired-end raw reads (~44 Gbp) were generated. Of those, 193.5M paired-end reads of high quality were used to reconstruct a total of 161,058 transcripts (~266 Mbp) with N50 of 2552 bp and 61,508 putative genes. There were 6463 proteins having >90% full-length coverage against the Swiss-Prot database and 94% complete orthologs against Embryophyta. Approximately, 62.87% of transcripts were blasted, 50.46% mapped, and 43.50% annotated. A total of 4715 InterProScan families, 3441 domains, 74 repeats, and 490 sites were detected. Biological processes, molecular functions, and cellular components comprised 64.12%, 25.42%, and 10.4%, respectively. The transcriptome was associated with 985 enzymes and 156 KEGG pathways. A total of 27,066 SSRs were gained with an average frequency of one SSR/9.825 kb in the assembled transcripts. This resulting data will be helpful for the advanced analysis of Guar to multi-stress tolerance.

中文翻译:

瓜尔瓜(基于Cyamopsis tetragonoloba(L.)Taub。)的基于压力的综合从头转录组组装和注释:重要的工业和饲料作物。

饲料作物瓜尔瓜(Cyamopsis tetragonoloba(L.)Taub。)具有耐高温,干旱和轻度盐碱的能力。关于其基因结构的完整图像将促进我们对基因表达网络和分子水平上不同耐受机制的理解。因此,对瓜尔豆植物进行了完整的mRNA序列分析,以提供盐,热和干旱胁迫下细胞中mRNA信息的快照,以与以前的转录组研究相结合。RNA-Seq技术用于执行使用Illumina HiSeq 2500平台进行双末端测序,在正常,高温,干旱和盐度条件下,对四角线虫叶片的转录组进行测序。三位一体被用来实现从头开始装配,然后进行基因注释,功能分类,代谢途径分析和SSR标记鉴定。总共产生了2.182亿个配对末端原始读取(〜44 Gbp)。其中,高质量的193.5M配对末端读段被用于重建总共161,058个转录物(〜266 Mbp),N50为2552 bp,推定基因为61,508。有6463个蛋白质对Swiss-Prot数据库的全长覆盖率超过90%,对胚芽的完整直系同源物达到94%。大约有62.87%的转录本被原始表达,50.46%的作图和43.50%的注释。总共检测到4715个InterProScan家族,3441个域,74个重复序列和490个位点。生物过程,分子功能和细胞成分分别占64.12%,25.42%和10.4%。转录组与985种酶和156条KEGG途径相关。总计27,066个SSR,在组装的转录本中平均获得1个SSR / 9.825 kb的频率。这些结果数据将有助于瓜尔胶对多应力耐受性的高级分析。
更新日期:2019-10-08
down
wechat
bug