当前位置: X-MOL 学术Hortic. Res. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Reannotation of the cultivated strawberry genome and establishment of a strawberry genome database
Horticulture Research ( IF 7.6 ) Pub Date : 2021-03-01 , DOI: 10.1038/s41438-021-00476-4
Tianjia Liu 1 , Muzi Li 2 , Zhongchi Liu 2 , Xiaoyan Ai 1 , Yongping Li 3
Affiliation  

Cultivated strawberry (Fragaria × ananassa) is an important fruit crop species whose fruits are enjoyed by many worldwide. An octoploid of hybrid origin, the complex genome of this species was recently sequenced, serving as a key reference genome for cultivated strawberry and related species of the Rosaceae family. The current annotation of the F. ananassa genome mainly relies on ab initio predictions and, to a lesser extent, transcriptome data. Here, we present the structure and functional reannotation of the F. ananassa genome based on one PacBio full-length RNA library and ninety-two Illumina RNA-Seq libraries. This improved annotation of the F. ananassa genome, v1.0.a2, comprises a total of 108,447 gene models, with 97.85% complete BUSCOs. The models of 19,174 genes were modified, 360 new genes were identified, and 11,044 genes were found to have alternatively spliced isoforms. Additionally, we constructed a strawberry genome database (SGD) for strawberry gene homolog searching and annotation downloading. Finally, the transcriptome of the receptacles and achenes of F. ananassa at four developmental stages were reanalyzed and qualified, and the expression profiles of all the genes in this annotation are also provided. Together, this study provides an updated annotation of the F. ananassa genome, which will facilitate genomic analyses across the Rosaceae family and gene functional studies in cultivated strawberry.

中文翻译:

栽培草莓基因组重新注释及草莓基因组数据库的建立

栽培草莓(草莓属×阿纳纳萨)是一种重要的水果作物品种,其果实为全世界许多人所享用。该物种是一种杂交起源的八倍体,其复杂基因组最近已被测序,可作为栽培草莓和相关物种的关键参考基因组。蔷薇科家庭。当前的注释F. 阿纳萨基因组主要依赖于从头开始的预测,并在较小程度上依赖于转录组数据。在这里,我们提出了结构和功能重新注释F. 阿纳萨基于 1 个 PacBio 全长 RNA 文库和 92 个 Illumina RNA-Seq 文库的基因组。这改进了注释F. 阿纳萨基因组v1.0.a2总共包含108,447个基因模型,其中97.85%完整的BUSCO。修改了 19,174 个基因的模型,鉴定了 360 个新基因,并发现 11,044 个基因具有可变剪接异构体。此外,我们还构建了草莓基因组数据库(SGD),用于草莓基因同源物搜索和注释下载。最后,花托和瘦果的转录组F. 阿纳萨对四个发育阶段的基因进行了重新分析和鉴定,并提供了该注释中所有基因的表达谱。总之,这项研究提供了更新的注释F. 阿纳萨基因组,这将促进跨领域的基因组分析蔷薇科栽培草莓的家族和基因功能研究。
更新日期:2021-03-01
down
wechat
bug