当前位置: X-MOL 学术bioRxiv. Genom. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Long-read transcriptome and other genomic resources for the angiosperm Silene noctiflora
bioRxiv - Genomics Pub Date : 2020-08-10 , DOI: 10.1101/2020.08.09.243378
Alissa M. Williams , Michael W. Itgen , Amanda K. Broz , Olivia G. Carter , Daniel B. Sloan

The angiosperm genus Silene is a model system for several traits of ecological and evolutionary significance in plants, including breeding system and sex chromosome evolution, host-pathogen interactions, invasive species biology, heavy metal tolerance, and cytonuclear interactions. Despite its importance, genomic resources for this large genus of approximately 850 species are scarce, with only one published whole-genome sequence (from the dioecious species S. latifolia). Here, we provide genomic and transcriptomic resources for a hermaphroditic representative of this genus (S. noctiflora), including a PacBio Iso-Seq transcriptome, which uses long-read, single-molecule sequencing technology to analyze full-length mRNA transcripts and identify paralogous genes and alternatively spliced genes. Using these data, we have assembled and annotated high-quality full-length cDNA sequences for approximately 17,000 S. noctiflora genes and 27,000 isoforms. We demonstrated the utility of these data to distinguish between recent and highly similar gene duplicates by identifying novel paralogous genes in an essential protease complex. Further, we provide a draft assembly for the approximately 2.7-Gb genome of this species, which is near the upper range of genome-size values reported for diploids in this genus and three-fold larger than the 0.9-Gb genome of S. conica, another species in the same subgenus. Karyotyping confirmed that S. noctiflora is a diploid, indicating that its large genome size is not due to polyploidization. These resources should facilitate further study and development of this genus as a model in plant ecology and evolution.

中文翻译:

被子植物Silene noctiflora的长转录组和其他基因组资源

被子植物Silene是一个模型系统,具有植物在生态和进化上的重要意义,包括育种系统和性染色体进化,宿主-病原体相互作用,入侵物种生物学,重金属耐受性和细胞核相互作用。尽管它很重要,但大约有850种这个大属的基因组资源却很少,只有一个公开的全基因组序列(来自雌雄异株S. latifolia)。在这里,我们为该属的一种雌雄同体代表提供基因组学和转录组学资源,其中包括一个PacBio Iso-Seq转录组,该组使用长期的单分子测序技术来分析全长mRNA转录本并鉴定旁系同源基因。基因和剪接的基因。使用这些数据,我们已经组装并注释了大约17,000个夜蛾链球菌基因和27,000个同工型的高质量全长cDNA序列。我们通过鉴定必需蛋白酶复合物中的新型旁系同源基因,证明了这些数据可用于区分近期和高度相似的基因重复。此外,我们提供了该物种约2.7 Gb基因组的装配图,该基因组接近该属二倍体报道的基因组大小值的上限,并且比锥虫的0.9 Gb基因组大三倍。 ,是同一亚属中的另一个物种。核型分析证实夜蛾链球菌是二倍体,表明其大基因组大小不是由于多倍体化。这些资源应有助于作为植物生态学和进化的模型对该属的进一步研究和开发。
更新日期:2020-08-11
down
wechat
bug