Implications of genome simple sequence repeats signature in 98 Polyomaviridae species
3 Biotech ( IF 2.6 ) Pub Date : 2021-01-06 , DOI: 10.1007/s13205-020-02583-w
Rezwanuzzaman Laskar 1 , Md Gulam Jilani 1 , Safdar Ali 1

Simple sequence repeats (SSRs) were analyzed in 98 genomes across four genera of the family Polyomaviridae. Genome size ranged from 3962 bp (BM87) to 7369 bp (BM85), but most genomes fell in the 5–5.5 kb range. GC content averaged 42% and ranged from 34.69% (BM95) to 52.35% (BM81). A total of 3036 SSRs and 223 compound SSRs (cSSRs) were extracted using IMEx, with incidence per genome ranging from 18 to 56 and from 0 to 7, respectively. The most prevalent mono-nucleotide repeat motif was “T” (48.95%), followed by “A” (33.48%). “AT/TA” was the most prevalent dinucleotide motif, closely followed by “CT/TC”. As expected, SSRs were concentrated in the coding region, which held 77.6% of them, nearly half of these in the Large T Antigen (LTA) gene. Notably, most viruses with humans, apes and related species as hosts exhibited mono-nucleotide repeats exclusively in the AT region, a proposed predictive marker for identifying humans as a host of the virus over the course of its evolution. Each genome has a unique SSR signature, which is pivotal for viral evolution, particularly in terms of host divergence.
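To make the quantities in the abstract concrete, the following is a minimal Python sketch of how GC%, perfect SSRs and compound SSRs could be computed for a single genome sequence. It is not IMEx and does not reproduce the authors' pipeline: it uses a plain regular-expression scan that finds perfect repeats only. The function names, the minimum repeat counts in `MIN_REPEATS` and the `dmax` gap of 10 bp are all assumptions chosen for illustration, since the abstract does not state the extraction parameters.

```python
import re

# Hypothetical minimum repeat counts per motif length (mono- to hexa-nucleotide).
# These thresholds are an assumption; the abstract does not state them.
MIN_REPEATS = {1: 6, 2: 3, 3: 3, 4: 3, 5: 3, 6: 3}


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def find_ssrs(seq):
    """Return sorted (start, end, motif, repeats) tuples for perfect SSRs.

    Coordinates are 0-based and half-open.
    """
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-base motif followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit, e.g. "TT" or "ATAT".
            if any(k % d == 0 and motif == motif[:d] * (k // d) for d in range(1, k)):
                continue
            hits.append((m.start(), m.end(), motif, (m.end() - m.start()) // k))
    return sorted(hits)


def find_cssrs(ssrs, dmax=10):
    """Group neighbouring SSRs separated by at most dmax bp (assumed value)."""
    groups, current = [], []
    for hit in ssrs:
        if current and hit[0] - current[-1][1] > dmax:
            if len(current) > 1:
                groups.append(current)
            current = []
        current.append(hit)
    if len(current) > 1:
        groups.append(current)
    return groups


if __name__ == "__main__":
    toy = "GGCATTTTTTTCGATATATGC"  # toy sequence, not a real polyomavirus genome
    ssrs = find_ssrs(toy)
    print(round(gc_percent(toy), 2), ssrs, find_cssrs(ssrs))
```

Running such a scan over each of the 98 genomes and tallying motifs per genome would give per-genome SSR and cSSR counts comparable in kind to those reported, though the exact numbers depend on the thresholds and on whether imperfect repeats are allowed, as they can be in IMEx.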




Updated: 2021-01-06