当前位置: X-MOL 学术bioRxiv. Microbiol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Mutational analysis and assessment of its impact on proteins of SARS-CoV-2 genomes from India
bioRxiv - Microbiology Pub Date : 2020-10-19 , DOI: 10.1101/2020.10.19.345066
Rezwanuzzaman Laskar , Safdar Ali

The ongoing global pandemic of SARS-CoV-2 implies a corresponding accumulation of mutations. Herein the mutational status of 611 genomes from India along with their impact on proteins was ascertained. After excluding gaps and ambiguous sequences, a total of 493 variable sites (152 parsimony informative and 341 singleton) were observed. The most prevalent reference nucleotide was C (209) and substituted one was T (293). NSP3 had the highest incidence of 101 sites followed by S protein (74 sites), NSP12b (43 sites) and ORF3a (31 sites). The average number of mutations per sample for males and females was 2.56 and 2.88 respectively suggesting a higher contribution of mutations from females. Non-uniform geographical distribution of mutations implied by Odisha (30 samples, 109 mutations) and Tamil Nadu (31 samples, 40 mutations) suggests that sequences in some regions are mutating faster than others. There were 281 mutations (198 Neutral and 83 Disease) affecting amino acid sequence. NSP13 has a maximum of 14 Disease variants followed by S protein and ORF3a with 13 each. Further, constitution of Disease mutations in genomes from asymptomatic people was mere 11% but those from deceased patients was over three folds higher at 38% indicating contribution of these mutations to the pathophysiology of the SARS-CoV-2.

中文翻译:

突变分析及其对印度SARS-CoV-2基因组蛋白质影响的评估

正在进行的SARS-CoV-2全球大流行意味着相应的突变积累。在此,确定了来自印度的611个基因组的突变状态及其对蛋白质的影响。排除缺口和歧义序列后,总共观察到493个可变位点(152个简约信息和341个单例)。最普遍的参考核苷酸是C(209),被取代的一个是T(293)。NSP3的发生率最高,为101个位点,其次是S蛋白(74个位点),NSP12b(43个位点)和ORF3a(31个位点)。男性和女性每个样本的平均突变数分别为2.56和2.88,表明女性突变的贡献更大。奥里萨邦(30个样本,109个突变)和泰米尔纳德邦(31个样本,40个突变)表明某些区域的序列突变比其他区域快。有281个突变(198个中性和83个疾病)影响氨基酸序列。NSP13最多具有14种疾病变体,其次是S蛋白和ORF3a,每种都有13种。此外,无症状人群基因组疾病突变的构成仅为11%,而死者基因组疾病突变的构成是38%的三倍以上,表明这些突变对SARS-CoV-2的病理生理的贡献。
更新日期:2020-10-20
down
wechat
bug