当前位置: X-MOL 学术Microb. Genom. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
DiSCo: a sequence-based type-specific predictor of Dsr-dependent dissimilatory sulphur metabolism in microbial data
Microbial Genomics ( IF 3.9 ) Pub Date : 2021-07-09 , DOI: 10.1099/mgen.0.000603
Sinje Neukirchen 1 , Filipa L Sousa 1
Affiliation  

Current methods in comparative genomic analyses for metabolic potential prediction of proteins involved in, or associated with the Dsr (dissimilatory sulphite reductase)-dependent dissimilatory sulphur metabolism are both time-intensive and computationally challenging, especially when considering metagenomic data. We developed DiSCo, a Dsr-dependent dissimilatory sulphur metabolism classification tool, which automatically identifies and classifies the protein type from sequence data. It takes user-supplied protein sequences and lists the identified proteins and their classification in terms of protein family and predicted type. It can also extract the sequence data from user-input to serve as basis for additional downstream analyses. DiSCo provides the metabolic functional prediction of proteins involved in Dsr-dependent dissimilatory sulphur metabolism with high levels of accuracy in a fast manner. We ran DiSCo against a dataset composed of over 190 thousand (meta)genomic records and efficiently mapped Dsr-dependent dissimilatory sulphur proteins in 1798 lineages across both prokaryotic domains. This allowed the identification of new micro-organisms belonging to Thaumarchaeota and Spirochaetes lineages with the metabolic potential to use the Dsr-pathway for energy conservation. DiSCo is implemented in Perl 5 and freely available under the GNU GPLv3 at https://github.com/Genome-Evolution-and-Ecology-Group-GEEG/DiSCo.

中文翻译:

DiSCo:微生物数据中 Dsr 依赖性异化硫代谢的基于序列的类型特异性预测因子

目前在比较基因组分析中用于预测参与或与 Dsr(异化亚硫酸还原酶)依赖性异化硫代谢相关的蛋白质的代谢潜力的方法既耗时又具有计算挑战性,尤其是在考虑宏基因组数据时。我们开发了 DiSCo,一种依赖于 Dsr 的异化硫代谢分类工具,它可以从序列数据中自动识别和分类蛋白质类型。它采用用户提供的蛋白质序列,并根据蛋白质家族和预测类型列出已识别的蛋白质及其分类。它还可以从用户输入中提取序列数据,作为其他下游分析的基础。DiSCo 以快速的方式以高精度提供参与 Dsr 依赖性异化硫代谢的蛋白质的代谢功能预测。我们针对由超过 19 万个(元)基因组记录组成的数据集运行 DiSCo,并在两个原核领域的 1798 个谱系中有效地绘制了 Dsr 依赖性异化硫蛋白。这允许鉴定属于 Thaumarchaeota 和 Spirochaetes 谱系的新微生物,这些微生物具有使用 Dsr 途径进行能量保存的代谢潜力。DiSCo 在 Perl 5 中实现,并在 GNU GPLv3 下免费提供,网址为 https://github.com/Genome-Evolution-and-Ecology-Group-GEEG/DiSCo。我们针对由超过 19 万个(元)基因组记录组成的数据集运行 DiSCo,并在两个原核领域的 1798 个谱系中有效地绘制了 Dsr 依赖性异化硫蛋白。这允许鉴定属于 Thaumarchaeota 和 Spirochaetes 谱系的新微生物,这些微生物具有使用 Dsr 途径进行能量保存的代谢潜力。DiSCo 在 Perl 5 中实现,并在 GNU GPLv3 下免费提供,网址为 https://github.com/Genome-Evolution-and-Ecology-Group-GEEG/DiSCo。我们针对由超过 190,000 个(元)基因组记录组成的数据集运行 DiSCo,并在两个原核领域的 1798 个谱系中有效地绘制了 Dsr 依赖性异化硫蛋白。这允许鉴定属于 Thaumarchaeota 和 Spirochaetes 谱系的新微生物,这些微生物具有使用 Dsr 途径进行能量保存的代谢潜力。DiSCo 在 Perl 5 中实现,并在 GNU GPLv3 下免费提供,网址为 https://github.com/Genome-Evolution-and-Ecology-Group-GEEG/DiSCo。
更新日期:2021-07-12
down
wechat
bug