SeqScrub: a web tool for automatic cleaning and annotation of FASTA file headers for bioinformatic applications


Biotechniques (IF 2.2), Pub Date: 2019-06-20, DOI: 10.2144/btn-2018-0188
Gabriel Foley, Leander Sützl, Stephlina A D'Cunha, Elizabeth M J Gillam, Mikael Bodén

Data consistency is necessary for effective bioinformatic analysis. SeqScrub is a web tool that parses and maintains consistent information about protein and DNA sequences in FASTA file format, checks whether records are current, and adds taxonomic information by matching identifiers against entries in authoritative biological sequence databases. SeqScrub provides a powerful yet simple workflow for managing, enriching and exchanging data, which is crucial for establishing a record of provenance for sequences retrieved from broad and varied searches, for example, using BLAST on continually updated genome sequence sets. Headers standardized using SeqScrub can be parsed by a majority of bioinformatic tools, stay uniformly named between collaborators and contain informative labels to aid management of reproducible scientific data. SeqScrub is available at http://bioinf.scmb.uq.edu.au/seqscrub.
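
To make the idea of header cleaning concrete, the following is a minimal offline sketch in Python, not SeqScrub's actual implementation. It assumes two simplified header styles (a UniProt-like `sp|ACCESSION|NAME` form and an NCBI-like leading accession), a hypothetical `taxonomy` dictionary standing in for a database lookup, and an assumed output layout of `ID|Genus_species`. The function names and the sample records are illustrative only.

```python
import re

# Simplified, assumed header patterns; real database headers vary more widely.
UNIPROT = re.compile(r"^(?:sp|tr)\|([A-Z0-9]+)\|\S+")
NCBI = re.compile(r"^([A-Z]{1,3}_?\d+(?:\.\d+)?)")


def parse_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; emit the previous one if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def extract_id(header):
    """Return the identifier at the start of a header, or None if unrecognized."""
    for pattern in (UNIPROT, NCBI):
        match = pattern.match(header)
        if match:
            return match.group(1)
    return None


def clean_header(header, taxonomy):
    """Rewrite a header as 'ID|Genus_species' (an assumed layout).

    `taxonomy` is a hypothetical dict standing in for a database query.
    Unrecognized headers are returned unchanged.
    """
    ident = extract_id(header)
    if ident is None:
        return header
    species = taxonomy.get(ident)
    if species is None:
        return ident
    return f"{ident}|{species.replace(' ', '_')}"


def scrub(text, taxonomy):
    """Return FASTA text with every header cleaned and sequences unwrapped."""
    records = [
        f">{clean_header(header, taxonomy)}\n{seq}"
        for header, seq in parse_fasta(text)
    ]
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    # Made-up records for illustration; not real database entries.
    sample = (
        ">sp|A0A000|DEMO_HUMAN Demo protein OS=Homo sapiens\n"
        "MALSQ\nSVPF\n"
        ">XP_000001.1 hypothetical protein [Mus musculus]\n"
        "MKV\n"
    )
    lookup = {"A0A000": "Homo sapiens", "XP_000001.1": "Mus musculus"}
    print(scrub(sample, lookup), end="")
```

Keeping the taxonomy lookup as a plain dictionary keeps the sketch self-contained; a real workflow would replace it with queries to the relevant sequence databases and would also need to handle obsolete or merged records.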


Updated: 2020-08-21
