EnTAP: Bringing faster and smarter functional annotation to non-model eukaryotic transcriptomes.
Molecular Ecology Resources ( IF 7.7 ) Pub Date : 2019-12-31 , DOI: 10.1111/1755-0998.13106
Alexander J Hart 1 , Samuel Ginzburg 1 , Muyang Sam Xu 1 , Cera R Fisher 1 , Nasim Rahmatpour 1 , Jeffry B Mitton 2 , Robin Paul 1 , Jill L Wegrzyn 1

EnTAP (Eukaryotic Non-Model Transcriptome Annotation Pipeline) was designed to improve the accuracy, speed, and flexibility of functional gene annotation for de novo assembled transcriptomes in non-model eukaryotes. This software package addresses the fragmentation and related assembly issues that result in inflated transcript estimates and poor annotation rates of protein-coding transcripts. Transcripts are first filtered through assessment of true expression and frame selection; open-source tools are then leveraged to functionally annotate the reduced set of translated proteins. Downstream features include fast similarity search across five repositories, protein domain assignment, orthologous gene family assessment, and Gene Ontology (GO) term assignment. The final annotation integrates results across multiple databases and selects an optimal assignment from a combination of weighted metrics describing similarity search score, taxonomic relationship, and informativeness. Researchers have the option to include additional filters to identify and remove contaminants, identify associated pathways, and prepare the transcripts for enrichment analysis. This fully featured pipeline is easy to install and configure, and it runs significantly faster than comparable annotation packages. EnTAP is optimized to generate extensive functional information for the gene space of organisms with limited or poorly characterized genomic resources.
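
The abstract describes selecting one optimal assignment per transcript from weighted metrics (similarity search score, taxonomic relationship, and informativeness). The following Python sketch illustrates that general idea only; it is not EnTAP's actual implementation. The `Hit` record, the keyword list, the `shared_ranks` field, and the weights are all hypothetical choices made for this example.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple

# Hypothetical keywords marking an uninformative description (an assumption,
# not a list taken from EnTAP).
UNINFORMATIVE = ("hypothetical", "uncharacterized", "unknown", "predicted protein")


@dataclass
class Hit:
    """One similarity-search hit for a query transcript (illustrative fields)."""
    database: str
    description: str
    bit_score: float
    shared_ranks: int  # taxonomic ranks shared with the target lineage


def is_informative(description: str) -> bool:
    """Return True when the description contains none of the flagged keywords."""
    text = description.lower()
    return not any(word in text for word in UNINFORMATIVE)


def combined_score(hit: Hit, max_bit: float, max_ranks: int,
                   weights: Tuple[float, float, float]) -> float:
    """Weighted sum of normalized similarity, taxonomy, and informativeness."""
    w_sim, w_tax, w_info = weights
    similarity = hit.bit_score / max_bit if max_bit else 0.0
    taxonomy = hit.shared_ranks / max_ranks if max_ranks else 0.0
    informative = 1.0 if is_informative(hit.description) else 0.0
    return w_sim * similarity + w_tax * taxonomy + w_info * informative


def select_best_hit(hits: Sequence[Hit],
                    weights: Tuple[float, float, float] = (0.5, 0.3, 0.2)
                    ) -> Optional[Hit]:
    """Pick the hit with the highest combined score, or None if there are no hits."""
    if not hits:
        return None
    max_bit = max(h.bit_score for h in hits)
    max_ranks = max(h.shared_ranks for h in hits)
    return max(hits, key=lambda h: combined_score(h, max_bit, max_ranks, weights))


# Hits for a single transcript drawn from two hypothetical databases.
example_hits = [
    Hit("db_a", "hypothetical protein", 500.0, 1),
    Hit("db_b", "heat shock protein 70", 450.0, 5),
]
best = select_best_hit(example_hits)
```

With these illustrative weights, the slightly lower-scoring but taxonomically closer and informative `db_b` hit is preferred over the top-scoring "hypothetical protein" hit, which is the kind of trade-off the abstract describes.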

Updated: 2019-12-31