GSER (a Genome Size Estimator using R): a pipeline for quality assessment of sequenced genome libraries through genome size estimation
Interface Focus (IF 4.4), Pub Date: 2021-06-11, DOI: 10.1098/rsfs.2020.0077
Braulio Valdebenito-Maturana, Gonzalo Riadi

The first step in any genome study, once the read data have been obtained, is quality control of the sequenced reads. In a de novo genome assembly project, the second step is to estimate two important features, the genome size and the ‘best k-mer’, before starting assembly tests with different de novo assembly software and parameters. However, quality control of the sequenced genome libraries as a whole, rather than of the reads alone, is frequently overlooked, and its importance is often recognized only when the assembly tests fail to give the expected results. We have developed GSER (a Genome Size Estimator using R), a pipeline that evaluates the relationship between k-mers and genome size as a means of quality assessment of the sequenced genome libraries. GSER generates a set of charts that allow the analyst to evaluate the library datasets before starting the assembly. The script that runs the pipeline can be downloaded from http://www.mobilomics.org/GSER/downloads or http://github.com/mobilomics/GSER.
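
To make the relationship between k-mers and genome size concrete, the following is a minimal Python sketch of one common estimate: the total number of k-mer occurrences divided by the depth at the main peak of the k-mer multiplicity histogram. It is not GSER's implementation, which is written in R and is not described in detail here. The function names, the `min_depth` cutoff for discarding likely error k-mers, and the toy histogram are all assumptions made for illustration.

```python
from collections import Counter


def kmer_histogram(reads, k):
    """Map each multiplicity to the number of distinct k-mers seen that often.

    Illustrative only: counts k-mers on the forward strand and does not
    merge reverse complements, which a dedicated k-mer counter would do.
    """
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return Counter(counts.values())


def estimate_genome_size(histogram, min_depth=2):
    """Estimate genome size as (total k-mer occurrences) / (peak depth).

    Multiplicities below `min_depth` (a hypothetical cutoff) are treated as
    likely sequencing errors and excluded from both the peak and the total.
    """
    usable = {d: n for d, n in histogram.items() if d >= min_depth}
    if not usable:
        raise ValueError("no k-mers at or above min_depth")
    # Peak depth: the multiplicity shared by the most distinct k-mers
    # (ties are broken in favour of the larger multiplicity).
    peak_depth = max(usable, key=lambda d: (usable[d], d))
    total_kmers = sum(d * n for d, n in usable.items())
    return total_kmers / peak_depth


# Toy histogram (made-up numbers): an error spike at depth 1 and a main
# peak at depth 20.
toy = {1: 1000, 19: 50, 20: 200, 21: 50}
print(estimate_genome_size(toy))
```

Repeating the estimate over several values of k and comparing the results across libraries is the kind of check such an analysis supports: estimates that disagree strongly between libraries, or that change sharply with k, suggest a problem in a library rather than in individual reads.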




Updated: 2021-06-11