当前位置: X-MOL 学术J. Proteome Res. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Quantitative Evaluation of Ion Chromatogram Extraction Algorithms.
Journal of Proteome Research ( IF 3.8 ) Pub Date : 2020-03-27 , DOI: 10.1021/acs.jproteome.9b00768
Rob Smith 1 , Annika R Tostengard 1
Affiliation  

Extracted ion chromatograms (XIC) are the fundamental signal unit in mass spectrometry. There are many algorithms for analyzing raw mass spectrometry data tasked with distinguishing real isotopic signals from noise. While one or more of the available algorithms are typically chained together for end-to-end mass spectrometry analysis, analysis of each algorithm in isolation provides a specific measurement of the strengths and weaknesses of each approach. Though qualitative opinions on extraction algorithm performance abound, quantitative performance has never been publicly ascertained. Quantitative evaluation has not occurred partly due to the lack of an available quantitative ground truth MS1 data set. Using a recently published, manually extracted XICs as ground truth data, we evaluate the quality of popular XIC algorithms, including MaxQuant, MZMine2, and several methods from XCMS. The manually curated data set comprises 48 human proteins stratified over 6 abundance orders of magnitude. Signals in the sample were manually curated into XIC using a commercial tool for visually identifying XIC and isotopic envelopes. XIC algorithms were applied to the manually extracted data using a grid search of possible parameters. Performance varied greatly between different parameter settings, though nearly all algorithms with parameter settings optimized with respect to the number of true positives recovered over 10 000 XICs.

中文翻译:

离子色谱提取算法的定量评价。

提取离子色谱图 (XIC) 是质谱中的基本信号单位。有许多用于分析原始质谱数据的算法,其任务是区分真实同位素信号和噪声。虽然一个或多个可用算法通常链接在一起以进行端到端质谱分析,但对每种算法的单独分析提供了对每种方法的优缺点的具体测量。尽管关于提取算法性能的定性意见比比皆是,但定量性能从未被公开确定。由于缺乏可用的定量地面实况 MS1 数据集,定量评估尚未发生。使用最近发布的手动提取的 XIC 作为地面实况数据,我们评估流行的 XIC 算法的质量,包括 MaxQuant,MZMine2 和 XCMS 的几种方法。手动管理的数据集包括 48 种人类蛋白质,分层超过 6 个数量级。使用用于视觉识别 XIC 和同位素包络的商业工具将样本中的信号手动整理到 XIC 中。使用可能参数的网格搜索将 XIC 算法应用于手动提取的数据。不同参数设置之间的性能差异很大,尽管几乎所有算法的参数设置都针对真阳性数进行了优化,超过 10 000 个 XIC。使用可能参数的网格搜索将 XIC 算法应用于手动提取的数据。不同参数设置之间的性能差异很大,尽管几乎所有算法的参数设置都针对真阳性数进行了优化,超过 10 000 个 XIC。使用可能参数的网格搜索将 XIC 算法应用于手动提取的数据。不同参数设置之间的性能差异很大,尽管几乎所有算法的参数设置都针对真阳性数进行了优化,超过 10 000 个 XIC。
更新日期:2020-03-27
down
wechat
bug