当前位置: X-MOL 学术Chem. Sci. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Graph-based machine learning interprets and predicts diagnostic isomer-selective ion–molecule reactions in tandem mass spectrometry
Chemical Science ( IF 8.4 ) Pub Date : 2020-10-05 , DOI: 10.1039/d0sc02530e
Jonathan Fine 1, 2, 3, 4 , Judy Kuan-Yu Liu 1, 2, 3, 4 , Armen Beck 1, 2, 3, 4 , Kawthar Z. Alzarieni 1, 2, 3, 4 , Xin Ma 1, 2, 3, 4 , Victoria M. Boulos 1, 2, 3, 4 , Hilkka I. Kenttämaa 1, 2, 3, 4 , Gaurav Chopra 1, 2, 3, 4, 5
Affiliation  

Diagnostic ion–molecule reactions employed in tandem mass spectrometry experiments can frequently be used to differentiate between isomeric compounds unlike the popular collision-activated dissociation methodology. Selected neutral reagents, such as 2-methoxypropene (MOP), are introduced into an ion trap mass spectrometer where they react with protonated analytes to yield product ions that are diagnostic for the functional groups present in the analytes. However, the understanding and interpretation of the mass spectra obtained can be challenging and time-consuming. Here, we introduce the first bootstrapped decision tree model trained on 36 known ion–molecule reactions with MOP. It uses the graph-based connectivity of analytes' functional groups as input to predict whether the protonated analyte will undergo a diagnostic reaction with MOP. A Cohen kappa statistic of 0.70 was achieved with a blind test set, suggesting substantial inter-model reliability on limited training data. Prospective diagnostic product predictions were experimentally tested for 13 previously unpublished analytes. We introduce chemical reactivity flowcharts to facilitate chemical interpretation of the decisions made by the machine learning method that will be useful to understand and interpret the mass spectra for chemical reactivity.

中文翻译:

基于图的机器学习可解释和预测串联质谱中的诊断性异构体选择性离子分子反应

与流行的碰撞活化解离方法不同,串联质谱实验中使用的诊断性离子分子反应通常可用于区分异构化合物。将选定的中性试剂(例如2-甲氧基丙烯(MOP))引入离子阱质谱仪,在其中与质子化分析物发生反应,生成可诊断分析物中存在的官能团的产物离子。然而,对获得的质谱的理解和解释可能是具有挑战性和耗时的。在这里,我们介绍了第一个自举决策树模型,该模型在MOP的36种已知离子-分子反应中训练。它使用基于图的分析物官能团的连通性作为输入,以预测质子化分析物是否会与MOP进行诊断反应。盲测集获得的Cohen kappa统计量为0.70,这表明在有限的训练数据上模型间的可靠性很高。对13种先前未发布的分析物进行了实验性的前瞻性诊断产品预测。我们引入化学反应性流程图,以促进对机器学习方法所做出决定的化学解释,这对于理解和解释化学反应性质谱非常有用。
更新日期:2020-10-13
down
wechat
bug