当前位置: X-MOL 学术J. Electron Spectrosc. Relat. Phenomena › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Box plots: A simple graphical tool for visualizing overfitting in peak fitting as demonstrated with X-ray photoelectron spectroscopy data
Journal of Electron Spectroscopy and Related Phenomena ( IF 1.9 ) Pub Date : 2021-06-21 , DOI: 10.1016/j.elspec.2021.147094
Behnam Moeini , Hyrum Haack , Neal Fairley , Vincent Fernandez , Thomas R. Gengenbach , Christopher D. Easton , Matthew R. Linford

While peak fitting of spectra/data is frequently performed in science, recent reports suggest that the quality of peak fitting in the scientific literature is often inadequate. Here, we describe a new statistical tool for determining the quality of fitting protocols, illustrating this capability with X-ray photoelectron spectroscopy (XPS) data. This tool, box plots of random starting conditions and their results, helps identify local minima in the multidimensional fit space of the fit parameters. Ideally, there should be a single global minimum for a fitting protocol such that different, reasonable starting conditions lead to the same result. To determine whether a fit space contains multiple local minima, a series of reasonable starting conditions is randomly chosen for the fit. If the boxes in the box plot of the peak areas of these multiple fits are narrow, the different possibilities converge to a single global minimum. Conversely, if the boxes are wide, multiple local minima are present. This method is related to the mathematical concept of ‘disproof by contradiction’. Our approach is demonstrated with four- and ten-component fits to a moderately complex C 1s XPS narrow scan. The results from our analysis compare favorably to those of traditional Monte Carlo analyses and uniqueness plots, where box plots are also applied to the Monte Carlo results, and each of these statistical tools performs a different function/probes a fit space/protocol differently.



中文翻译:

箱线图:一个简单的图形工具,用于可视化峰值拟合中的过度拟合,如 X 射线光电子能谱数据所示

虽然在科学中经常进行光谱/数据的峰值拟合,但最近的报告表明科学文献中峰值拟合的质量往往不足。在这里,我们描述了一种用于确定拟合协议质量的新统计工具,并用 X 射线光电子能谱 (XPS) 数据说明了这种能力。该工具是随机起始条件及其结果的箱线图,有助于识别拟合参数的多维拟合空间中的局部最小值。理想情况下,拟合协议应该有一个全局最小值,这样不同的、合理的起始条件会导致相同的结果。为了确定一个拟合空间是否包含多个局部最小值,随机选择一系列合理的起始条件进行拟合。如果这些多重拟合的峰面积箱线图中的框很窄,则不同的可能性会收敛到单个全局最小值。相反,如果框很宽,则存在多个局部最小值。这种方法与“反证法”的数学概念有关。我们的方法通过四分量和十分量适合中等复杂的 C 1s XPS 窄扫描来证明。我们的分析结果优于传统的蒙特卡罗分析和唯一性图,其中箱形图也适用于蒙特卡罗结果,并且这些统计工具中的每一个执行不同的函数/探测拟合空间/协议的方式不同。这种方法与“反证法”的数学概念有关。我们的方法通过四分量和十分量适合中等复杂的 C 1s XPS 窄扫描来证明。我们的分析结果优于传统的蒙特卡罗分析和唯一性图,其中箱形图也适用于蒙特卡罗结果,并且这些统计工具中的每一个执行不同的函数/探测拟合空间/协议的方式不同。这种方法与“反证法”的数学概念有关。我们的方法通过四分量和十分量适合中等复杂的 C 1s XPS 窄扫描来证明。我们的分析结果优于传统的蒙特卡罗分析和唯一性图,其中箱形图也适用于蒙特卡罗结果,并且这些统计工具中的每一个执行不同的函数/探测拟合空间/协议的方式不同。

更新日期:2021-06-28
down
wechat
bug