当前位置: X-MOL 学术Math. Geosci. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Discriminant Analysis for Compositional Data Incorporating Cell-Wise Uncertainties
Mathematical Geosciences ( IF 2.6 ) Pub Date : 2020-07-20 , DOI: 10.1007/s11004-020-09878-x
Solveig Pospiech , Raimon Tolosana-Delgado , K. Gerald van den Boogaart

Abstract

In the geosciences it is still uncommon to include measurement uncertainties into statistical methods such as discriminant analysis, but, especially for trace elements, measurement uncertainties are frequently of relevant size. Uncertainties can be reported by each measured variable, by each observation or by individual cells (i.e., each observation has an individual uncertainty for each variable). Most methods incorporating uncertainties use the uncertainties as weights for the variables or observations of the data set. The method proposed in this contribution uses variance additivity properties and generalised least squares to calculate better estimates of group variances and group means, which then influence the decision rules of linear and quadratic discrimination algorithms. This methodological framework allows incorporation of cell-wise uncertainties, and would be largely valid if the information about co-dependency between variable errors within each observation were reported. The method is also appropriate for incorporating uncertainties into compositional data sets—for example, those formed by concentrations, proportions, percentages or any other form of information about the relative abundance of a set of components forming a whole—even if such uncertainties are nearly never reported considering this compositional nature. The methods are illustrated by means of case studies with simulated data.

Graphical abstract



中文翻译:

结合细胞明智不确定性的成分数据的判别分析

摘要

在地球科学中,将测量不确定度纳入诸如判别分析之类的统计方法中仍然不常见,但是,特别是对于痕量元素而言,测量不确定度通常具有相关的大小。不确定性可以通过每个测量变量,每个观察值或单个单元格来报告(即,每个观察值对于每个变量都有单独的不确定性)。大多数包含不确定性的方法都将不确定性用作数据集变量或观测值的权重。本贡献中提出的方法使用方差可加性和广义最小二乘来计算组方差和组均值的更好估计,然后影响线性和二次判别算法的决策规则。这种方法框架允许合并细胞方向的不确定性,并且如果报告了每个观察值之间可变误差之间的相互依赖性信息,那么该方法框架在很大程度上将是有效的。该方法也适用于将不确定性纳入组成数据集(例如,由浓度,比例,百分比或有关形成整体的一组组分的相对丰度的任何其他形式的信息形成的不确定性),即使此类不确定性几乎永远不会报告考虑到这种组成性质。通过案例研究和模拟数据说明了这些方法。该方法也适用于将不确定性纳入组成数据集(例如,由浓度,比例,百分比或有关形成整体的一组组分的相对丰度的任何其他形式的信息形成的不确定性),即使此类不确定性几乎永远不会报告考虑到这种组成性质。通过案例研究和模拟数据说明了这些方法。该方法也适用于将不确定性纳入组成数据集(例如,由浓度,比例,百分比或有关形成整体的一组组分的相对丰度的任何其他形式的信息形成的不确定性),即使此类不确定性几乎永远不会报告考虑到这种组成性质。通过案例研究和模拟数据说明了这些方法。

图形概要

更新日期:2020-07-20
down
wechat
bug