当前位置: X-MOL 学术Sci. Rep. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Evaluation metrics and validation of presence-only species distribution models based on distributional maps with varying coverage
Scientific Reports ( IF 4.6 ) Pub Date : 2021-01-15 , DOI: 10.1038/s41598-020-80062-1
Kamil Konowalik 1 , Agata Nosol 1
Affiliation  

We examine how different datasets, including georeferenced hardcopy maps of different extents and georeferenced herbarium specimens (spanning the range from 100 to 85,000 km2) influence ecological niche modeling. We check 13 of the available environmental niche modeling algorithms, using 30 metrics to score their validity and evaluate which are useful for the selection of the best model. The validation is made using an independent dataset comprised of presences and absences collected in a range-wide field survey of Carpathian endemic plant Leucanthemum rotundifolium (Compositae). Our analysis of models’ predictive performances indicates that almost all datasets may be used for the construction of a species distributional range. Both very local and very general datasets can produce useful predictions, which may be more detailed than the original ranges. Results also highlight the possibility of using the data from manually georeferenced archival sources in reconstructions aimed at establishing species’ ecological niches. We discuss possible applications of those data and associated problems. For the evaluation of models, we suggest employing AUC, MAE, and Bias. We show an example of how AUC and MAE may be combined to select the model with the best performance.



中文翻译:

基于不同覆盖率分布图的仅存在物种分布模型的评估指标和验证

我们研究了不同的数据集,包括不同范围的地理参考硬拷贝地图和地理参考植物标本(范围从 100 到 85,000 公里2)如何影响生态位模型。我们检查了 13 种可用的环境生态位建模算法,使用 30 个指标对其有效性进行评分,并评估哪些对选择最佳模型有用。验证是使用一个独立的数据集进行的,该数据集包括在喀尔巴阡山脉特有植物Leucanthemum rotundifolium的范围广泛的实地调查中收集的存在和缺失(菊科)。我们对模型预测性能的分析表明,几乎所有数据集都可用于构建物种分布范围。非常局部和非常通用的数据集都可以产生有用的预测,这些预测可能比原始范围更详细。结果还强调了在旨在建立物种生态位的重建中使用来自手动地理参考档案来源的数据的可能性。我们讨论了这些数据的可能应用和相关问题。对于模型的评估,我们建议使用 AUC、MAE 和偏差。我们展示了如何结合 AUC 和 MAE 来选择具有最佳性能的模型的示例。

更新日期:2021-01-16
down
wechat
bug