当前位置: X-MOL 学术ISPRS Int. J. Geo-Inf. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Semantics of Voids within Data: Ignorance-Aware Machine Learning
ISPRS International Journal of Geo-Information ( IF 3.4 ) Pub Date : 2021-04-08 , DOI: 10.3390/ijgi10040246
Vagan Terziyan , Anton Nikulin

Operating with ignorance is an important concern of geographical information science when the objective is to discover knowledge from the imperfect spatial data. Data mining (driven by knowledge discovery tools) is about processing available (observed, known, and understood) samples of data aiming to build a model (e.g., a classifier) to handle data samples that are not yet observed, known, or understood. These tools traditionally take semantically labeled samples of the available data (known facts) as an input for learning. We want to challenge the indispensability of this approach, and we suggest considering the things the other way around. What if the task would be as follows: how to build a model based on the semantics of our ignorance, i.e., by processing the shape of “voids” within the available data space? Can we improve traditional classification by also modeling the ignorance? In this paper, we provide some algorithms for the discovery and visualization of the ignorance zones in two-dimensional data spaces and design two ignorance-aware smart prototype selection techniques (incremental and adversarial) to improve the performance of the nearest neighbor classifiers. We present experiments with artificial and real datasets to test the concept of the usefulness of ignorance semantics discovery.

中文翻译:

数据中空洞的语义:无知觉机器学习

当目标是从不完善的空间数据中发现知识时,无知的操作是地理信息科学的一个重要问题。数据挖掘(由知识发现工具驱动)与处理可用(观察到的,已知的和理解的)数据样本有关,旨在建立一个模型(例如分类器)来处理尚未观察,已知或理解的数据样本。传统上,这些工具将语义标记的可用数据样本(已知事实)作为学习的输入。我们想挑战这种方法的不可或缺性,我们建议从相反的角度考虑这些问题。如果任务如下:该如何基于我们的无知的语义(即,通过在可用数据空间内处理“空隙”的形状?我们能否通过对无知进行建模来改善传统分类?本文为二维数据空间中的无知区域的发现和可视化提供了一些算法,并设计了两种无知觉智能原型选择技术(增量式和对抗式),以提高最近邻分类器的性能。我们目前使用人工和真实数据集进行实验,以测试无知语义发现的有用性的概念。我们提供了一些算法来发现和可视化二维数据空间中的无知区域,并设计了两种无知觉的智能原型选择技术(增量式和对抗式),以提高最近邻分类器的性能。我们目前使用人工和真实数据集进行实验,以测试无知语义发现的有用性的概念。我们提供了一些算法来发现和可视化二维数据空间中的无知区域,并设计了两种无知觉的智能原型选择技术(增量式和对抗式),以提高最近邻分类器的性能。我们目前使用人工和真实数据集进行实验,以测试无知语义发现的有用性的概念。
更新日期:2021-04-08
down
wechat
bug