当前位置: X-MOL 学术IEEE Trans. Knowl. Data. Eng. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A Scalable Multi-Data Sources based Recursive Approximation Approach for Fast Error Recovery in Big Sensing Data on Cloud
IEEE Transactions on Knowledge and Data Engineering ( IF 8.9 ) Pub Date : 2020-05-01 , DOI: 10.1109/tkde.2019.2895612
Chi Yang , Xianghua Xu , Kotagiri Ramamohanarao , Jinjun Chen

Big sensing data is commonly encountered from various surveillance or sensing systems. Sampling and transferring errors are commonly encountered during each stage of sensing data processing. How to recover from these errors with accuracy and efficiency is quite challenging because of high sensing data volume and unrepeatable wireless communication environment. While Cloud provides a promising platform for processing big sensing data, however scalable and accurate error recovery solutions are still need. In this paper, we propose a novel approach to achieve fast error recovery in a scalable manner on cloud. This approach is based on the prediction of a recovery replacement data by making multiple data sources based approximation. The approximation process will use coverage information carried by data units to limit the algorithm in a small cluster of sensing data instead of a whole data spectrum. Specifically, in each sensing data cluster, a Euclidean distance based approximation is proposed to calculate a time series prediction. With the calculated time series, a detected error can be recovered with a predicted data value. Through the experiment with real world meteorological data sets on cloud, we demonstrate that the proposed error recovery approach can achieve high accuracy in data approximation to replace the original data error. At the same time, with MapReduce based implementation for scalability, the experimental results also show significant efficiency on time saving.

中文翻译:

一种基于可扩展多数据源的递归逼近方法,用于云上大传感数据中的快速错误恢复

大传感数据通常来自各种监视或传感系统。在传感数据处理的每个阶段通常都会遇到采样和传输错误。由于高传感数据量和不可重复的无线通信环境,如何准确有效地从这些错误中恢复是相当具有挑战性的。虽然云为处理大传感数据提供了一个有前途的平台,但仍然需要可扩展和准确的错误恢复解决方案。在本文中,我们提出了一种在云上以可扩展方式实现快速错误恢复的新方法。该方法基于通过进行基于多个数据源的近似来预测恢复替换数据。近似过程将使用数据单元携带的覆盖信息来将算法限制在一小群感知数据而不是整个数据频谱中。具体而言,在每个传感数据簇中,提出了基于欧几里德距离的近似来计算时间序列预测。通过计算出的时间序列,可以使用预测数据值恢复检测到的错误。通过云上真实世界气象数据集的实验,我们证明了所提出的误差恢复方法可以实现高精度的数据近似,以替代原始数据误差。同时,使用基于 MapReduce 的可扩展性实现,实验结果也显示出显着的时间节省效率。提出了基于欧几里得距离的近似值来计算时间序列预测。通过计算出的时间序列,可以使用预测数据值恢复检测到的错误。通过云上真实世界气象数据集的实验,我们证明了所提出的误差恢复方法可以实现高精度的数据近似,以替代原始数据误差。同时,使用基于 MapReduce 的可扩展性实现,实验结果也显示出显着的时间节省效率。提出了基于欧几里得距离的近似值来计算时间序列预测。通过计算出的时间序列,可以使用预测数据值恢复检测到的错误。通过云上真实世界气象数据集的实验,我们证明了所提出的误差恢复方法可以实现高精度的数据近似,以替代原始数据误差。同时,使用基于 MapReduce 的可扩展性实现,实验结果也显示出显着的时间节省效率。我们证明了所提出的错误恢复方法可以在数据近似中实现高精度以替换原始数据错误。同时,使用基于 MapReduce 的可扩展性实现,实验结果也显示出显着的时间节省效率。我们证明了所提出的错误恢复方法可以在数据近似中实现高精度以替换原始数据错误。同时,使用基于 MapReduce 的可扩展性实现,实验结果也显示出显着的时间节省效率。
更新日期:2020-05-01
down
wechat
bug