当前位置: X-MOL 学术Secur. Commun. Netw. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
An Anomaly Detection Algorithm Selection Service for IoT Stream Data Based on Tsfresh Tool and Genetic Algorithm
Security and Communication Networks Pub Date : 2021-02-08 , DOI: 10.1155/2021/6677027
Zhongguo Yang 1 , Irshad Ahmed Abbasi 2 , Elfatih Elmubarak Mustafa 2 , Sikandar Ali 3, 4 , Mingzhu Zhang 1
Affiliation  

Anomaly detection algorithms (ADA) have been widely used as services in many maintenance monitoring platforms. However, there are numerous algorithms that could be applied to these fast changing stream data. Furthermore, in IoT stream data due to its dynamic nature, the phenomena of conception drift happened. Therefore, it is a challenging task to choose a suitable anomaly detection service (ADS) in real time. For accurate online anomalous data detection, this paper developed a service selection method to select and configure ADS at run-time. Initially, a time-series feature extractor (Tsfresh) and a genetic algorithm-based feature selection method are applied to swiftly extract dominant features which act as representation for the stream data patterns. Additionally, stream data and various efficient algorithms are collected as our historical data. A fast classification model based on XGBoost is trained to record stream data features to detect appropriate ADS dynamically at run-time. These methods help to choose suitable service and their respective configuration based on the patterns of stream data. The features used to describe and reflect time-series data’s intrinsic characteristics are the main success factor in our framework. Consequently, experiments are conducted to evaluate the effectiveness of features closed by genetic algorithm. Experimentations on both artificial and real datasets demonstrate that the accuracy of our proposed method outperforms various advanced approaches and can choose appropriate service in different scenarios efficiently.

中文翻译:

基于Tsfresh工具和遗传算法的物联网流数据异常检测算法选择服务

异常检测算法(ADA)已在许多维护监视平台中广泛用作服务。但是,有许多算法可以应用于这些快速变化的流数据。此外,在物联网流数据中,由于其动态特性,发生了概念漂移现象。因此,实时选择合适的异常检测服务(ADS)是一项艰巨的任务。为了进行准确的在线异常数据检测,本文开发了一种服务选择方法来在运行时选择和配置ADS。最初,使用时间序列特征提取器(Tsfresh)和基于遗传算法的特征选择方法来快速提取占主导地位的特征,这些特征充当流数据模式的表示。此外,流数据和各种有效算法也被收集为我们的历史数据。经过训练的基于XGBoost的快速分类模型可以记录流数据功能,以在运行时动态检测适当的ADS。这些方法有助于根据流数据的模式选择合适的服务及其相应的配置。用于描述和反映时间序列数据的内在特征的功能是我们框架中的主要成功因素。因此,进行了实验以评估由遗传算法封闭的特征的有效性。在人工和真实数据集上的实验表明,我们提出的方法的准确性优于各种高级方法,并且可以在不同情况下有效地选择合适的服务。这些方法有助于根据流数据的模式选择合适的服务及其相应的配置。用于描述和反映时间序列数据的内在特征的功能是我们框架中的主要成功因素。因此,进行了实验以评估由遗传算法封闭的特征的有效性。在人工和真实数据集上的实验表明,我们提出的方法的准确性优于各种高级方法,并且可以在不同情况下有效地选择合适的服务。这些方法有助于根据流数据的模式选择合适的服务及其相应的配置。用于描述和反映时间序列数据的内在特征的功能是我们框架中的主要成功因素。因此,进行了实验以评估由遗传算法封闭的特征的有效性。在人工和真实数据集上的实验表明,我们提出的方法的准确性优于各种高级方法,并且可以在不同情况下有效地选择合适的服务。实验评估了遗传算法封闭特征的有效性。在人工和真实数据集上的实验表明,我们提出的方法的准确性优于各种高级方法,并且可以在不同情况下有效地选择合适的服务。实验评估了遗传算法封闭特征的有效性。在人工和真实数据集上的实验表明,我们提出的方法的准确性优于各种高级方法,并且可以在不同情况下有效地选择合适的服务。
更新日期:2021-02-08
down
wechat
bug