当前位置: X-MOL 学术Comput. Secur. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
DeepStream: Autoencoder-based stream temporal clustering and anomaly detection
Computers & Security ( IF 4.8 ) Pub Date : 2021-04-18 , DOI: 10.1016/j.cose.2021.102276
Shimon Harush , Yair Meidan , Asaf Shabtai

The increasing number of IoT devices in “smart” environments, such as homes, offices, and cities, produce seemingly endless data streams and drive many daily decisions. Consequently, there is growing interest in identifying contextual information from sensor data to facilitate the performance of various tasks, e.g., traffic management, cyber attack detection, and healthcare monitoring. The correct identification of contexts in data streams is helpful for many tasks, for example, it can assist in providing high-quality recommendations to end users and in reporting anomalous behavior based on the detection of unusual contexts. This paper presents DeepStream, a novel data stream temporal clustering algorithm that dynamically detects sequential and overlapping clusters. DeepStream is tuned to classify contextual information in real time and is capable of coping with a high-dimensional feature space. DeepStream utilizes stacked autoencoders to reduce the dimensionality of unbounded data streams and for cluster representation. This method detects contextual behavior and captures nonlinear relations of the input data, giving it an advantage over existing methods that rely on PCA. We evaluated DeepStream empirically using four sensor and IoT datasets and compared it to five state-of-the-art stream clustering algorithms. Our evaluation shows that DeepStream outperforms all of these algorithms. Our evaluation also demonstrates how DeepStream’s improved clustering performance results in improved detection of anomalous data.



中文翻译:

DeepStream:基于自动编码器的流时间聚类和异常检测

在诸如房屋,办公室和城市等“智能”环境中,越来越多的物联网设备产生看似无止境的数据流,并推动许多日常决策。因此,对于从传感器数据中识别上下文信息以促进各种任务(例如流量管理,网络攻击检测和医疗保健监视)的执行的兴趣日益浓厚。正确识别数据流中的上下文对于许多任务很有帮助,例如,它可以帮助向最终用户提供高质量的建议,并可以基于对异常上下文的检测来报告异常行为。本文介绍了DeepStream,这是一种新颖的数据流时间聚类算法,可以动态检测顺序和重叠的聚类。DeepStream进行了调整,可以实时对上下文信息进行分类,并且能够应对高维特征空间。DeepStream利用堆叠式自动编码器来减少无界数据流的维数并用于集群表示。该方法检测上下文行为并捕获输入数据的非线性关系,这使其相对于依赖PCA的现有方法具有优势。我们使用四个传感器和IoT数据集凭经验评估了DeepStream,并将其与五个最新的流聚类算法进行了比较。我们的评估表明,DeepStream优于所有这些算法。我们的评估还演示了DeepStream改进的群集性能如何导致改进的异常数据检测。DeepStream利用堆叠式自动编码器来减少无界数据流的维数并用于集群表示。该方法检测上下文行为并捕获输入数据的非线性关系,这使其相对于依赖PCA的现有方法具有优势。我们使用四个传感器和IoT数据集凭经验评估了DeepStream,并将其与五个最新的流聚类算法进行了比较。我们的评估表明,DeepStream优于所有这些算法。我们的评估还演示了DeepStream改进的群集性能如何导致改进的异常数据检测。DeepStream利用堆叠式自动编码器来减少无界数据流的维数并用于集群表示。该方法检测上下文行为并捕获输入数据的非线性关系,这使其相对于依赖PCA的现有方法具有优势。我们使用四个传感器和IoT数据集凭经验评估了DeepStream,并将其与五个最新的流聚类算法进行了比较。我们的评估表明,DeepStream优于所有这些算法。我们的评估还演示了DeepStream改进的群集性能如何导致改进的异常数据检测。与依赖PCA的现有方法相比,它具有优势。我们使用四个传感器和IoT数据集凭经验评估了DeepStream,并将其与五个最新的流聚类算法进行了比较。我们的评估表明,DeepStream优于所有这些算法。我们的评估还演示了DeepStream改进的群集性能如何导致改进的异常数据检测。与依赖PCA的现有方法相比,它具有优势。我们使用四个传感器和IoT数据集凭经验评估了DeepStream,并将其与五个最新的流聚类算法进行了比较。我们的评估表明,DeepStream优于所有这些算法。我们的评估还演示了DeepStream改进的群集性能如何导致改进的异常数据检测。

更新日期:2021-05-03
down
wechat
bug