当前位置: X-MOL 学术Appl. Soft Comput. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Clustering-based anomaly detection in multivariate time series data
Applied Soft Computing ( IF 7.2 ) Pub Date : 2020-11-24 , DOI: 10.1016/j.asoc.2020.106919
Jinbo Li , Hesam Izakian , Witold Pedrycz , Iqbal Jamal

Multivariate time series data come as a collection of time series describing different aspects of a certain temporal phenomenon. Anomaly detection in this type of data constitutes a challenging problem yet with numerous applications in science and engineering because anomaly scores come from the simultaneous consideration of the temporal and variable relationships. In this paper, we propose a clustering-based approach to detect anomalies concerning the amplitude and the shape of multivariate time series. First, we use a sliding window to generate a set of multivariate subsequences and thereafter apply an extended fuzzy clustering to reveal a structure present within the generated multivariate subsequences. Finally, a reconstruction criterion is employed to reconstruct the multivariate subsequences with the optimal cluster centers and the partition matrix. We construct a confidence index to quantify a level of anomaly detected in the series and apply Particle Swarm Optimization as an optimization vehicle for the problem of anomaly detection. Experimental studies completed on several synthetic and six real-world datasets suggest that the proposed methods can detect the anomalies in multivariate time series. With the help of available clusters revealed by the extended fuzzy clustering, the proposed framework can detect anomalies in the multivariate time series and is suitable for identifying anomalous amplitude and shape patterns in various application domains such as health care, weather data analysis, finance, and disease outbreak detection.



中文翻译:

多元时间序列数据中基于聚类的异常检测

多元时间序列数据是描述特定时间现象不同方面的时间序列的集合。这种类型的数据中的异常检测构成了一个具有挑战性的问题,但是在科学和工程中却有许多应用,因为异常评分来自同时考虑时间和变量关系。在本文中,我们提出了一种基于聚类的方法来检测有关多元时间序列的幅度和形状的异常。首先,我们使用滑动窗口生成一组多元子序列,然后应用扩展的模糊聚类以揭示生成的多元子序列中存在的结构。最后,采用重构准则重构具有最优聚类中心和分区矩阵的多元子序列。我们构建置信度指标以量化该系列中检测到的异常的水平,并将粒子群优化应用为针对异常检测问题的优化工具。在几个合成的和六个真实的数据集上完成的实验研究表明,所提出的方法可以检测多元时间序列中的异常。借助扩展的模糊聚类揭示的可用聚类,所提出的框架可以检测多元时间序列中的异常,并且适合于识别各种应用领域中的异常振幅和形状模式,例如医疗保健,天气数据分析,财务和疾病爆发检测。我们构建置信度指标以量化该系列中检测到的异常的水平,并将粒子群优化应用为针对异常检测问题的优化工具。在几个合成的和六个真实的数据集上完成的实验研究表明,所提出的方法可以检测多元时间序列中的异常。借助扩展的模糊聚类揭示的可用聚类,所提出的框架可以检测多元时间序列中的异常,并且适合于识别各种应用领域中的异常振幅和形状模式,例如医疗保健,天气数据分析,财务和疾病爆发检测。我们构建置信度指标以量化该系列中检测到的异常的水平,并将粒子群优化应用为针对异常检测问题的优化工具。在几个合成的和六个真实的数据集上完成的实验研究表明,所提出的方法可以检测多元时间序列中的异常。借助扩展的模糊聚类揭示的可用聚类,所提出的框架可以检测多元时间序列中的异常,并且适合于识别各种应用领域中的异常振幅和形状模式,例如医疗保健,天气数据分析,财务和疾病爆发检测。

更新日期:2020-12-15
down
wechat
bug