当前位置: X-MOL 学术J. Intell. Manuf. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Bound smoothing based time series anomaly detection using multiple similarity measures
Journal of Intelligent Manufacturing ( IF 8.3 ) Pub Date : 2020-05-18 , DOI: 10.1007/s10845-020-01583-0
Wenqing Wang , Junpeng Bao , Tao Li

Time series data is pervasive in many applications and the anomaly detection about it is important, which will provide the early warning of some unexpected patterns. In this paper, we propose a multiple similarity based anomalous subsequences detection method, which is unsupervised and domain knowledge free. Firstly, to improve the time efficiency, an anomaly candidates selection scheme is introduced based on the locality sensitive hashing (LSH), which considers a subsequence that does not collide with the others as a potential anomaly. However, if the raw time series is noisy and the anomaly is subtle, the performance of LSH will be degraded. In order to address this problem, we present a smoothing method to remove the noise and highlight the anomalous part in a time series, which can help to decrease the collision probability between an anomaly and the other subsequences. Secondly, we employ Pareto analysis to incorporate multiple similarity measures since there are different types of anomalies in real applications. It is unlikely that a single similarity measure can perform consistently well on different types of anomalies. Thirdly a new anomaly score scheme is provided to evaluate each anomaly candidate, which is based on the number of non-dominated vectors. Finally, we conduct extensive experiments on benchmark datasets from diverse domains and compare our method with the state-of-the-art approaches. The results show that our method can reach higher accuracy.



中文翻译:

使用多个相似性度量的基于边界平滑的时间序列异常检测

时间序列数据在许多应用程序中无处不在,因此对其进行异常检测非常重要,它将为某些意外模式提供预警。在本文中,我们提出了一种基于多重相似度的异常子序列检测方法,该方法是无监督且无领域知识的。首先,为了提高时间效率,引入了基于局部敏感哈希(LSH)的异常候选者选择方案,该方案将不会与其他子序列冲突的子序列视为潜在异常。但是,如果原始时间序列嘈杂且异常微妙,则LSH的性能将下降。为了解决这个问题,我们提出了一种平滑方法,可以消除噪声并突出显示时间序列中的异常部分,这可以帮助减少异常与其他子序列之间的碰撞概率。其次,由于实际应用中存在不同类型的异常,因此我们使用Pareto分析来合并多种相似性度量。单个相似性度量不可能在不同类型的异常上始终如一地表现良好。第三,基于非支配向量的数量,提供了一种新的异常评分方案来评估每个异常候选者。最后,我们对来自不同领域的基准数据集进行了广泛的实验,并将我们的方法与最新方法进行了比较。结果表明,该方法可以达到较高的精度。由于实际应用中存在不同类型的异常,因此我们使用Pareto分析来合并多种相似性度量。单个相似性度量不可能在不同类型的异常上始终如一地表现良好。第三,基于非支配向量的数量,提供了一种新的异常评分方案来评估每个异常候选者。最后,我们对来自不同领域的基准数据集进行了广泛的实验,并将我们的方法与最新方法进行了比较。结果表明,该方法可以达到较高的精度。由于实际应用中存在不同类型的异常,因此我们使用Pareto分析来合并多种相似性度量。单个相似性度量不可能在不同类型的异常上始终如一地表现良好。第三,基于非支配向量的数量,提供了一种新的异常评分方案来评估每个异常候选者。最后,我们对来自不同领域的基准数据集进行了广泛的实验,并将我们的方法与最新方法进行了比较。结果表明,该方法可以达到较高的精度。最后,我们对来自不同领域的基准数据集进行了广泛的实验,并将我们的方法与最新方法进行了比较。结果表明,该方法可以达到较高的精度。最后,我们对来自不同领域的基准数据集进行了广泛的实验,并将我们的方法与最新方法进行了比较。结果表明,该方法可以达到较高的精度。

更新日期:2020-05-18
down
wechat
bug