当前位置: X-MOL 学术J. Parallel Distrib. Comput. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
An adaptive synthesis to handle imbalanced big data with deep siamese network for electricity theft detection in smart grids
Journal of Parallel and Distributed Computing ( IF 3.4 ) Pub Date : 2021-03-23 , DOI: 10.1016/j.jpdc.2021.03.002
Nadeem Javaid , Naeem Jan , Muhammad Umar Javed

The bi-directional flow of energy and information in the smart grid makes it possible to record and analyze the electricity consumption profiles of consumers. Because of the increasing rate of inflation over the past few years, people started looking for means to use electricity illegally, termed as electricity theft. Many data analytics techniques are proposed in the literature for electricity theft detection (ETD). These techniques help in the detection of suspected illegal consumers. However, the existing approaches have a low ETD rate either due to improper handling of the imbalanced class problem in a dataset or the selection of inappropriate classifier. In this paper, a robust big data analytics technique is proposed to resolve the aforementioned concerns. Firstly, adaptive synthesis (ADASYN) is applied to handle the imbalanced class problem of data. Secondly convolutional neural network (CNN) and long-short term memory (LSTM) integrated deep siamese network (DSN) are proposed to discriminate the features of both honest and fraudulent consumers. Specifically, the task of feature extraction from weekly energy consumption profiles is handed over to the CNN module while the LSTM module performs the sequence learning. Finally, the DSN contemplates on the shared features provided by the CNN-LSTM and applies final judgment. The data analytics is performed on different train–test ratios of the real-time smart meters’ data. The simulation results validate the proposed model’s effectiveness in terms of high area under the curve, F1-Score, precision and recall.



中文翻译:

用于智能电网中电力盗窃检测的深度合成网络处理非平衡大数据的自适应综合

智能电网中能量和信息的双向流动使记录和分析消费者的用电情况成为可能。由于过去几年通货膨胀率的上升,人们开始寻找非法使用电力的手段,这被称为窃电。文献中提出了许多数据分析技术来进行电力盗窃检测(ETD)。这些技术有助于检测可疑的非法消费者。然而,由于对数据集中不平衡类问题的不当处理或对不适当分类器的选择,现有方法的ETD率较低。本文提出了一种健壮的大数据分析技术来解决上述问题。首先,自适应综合(ADASYN)用于处理数据的不平衡类问题。其次,提出了卷积神经网络(CNN)和长期短期记忆(LSTM)集成的深度暹罗网络(DSN),以区分诚实和欺诈消费者的特征。具体来说,在LSTM模块执行序列学习的同时,从每周能耗简档中提取特征的任务会移交给CNN模块。最后,DSN会考虑CNN-LSTM提供的共享功能,并做出最终判断。数据分析是对实时智能电表数据的不同测试比率进行的。仿真结果验证了该模型在曲线下高面积方面的有效性,其次,提出了卷积神经网络(CNN)和长期短期记忆(LSTM)集成的深度暹罗网络(DSN),以区分诚实和欺诈消费者的特征。具体来说,在LSTM模块执行序列学习的同时,从每周能耗简档中提取特征的任务会移交给CNN模块。最后,DSN会考虑CNN-LSTM提供的共享功能,并做出最终判断。数据分析是针对实时智能电表数据的不同测试比率进行的。仿真结果验证了该模型在曲线下高面积方面的有效性,其次,提出了卷积神经网络(CNN)和长期短期记忆(LSTM)集成的深度暹罗网络(DSN),以区分诚实和欺诈消费者的特征。具体来说,在LSTM模块执行序列学习的同时,从每周能耗简档中提取特征的任务会移交给CNN模块。最后,DSN会考虑CNN-LSTM提供的共享功能,并做出最终判断。数据分析是对实时智能电表数据的不同测试比率进行的。仿真结果验证了该模型在曲线下高面积方面的有效性,从每周能耗简档中提取特征的任务移交给CNN模块,而LSTM模块执行序列学习。最后,DSN会考虑CNN-LSTM提供的共享功能,并做出最终判断。数据分析是对实时智能电表数据的不同测试比率进行的。仿真结果验证了该模型在曲线下高面积方面的有效性,LSM模块执行序列学习时,将从每周能耗配置文件中提取特征的任务移交给CNN模块。最后,DSN会考虑CNN-LSTM提供的共享功能,并做出最终判断。数据分析是对实时智能电表数据的不同测试比率进行的。仿真结果验证了该模型在曲线下高面积方面的有效性,F1个-得分,精确度和召回率。

更新日期:2021-04-08
down
wechat
bug