1673-159X

CN 51-1686/N

基于混合深度学习的电力用户数据的分析模型

The Model of User Data Analysis System Based on Hybrid Deep Learning Big Data Technology

  • 摘要: 为改善现有窃电检测时由于电力数据特征复杂、数据样本不均等导致的效率低、精度低等问题,提出一种基于深度学习的电力用户数据的分析模型:首先,考虑到窃电数据样本不均衡、样本数量有限,提出基于Wasserstein准则的条件生成对抗网络,以平衡窃电数据,提升数据多样性;其次,提出一种用户电力行为特征提取网络,以增强模型训练效率;最后,提出一种基于梯度提升决策树的电力数据分类模型,以有效减少过拟合问题,从而提高模型分类精度。以中国国家电网公司发布的用电量数据集为例,对所提模型进行分析和验证。与基于随机过采样(ROS)、人工少数类过采样(SMOTE)和生成对抗网络( GAN)等数据增强方法相比,所提数据增强方法可以有效提升模型训练性能。此外,与逻辑回归(LR)、支持向量机(SVM)、长短时记忆网络(LSTM)等模型相比,所提模型在测试集中性能更优,准确率和召回率分别为89.3%和69%。仿真结果进一步验证了所提模型的有效性和准确性。

     

    Abstract: In order to solve the problems of low detection efficiency and low accuracy caused by the complex features of power data and unequal data samples, a power data analysis model based on deep learning is proposed. First of all, considered the imbalance of power theft data samples and the limited number of samples, a condition generation countermeasure network based on Wasserstein criterion is proposed to balance the power theft data and improve the diversity of data. Secondly, an automatic coder based on superposition convolution noise reduction is proposed to extract the user's power behavior characteristics, so as to improve the user's power feature extraction ability and enhance the model training efficiency. Finally, a power data classification model based on gradient lifting decision tree is proposed, which can effectively reduce the over fitting problem and improve the classification accuracy of the model. In the experimental stage, the proposed model is analyzed and verified by taking the power consumption data set released by the State Grid Corporation of China as an example. Compared with random oversampling(ROS), synthetic minority over-sampling technique(SMOTE), generative adversarial network(GAN)and other data enhancement methods, the proposed data enhancement method can effectively improve the training performance of the model. In addition, compared with logistic regression (LR), support vector machine (SVM), long-term and short-term memory network (LSTM) and other models, the proposed model has better performance in the test set, with accuracy and recall of 89.3% and 69%, respectively. The simulation results further verify the effectiveness and accuracy of the proposed model.

     

/

返回文章
返回