首页> 中文期刊>计算机应用 >基于云计算和改进K-means算法的海量用电数据分析方法

基于云计算和改进K-means算法的海量用电数据分析方法

     

摘要

针对小区居民用电数据挖掘效率低、数据量大等难题,进行了基于云计算和改进K-means算法的海量用电数据分析方法研究.针对传统K-means算法中存在初始聚类中心和K值难确定的问题,提出一种基于密度的K-means改进算法.首先,定义样本密度、簇内样本平均距离的倒数和簇间距离三者乘积为权值积,通过最大权值积法依次确定聚类中心,提高了聚类的准确率;然后,基于MapReduce模型实现改进算法的并行化,提高了聚类的效率;最后,以小区400户家庭用电数据为基础,进行海量电力数据的挖掘分析实验.以家庭为单位,提取出用户的峰时耗电率、负荷率、谷电负荷系数以及平段用电量百分比,建立聚类的数据维度特征向量,完成相似用户类型的聚类,同时分析出各类用户的行为特征.基于Hadoop集群的实验结果证明提出的改进K-means算法运行稳定、可靠,具有很好的聚类效果.%For such difficulties as low mining efficiency and large amount of data that the data mining of residential electricity data has to be faced with,the analysis based on improved K-means algorithm and cloud computing on massive data of power utilization was researched.As the initial cluster center and the value K are difficult to determine in traditional K-means algorithm,an improved K-means algorithm based on density was proposed.Firstly,the product of sample density,the reciprocal of the average distance between the samples in the cluster,and the distance between the clusters were defined as weight product,the initial center was determined successively according to the maximum weight product method and the accuracy of the clustering was improved.Secondly,the parallelization of improved K-means algorithm was realized based on MapReduce model and the efficiency of clustering was improved.Finally,the mining experiment of massive power utilization data was carried out on the basis of 400 households' electricity data.Taking a family as a unit,such features as electricity consumption rate during peak hour,load rate,valley load coefficient and the percentage of power utilization during normal hour were calculated,and the feature vector of data dimension was established to complete the clustering of similar user types,at the same time,the behavioral characteristics of each type of users were analyzed.The experimental results on Hadoop cluster show that the improved K-means algorithm operates stably and efficiently and it can achieve better clustering effect.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号