...
首页> 外文期刊>Concurrency and computation: practice and experience >A MapReduce-based parallel K-means clustering for large-scale CIM data verification
【24h】

A MapReduce-based parallel K-means clustering for large-scale CIM data verification

机译:基于MapReduce的并行K均值聚类用于大规模CIM数据验证

获取原文
获取原文并翻译 | 示例

摘要

The Common Information Model (CIM) has been heavily used in electric power grids for data exchange among a number of auxiliary systems such as communication systems, monitoring systems, and marketing systems. With a rapid deployment of digitalized devices in electric power networks, the volume of data continuously grows, which makes verification of CIM data a challenging issue. This paper presents a parallel K-means clustering algorithm for large-scale CIM data verification. The parallel K-means builds on the MapReduce computing model which has been widely taken up by the community in dealing with data-intensive applications. A genetic algorithm-based load-balancing scheme is designed to balance the workloads among the heterogeneous computing nodes for a further improvement in computation efficiency. The performance of the parallel K-means is initially evaluated in a small-scale in-house MapReduce cluster and subsequently evaluated in a commercial cloud computing platform. Finally, the parallel K-means is evaluated in large-scale simulated MapReduce environments. Both the experimental and simulation results show that the parallel K-means reduces the CIM data-verification time significantly compared with the sequential K-means clustering, while generating a high level of precision in data verification. Copyright © 2015 John Wiley & Sons, Ltd.
机译:通用信息模型(CIM)已在电网中大量使用,用于在许多辅助系统(例如通信系统,监视系统和营销系统)之间进行数据交换。随着数字化设备在电力网络中的快速部署,数据量不断增长,这使得CIM数据验证成为一个具有挑战性的问题。本文提出了一种用于大规模CIM数据验证的并行K均值聚类算法。并行K均值建立在MapReduce计算模型的基础上,该模型已被社区广泛用于处理数据密集型应用程序。设计了一种基于遗传算法的负载均衡方案,以平衡异构计算节点之间的工作量,从而进一步提高计算效率。并行K均值的性能首先在小型内部MapReduce集群中进行评估,然后在商业云计算平台中进行评估。最后,在大规模模拟MapReduce环境中评估了并行K均值。实验和仿真结果均表明,与顺序K-均值聚类相比,并行K-均值显着减少了CIM数据验证时间,同时在数据验证中产生了很高的精度。版权所有©2015 John Wiley&Sons,Ltd.

著录项

  • 来源
  • 作者单位

    Sichuan University School of Electrical Engineering and Information Chengdu China;

    Sichuan University School of Electrical Engineering and Information Chengdu China;

    Sichuan University School of Electrical Engineering and Information Chengdu China;

    Sichuan University School of Electrical Engineering and Information Chengdu China;

    Sichuan University School of Electrical Engineering and Information Chengdu China;

    Tongji University The Key Laboratory of Embedded Systems and Service Computing Shanghai China;

    Brunel University London Department of Electronic and Computer Engineering Uxbridge UK;

    Tongji University The Key Laboratory of Embedded Systems and Service Computing Shanghai China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    CIM verification; stochastic sampling; clustering; MapReduce; load balancing;

    机译:CIM验证;随机抽样;聚类;MapReduce;负载均衡;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号