首页> 中文期刊> 《计算机、材料和连续体(英文) 》 >SMK-means:An Improved Mini Batch K-means Algorithm Based on Mapreduce with Big Data

SMK-means:An Improved Mini Batch K-means Algorithm Based on Mapreduce with Big Data

             

摘要

In recent years,the rapid development of big data technology has also been favored by more and more scholars.Massive data storage and calculation problems have also been solved.At the same time,outlier detection problems in mass data have also come along with it.Therefore,more research work has been devoted to the problem of outlier detection in big data.However,the existing available methods have high computation time,the improved algorithm of outlier detection is presented,which has higher performance to detect outlier.In this paper,an improved algorithm is proposed.The SMK-means is a fusion algorithm which is achieved by Mini Batch K-means based on simulated annealing algorithm for anomalous detection of massive household electricity data,which can give the number of clusters and reduce the number of iterations and improve the accuracy of clustering.In this paper,several experiments are performed to compare and analyze multiple performances of the algorithm.Through analysis,we know that the proposed algorithm is superior to the existing algorithms.

著录项

  • 来源
    《计算机、材料和连续体(英文) 》 |2018年第9期|P.365-379|共15页
  • 作者单位

    Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control Jiangsu Collaborative Innovation Centre of Atmospheric Environment and Equipment Technology School of Environmental Science and Engineering Nanjing University of Information Science and Technology 219 Ningliu Road Nanjing 210044 China;

    School of Computer and Software Nanjing University of Information Science and Technology 219 Ningliu Road Nanjing 210044 China;

    School of Computing Edinburgh Napier University 10 Colinton Road Edinburgh EH105DT UK;

    School of Computing Edinburgh Napier University 10 Colinton Road Edinburgh EH105DT UK;

  • 原文格式 PDF
  • 正文语种 chi
  • 中图分类 计算技术、计算机技术 ;
  • 关键词

    Big; data; outlier; detection; SMK-means; Mini; Batch; K-means; simulated; annealing;

    机译:大;数据;异常值;检测;SMK均值;迷你;批次;k型;模拟;退火;
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号