首页> 外文会议>International Conference on Trends in Electronics and Informatics >Incremental MapReduce for K-Medoids Clustering of Big Time-Series Data
【24h】

Incremental MapReduce for K-Medoids Clustering of Big Time-Series Data

机译:大时间序列数据的K-Medoids聚类的增量MapReduce

获取原文

摘要

There is a high necessity to refresh the data mining results, as the former results become stale and obsolete over time due to dynamic and evolving data. Clustering is one of the important data mining techniques that help to group data points with similarity together. To mine the data generated exponentially in these days, MapReduce, a parallel programming framework can be combined MapReduce with the k-medoids clustering algorithm to arrive at the optimum results quickly. Due to the parallel processing architecture of Hadoop, the proposed iterative algorithm for processing incremental data using an intermediate key file exhibited better performance over conventional k-medoids.
机译:刷新数据挖掘结果的必要性很高,因为由于动态和不断发展的数据,随着时间的推移,以前的结果将变得陈旧且过时。聚类是重要的数据挖掘技术之一,可帮助将具有相似性的数据点分组在一起。为了挖掘近来指数级生成的数据,可以将并行编程框架MapReduce与k-medoids聚类算法相结合,以快速获得最佳结果。由于Hadoop的并行处理体系结构,因此提出的用于使用中间密钥文件处理增量数据的迭代算法表现出比常规k-medoids更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号