首页> 外文会议>International Conference on Electrical Engineering >Enhancing CluStream Algorithm for Clustering Big Data Streaming over Sliding Window
【24h】

Enhancing CluStream Algorithm for Clustering Big Data Streaming over Sliding Window

机译:增强CluStream算法以在滑动窗口上聚类大数据流

获取原文

摘要

Data stream mining becomes a hot research issue in the ongoing time. The main challenge in data stream mining is the knowledge extraction in real-time from an immense, data stream in only one scan. Data stream clustering demonstrates an significant task in data stream processing. This paper introduces SCluStream an algorithm for determining clusters over a sliding window to manage such challenges. The algorithm is an improvement over CluStream which does not involve this sliding window concept. In the sliding window model, only the most recent data is utilized while the old data is eliminated, which allows for faster execution. A better clustering technique is also involved which managed to contribute to accuracy improvement. The proposed algorithm has been tested on two real datasets; charitable donation data set and forest cover type data set. The results showed that comparing SCluStream to CluStream has proven that the former algorithm is more efficient for clustering big data streams in regard to the accuracy as well as the utilized time and memory usages.
机译:数据流挖掘已成为当前研究的热点。数据流挖掘中的主要挑战是仅一次扫描即可从庞大的数据流中实时提取知识。数据流群集演示了数据流处理中的一项重要任务。本文介绍了SCluStream,该算法用于确定滑动窗口上的聚类以应对此类挑战。该算法是对不涉及此滑动窗口概念的CluStream的改进。在滑动窗口模型中,仅使用最新数据,而消除了旧数据,从而可以更快地执行。还涉及一种更好的聚类技术,该技术有助于提高准确性。所提出的算法已在两个真实数据集上进行了测试;慈善捐赠数据集和森林覆盖类型数据集。结果表明,将SCluStream与CluStream进行比较已证明,在准确性以及所使用的时间和内存使用量方面,前一种算法对于群集大数据流更为有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号