【24h】

A Distributed Framework for Online Stream Data Clustering

机译:用于在线流数据群集的分布式框架

获取原文
获取外文期刊封面目录资料

摘要

The recent prevalence of positioning sensors and mobile devices generates a massive amount of spatial-temporal data from moving objects in real-time. As one of the fundamental processes in data analysis, the clustering on spatial-temporal data creates various applications, like event detection and travel pattern extraction. However, most of the existing works only focus on the offline scenario, which is not applicable to online time-sensitive applications due to their low efficiency and ignorance of temporal features. In this paper, we propose a distributed streaming framework for spatial-temporal data clustering, which accepts various clustering algorithms while ensuring low resource consumption and result correctness. The framework includes a dynamic partitioning strategy for continuous load-balancing and a cluster-merging algorithm based on convex hulls [10], which guarantees the result correctness. Extensive experiments on real dataset prove the effectiveness of our proposed framework and its advantage over existing solutions.
机译:定位传感器和移动设备的近期普遍率从实时从移动物体产生大量的空间数据数据。作为数据分析中的基本过程之一,空间数据上的聚类会产生各种应用,如事件检测和旅行模式提取。然而,大多数现有的作品仅关注离线方案,这是由于它们的效率低和时间特征的低效率和无知,这不适用于在线时敏应用。在本文中,我们提出了一种用于空间 - 时间数据聚类的分布式流框架,其在确保低资源消耗和结果正确性时接受各种聚类算法。该框架包括用于连续负载平衡的动态分区策略和基于凸壳的群集合并算法[10],保证结果正确性。关于Real DataSet的广泛实验证明了我们提出的框架的有效性及其在现有解决方案中的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号