首页> 外文会议>Data Engineering, ICDE, 2009 IEEE 25th International Conference on >Tracking High Quality Clusters over Uncertain Data Streams
【24h】

Tracking High Quality Clusters over Uncertain Data Streams

机译:在不确定的数据流上跟踪高质量集群

获取原文

摘要

Recently, data mining over uncertain data streams has attracted a lot of attentions because of the widely existed imprecise data generated from a variety of streaming applications. In this paper, we try to resolve the problem of clustering over uncertain data streams. Facing uncertain tuples with different probability distributions, the clustering algorithm should not only consider the tuple value but also emphasis on its uncertainty. To fulfill these dual purposes, a metric named tuple uncertainty will be integrated into the overall procedure of clustering. Firstly, we survey uncertain data model and propose our uncertainty measurement and corresponding properties. Secondly, based on such uncertainty quantification method, we provide a two phase stream clustering algorithm and elaborate implementation detail. Finally, performance experiments over a number of real and synthetic data sets demonstrate the effectiveness and efficiency of our method.
机译:最近,由于从各种流应用程序生成的不精确数据广泛存在,因此在不确定的数据流上进行数据挖掘已引起了广泛的关注。在本文中,我们尝试解决不确定数据流上的聚类问题。面对具有不同概率分布的不确定元组,聚类算法不仅应考虑元组值,而且应强调其不确定性。为了实现这些双重目的,一个名为元组不确定性的度量将被集成到整个聚类过程中。首先,我们调查不确定性数据模型,并提出不确定性度量和相应的属性。其次,基于这种不确定性量化方法,提供了一种两相流聚类算法,并详细说明了实现细节。最后,在大量真实和综合数据集上进行的性能实验证明了我们方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号