首页> 外文会议>International Conference on Advances in Web-Age Information Management >Dynamic Incremental Data Summarization for Hierarchical Clustering
【24h】

Dynamic Incremental Data Summarization for Hierarchical Clustering

机译:分层群集的动态增量数据摘要

获取原文
获取外文期刊封面目录资料

摘要

In many real world applications, with the databases frequent insertions and deletions, the ability of a data mining technique to detect and react quickly to dynamic changes in the data distribution and clustering over time is highly desired. Data summarizations (e.g., data bubbles) have been proposed to compress large databases into representative points suitable for subsequent hierarchical cluster analysis. In this paper, we thoroughly investigate the quality measure (data summarization index) of incremental data bubbles. When updating databases, we show which factors could affect the mean and standard deviation of data summarization index or not. Based on these statements, a fully dynamic scheme to maintain data bubbles incrementally is proposed. An extensive experimental evaluation confirms our statements and shows that the fully dynamic incremental data bubbles are effective in preserving the quality of the data summarization for hierarchical clustering.
机译:在许多现实世界应用程序中,利用数据库频繁插入和删除,非常需要数据挖掘技术能够快速检测和反应的能力,以便在数据分布和聚类上随着时间的推移而变化。已经提出了数据摘要(例如,数据泡沫)以将大型数据库压缩到适用于后续分层聚类分析的代表性点。在本文中,我们彻底调查了增量数据泡沫的质量措施(数据摘要指数)。更新数据库时,我们显示哪些因素可能影响数据摘要指数的平均值和标准偏差。基于这些陈述,提出了一种以逐步维持数据泡沫的完全动态方案。一个广泛的实验评估证实了我们的陈述,并表明完全动态的增量数据泡沫是有效保留分层聚类数据摘要的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号