首页> 外文会议>International Symposium on Pervasive Systems, Algorithms and Networks >Probabilistic Frequent Itemsets Mining Based on Expectation Bound over Uncertain Database
【24h】

Probabilistic Frequent Itemsets Mining Based on Expectation Bound over Uncertain Database

机译:概率频繁项目集基于不确定数据库的期望界定的挖掘

获取原文

摘要

Frequent itemsets discovery is popular in database communities recently. Because real data is often affected by noise, in this paper, we study to find frequent itemsets over probabilistic database under the Possible World Semantics. It is challenging because there may be exponential number of possible worlds for probabilistic database. Although several efficient algorithms are proposed in the literature, it is hard to mine frequent itemsets in large uncertain database due to the high time consuming. To address this issue, we propose an efficient algorithm to mine probabilistic frequent itemsets. A pruning strategy is also presented to accelerate the process of generating candidates. Extensive experiments have been done on synthetic and real databases, demonstrating that the proposed method preforms better than state-of-art methods in most cases.
机译:频繁的项目集发现最近在数据库社区中受欢迎。因为真实数据往往受到噪声的影响,因此在本文中,我们研究在可能的世界语义下,在可能的世界语义下发现常见的项目集。它充满挑战性,因为可能存在概率数据库可能的世界世界。虽然文献中提出了几种高效的算法,但由于耗时的耗时量,仍然在大型不确定数据库中常用的频繁项目。要解决此问题,我们提出了一种高效的算法来挖掘概率频繁项目集。还提出了修剪策略以加速产生候选人的过程。已经在合成和实际数据库上进行了广泛的实验,证明了在大多数情况下,所提出的方法优于最先进的方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号