首页> 外文会议>Data Engineering, ICDE, 2009 IEEE 25th International Conference on >Mining of Frequent Itemsets from Streams of Uncertain Data
【24h】

Mining of Frequent Itemsets from Streams of Uncertain Data

机译:从不确定数据流中挖掘频繁项集

获取原文

摘要

Frequent itemset mining plays an essential role in the mining of various patterns and is in demand in many real-life applications. Hence, mining of frequent itemsets has been the subject of numerous studies since its introduction. Generally, most of these studies find frequent itemsets from traditional transaction databases, in which the content of each transaction--namely, items--is definitely known and precise. However, there are many real-life situations in which ones are uncertain about the content of transactions. This calls for the mining of uncertain data. Moreover, due to advances in technology, a flood of precise or uncertain data can be produced in many situations. This calls for the mining of data streams. To deal with these situations, we propose two tree-based mining algorithms to efficiently find frequent itemsets from streams of uncertain data, where each item in the transactions in the streams is associated with an existential probability. Experimental results show the effectiveness of our algorithms in mining frequent itemsets from streams of uncertain data.
机译:频繁项集挖掘在各种模式的挖掘中起着至关重要的作用,并且在许多实际应用中都需要它。因此,自引入以来,频繁项集的挖掘一直是众多研究的主题。通常,大多数研究从传统交易数据库中发现频繁的项目集,其中每个交易的内容(即项目)都是已知且精确的。但是,在现实生活中,有很多情况都无法确定交易的内容。这要求挖掘不确定的数据。此外,由于技术的进步,在许多情况下都可能产生大量的精确或不确定数据。这要求挖掘数据流。为了应对这些情况,我们提出了两种基于树的挖掘算法,以从不确定数据流中高效查找频繁项集,其中流中事务中的每个项都与一个存在概率相关联。实验结果表明,我们的算法在从不确定数据流中提取频繁项集的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号