【24h】

MINI: Mining Informative Non-redundant Itemsets

机译:MINI:挖掘信息性非冗余项目集

获取原文
获取原文并翻译 | 示例

摘要

Frequent itemset mining assists the data mining practitioner in searching for strongly associated items (and transactions) in large transaction databases. Since the number of frequent itemsets is usually extremely large and unmanageable for a human user, recent works have sought to define condensed representations of them, e.g. closed or maximal frequent itemsets. We argue that not only these methods often still fall short in sufficiently reducing of the output size, but they also output many redundant itemsets. In this paper we propose a philosophically new approach that resolves both these issues in a computationally tractable way. We present and empirically validate a statistically founded approach called MINI, to compress the set of frequent itemsets down to a list of informative and non-redundant itemsets.
机译:频繁的项目集挖掘可帮助数据挖掘从业人员在大型交易数据库中搜索紧密关联的项目(和交易)。由于频繁项集的数量通常非常大并且对于人类用户来说是难以管理的,因此最近的工作试图定义它们的简明表示,例如,图2。封闭或最大频繁项集。我们认为,不仅这些方法常常不足以充分减小输出大小,而且它们还会输出许多冗余项集。在本文中,我们提出了一种哲学上新颖的方法,以一种可计算的方式解决了这两个问题。我们提出并凭经验验证了一种称为MINI的统计学基础上的方法,该方法可将频繁项集的集合压缩到信息丰富和非冗余项集的列表中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号