首页> 外文会议>International Conference on Data Management Technologies and Applications >A Unifying Polynomial Model for Efficient Discovery of Frequent Itemsets
【24h】

A Unifying Polynomial Model for Efficient Discovery of Frequent Itemsets

机译:一种统一多项式模型,用于有效发现频繁项目集

获取原文

摘要

It is well-known that developing a unifying theory is one of the most important issues in Data Mining research. In the last two decades, a great deal has been devoted to the algorithmic aspects of the Frequent Itemset (FI) Mining problem. We are motivated by the need of formal modeling in the field. Thus, we introduce and analyze, in this theoretical study, a new model for the FI mining task. Indeed, we encode the itemsets as words over an ordered alphabet, and state this problem by a formal series over the counting semiring (N,+,×,0,1), whose the range constitutes the itemsets and the coefficients their supports. This formalism offers many advantages in both fundamental and practical aspects: The introduction of a clear and unified theoretical framework through which we can express the main FI-approaches, the possibility of their generalization to mine other more complex objects, and their incrementalization and/or parallelization; in practice, we explain how this problem can be seen as that of word recognition by an automaton, allowing an efficient implementation in O(|Q|) space and O(|F_L||Q|]) time, where Q is the set of states of the automaton used for representing the data, and F_L the set of prefixial maximal FI.
机译:众所周知,发展统一理论是数据挖掘研究中最重要的问题之一。在过去的二十年中,频繁项目集(FI)挖掘问题的算法方面已经致力于大量大量。我们受到在现场正式建模的需求的推动。因此,我们在本理论研究中介绍和分析,这是一个新型挖掘任务的新模型。实际上,我们将项目集编码为上令字母表上的单词,并通过Counting Semiring(n,+,x,0,1)上的正式系列来说明这个问题,其范围构成了项目集和它们支持的系数。这种形式主义在基本和实践方面提供了许多优势:引入了一个明确和统一的理论框架,我们可以通过它表达主要方法,其概括到挖掘其他更复杂的物体的可能性,以及它们的递增和/或并行化;在实践中,我们解释了自动机器可以看到这个问题如何看出单词识别,允许在O(| Q |)空间和O(| f_l || q |)的有效实现,其中q是集合用于代表数据的自动机的状态,以及F_L的预征最大fi集合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号