首页> 外文期刊>Theoretical computer science >Computational aspects of mining maximal frequent patterns
【24h】

Computational aspects of mining maximal frequent patterns

机译:挖掘最大频繁模式的计算方面

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper we study the complexity-theoretic aspects of mining maximal frequent patterns, from the perspective of counting the number of all distinct solutions. We present the first formal proof that the problem of counting the number of maximal frequent itemsets in a database of transactions, given an arbitrary support threshold, is #P-complete, thereby providing theoretical evidence that the problem of mining maximal frequent itemsets is NP-hard. We also extend our complexity analysis to other similar data mining problems that deal with complex data structures, such as sequences, trees, and graphs. We investigate several variants of these mining problems in which the patterns of interest are subsequences, subtrees, or subgraphs, and show that the associated problems of counting the number of maximal frequent patterns are all either #P-complete or #P-hard.
机译:在本文中,我们将从对所有不同解的数量进行计数的角度研究挖掘最大频繁模式的复杂性理论方面。我们提供了第一个形式证明,即在给定任意支持阈值的情况下,对交易数据库中最大频繁项集的数量进行计数的问题是#P-完全的,从而提供了理论上的证据,即挖掘最大频繁项集的问题是NP-硬。我们还将复杂性分析扩展到处理复杂数据结构(例如序列,树和图)的其他类似数据挖掘问题。我们研究了这些挖掘问题的几种变体,其中感兴趣的模式是子序列,子树或子图,并表明计算最大频繁模式数的相关问题都是#P-complete或#P-hard。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号