首页> 外文会议>IEEE Symposium Series on Computational Intelligence >Memory efficient mining of periodic-frequent patterns in transactional databases
【24h】

Memory efficient mining of periodic-frequent patterns in transactional databases

机译:事务性数据库中内存高效的周期-频繁模式挖掘

获取原文

摘要

Periodic-frequent patterns are an important class of regularities which exists in a transactional database. A frequent pattern is called periodic-frequent if it appears at regular intervals in a transactional database. In the literature, a model of periodic-frequent patterns was proposed and pattern growth like approaches to extract patterns are being explored. In these approaches, a periodic-frequent pattern tree is built in which a transaction-id list is maintained at each path's tail-node. As the typical size of transactional database is very huge in the modern e-commerce era, extraction of periodic-frequent patterns by maintaining transaction-ids in the tree requires more memory. In this paper, to reduce the memory requirements, we introduced a notion of period summary by capturing the periodicity of the patterns in a sequence of transaction-ids. While building the tree, the period summary of the transactions is computed and stored at the tail-node of the tree instead of the transaction-ids. We have also proposed a merging framework for period summaries for mining periodic-frequent patterns. The performance could be improved significantly as the memory required to store the period summaries is significantly less than the memory required to store the transaction-id list. Experimental results show that the proposed approach reduces the memory consumption significantly and also improves the runtime efficiency considerably over the existing approaches.
机译:周期性-频繁模式是事务数据库中存在的一类重要的规则。如果频繁模式在事务数据库中以固定间隔出现,则称为周期性频繁。在文献中,提出了一种周期-频繁模式的模型,并且正在探索模式增长之类的提取模式的方法。在这些方法中,建立了周期性频繁的模式树,其中在每个路径的尾节点处维护了一个交易ID列表。在现代电子商务时代,由于事务数据库的典型大小非常庞大,因此通过在树中维护事务ID来提取周期-频繁模式需要更多的内存。在本文中,为了减少内存需求,我们通过捕获一系列事务ID中的模式的周期性来引入周期汇总的概念。在构建树时,将计算交易的期间摘要并将其存储在树的末尾节点,而不是存储在交易ID中。我们还提出了一种用于汇总周期性频率模式的周期摘要的合并框架。由于存储期间摘要所需的内存明显少于存储事务处理ID列表所需的内存,因此可以显着提高性能。实验结果表明,与现有方法相比,该方法显着降低了内存消耗,并显着提高了运行时效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号