首页> 外文会议>IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference >A mining algorithm for distributed global maximal frequent itemsets based on Sorted SCan-Tree
【24h】

A mining algorithm for distributed global maximal frequent itemsets based on Sorted SCan-Tree

机译:基于排序的SCan-Tree的分布式全局最大频繁项集挖掘算法

获取原文

摘要

A new algorithm, named SCan-MAX for mining distributed maximal frequent itemsets from databases was proposed, the SCan-MAX used Sorted SCan-tree to store all the information of the transactions from the databases. SCan-MAX firstly scanned the local database and then gets the global 1-frequent itemsets, and then it created the SCan-tree on each node and used orderly sequence to store frequent itemsets. All the local SCan-tree was send to the master node to create the global SCan-tree. Then, the root's sub-tree was send to the slave node to mine local maximal frequent itemsets and send the information of itemsets to the master node to mine the global maximal frequent itemsets. Experimental results show that the algorithm makes full use of the characteristics of SCan-Tree, significantly reduces the amount of communication required between nodes, thus enhanced the efficiency of the algorithm. Theoretical analysis and experimental results show that the algorithm has a good time and space efficiency.
机译:提出了一种新的算法,用于挖掘数据库的分布式最大频繁项集的名为Scan-Max,Scan-Max使用的排序扫描树从数据库中存储事务的所有信息。 Scan-Max首先扫描本地数据库,然后获取全局1常常项目集,然后它在每个节点上创建了扫描树,并使用有序的序列来存储频繁的项目集。所有本地扫描树都被发送到主节点以创建全局扫描树。然后,将根树发送到从节点以挖掘本地最大频繁项集,并将项目集的信息发送到主节点以挖掘全局最大频繁项目集。实验结果表明,该算法充分利用了扫描树的特性,显着降低了节点之间所需的通信量,从而提高了算法的效率。理论分析和实验结果表明,该算法具有良好的时空和空间效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号