...
首页> 外文期刊>Turkish Journal of Electrical Engineering and Computer Sciences >SOTARM: Size of transaction-based association rule mining algorithm
【24h】

SOTARM: Size of transaction-based association rule mining algorithm

机译:SOTARM:基于事务的关联规则挖掘算法的大小

获取原文
           

摘要

SOTARM: Size of transaction-based association rule mining algorithm Authors: ASHA PANDIAN, JEBARAJAN THAVEETHU Abstract: Mining of association rules tries to identify the existence of promising and fruitful relations among the items present in a database. The basic a priori algorithm suffers from multiple database scans, and if the database is large, then the time taken for scanning and generation of candidates is also large. The proposed algorithm attempts to reduce the repeated scanning of the whole database. Using this algorithm, scanning time and also the generation of subitems that are not frequent can be reduced. The former can be done by sorting the transaction records in descending order based on the size of transaction (SOT) and scanning only those transactions whose SOT is greater than or equal to k (size of item sets). The latter can be done by analyzing the item set state. It is not required to generate the next set of candidate item sets using those item sets that are not frequent. Both positive and negative mining has been done in R Studio of the R data mining tool using the R language. Experimental results show that the SOT algorithm performs better than the Apriori, Eclat, PVARM (partition-based validation for association rule mining), and NRRM (nonredundant rule method) algorithms. The work has been tested against various standard datasets such as Adult, Genome, Groceries, and SER (State Electricity Rate) Prediction. The speed-up and efficiency parameter values obtained from the algorithm strongly suggest that the proposed SOTARM algorithm has attained better performance when compared to all the other existing algorithms.
机译:SOTARM:基于事务的关联规则挖掘算法的大小作者:ASHA PANDIAN,JEBARAJAN THAVEETHU摘要:关联规则的挖掘试图识别数据库中存在的项目之间存在有希望且富有成果的关系。基本的先验算法遭受多次数据库扫描的困扰,如果数据库很大,则扫描和生成候选对象所花费的时间也很大。提出的算法试图减少整个数据库的重复扫描。使用该算法,可以减少扫描时间以及不频繁的子项的生成。前者可以通过基于事务大小(SOT)按降序对事务记录进行排序,并仅扫描SOT大于或等于k(项目集大小)的那些事务来完成。后者可以通过分析项目设置状态来完成。不需要使用那些不频繁的项目集来生成下一组候选项目集。使用R语言在R数据挖掘工具的R Studio中完成了正负挖掘。实验结果表明,SOT算法的性能优于Apriori,Eclat,PVARM(基于分区的关联规则挖掘验证)和NRRM(非冗余规则方法)算法。该工作已经针对各种标准数据集进行了测试,例如成人,基因组,食品杂货和SER(状态电价)预测。从该算法获得的提速和效率参数值强烈表明,与所有其他现有算法相比,提出的SOTARM算法具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号