首页> 外文期刊>PLoS One >HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
【24h】

HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth

机译:Huil-TN&Hui-TN:基于模式增长的矿业高实用项目集

获取原文
           

摘要

In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-Node tree (HUI-TN). These two algorithms avoid the time-consuming candidate generation stage and the need of scanning the original dataset multiple times for exact utility values. A novel tree structure, named tail-node tree (TN-tree) is proposed as a key element of our algorithms to maintain complete utililty-information of existing itemsets of a dataset. The performance of HUIL-TN and HUI-TN was evaluated against state-of-the-art reference methods on various datasets. Experimental results showed that our algorithms exceed or close to the best performance on all datasets in terms of running time, while other algorithms can only excel in certain types of dataset. Scalability tests were also performed and our algorithms obtained the flattest curves among all competitors.
机译:近年来,高效项目集(HUIS)挖掘一直是数据挖掘的积极研究主题。在这项研究中,我们提出了两个有效的模式 - 生长基于Lui挖掘算法,基于长度和尾部节点树(Huil-Tn)和高实用程序项集,称为高实用程序项集和基于尾部节点树(Hui-TN)。这两个算法避免了耗时的候选生成阶段,并且需要多次扫描原始数据集以进行精确的实用程序值。一个名为尾部节点树(TN树)的新颖树结构被提出为我们算法的一个关键元素,以维护数据集的现有项目集的完整效果信息。对各种数据集的最先进的参考方法评估Huil-Tn和Hui-Tn的性能。实验结果表明,在运行时间方面,我们的算法超过了所有数据集的最佳性能,而其他算法只能在某些类型的数据集中Excel。还进行了可扩展性测试,我们的算法在所有竞争对手之间获得了最平坦的曲线。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号