首页> 外文期刊>International Journal of Data Warehousing and Mining >A Hybrid Method for High-Utility Itemsets Mining in Large High-Dimensional Data
【24h】

A Hybrid Method for High-Utility Itemsets Mining in Large High-Dimensional Data

机译:大型高维数据中高可用性项集挖掘的混合方法

获取原文
获取原文并翻译 | 示例
       

摘要

Existing algorithms for high-utility itemsets mining are column enumeration based, adopting an Apriori-like candidate set generation-and-test approach, and thus are inadequate in datasets with high dimensions or long patterns. To solve the problem, this paper proposed a hybrid model and a row enumeration-based algorithm, i.e., Inter-transaction, to discover high-utility itemsets from two directions: an existing algorithm can be used to seek short high-utility itemsets from the bottom, while Inter-transaction can be used to seek long high-utility itemsets from the top. Inter-transaction makes full use of the characteristic that there are few common items between or among long transactions. By intersecting relevant transactions, the new algorithm can identify long high-utility itemsets, without extending short itemsets step by step. In addition, we also developed new pruning strategies and an optimization technique to improve the performance of Inter-transaction.
机译:现有的用于高效项集挖掘的算法是基于列枚举的,采用了类似Apriori的候选集生成和测试方法,因此在高维或长模式的数据集中是不够的。为了解决这个问题,本文提出了一种混合模型和一种基于行枚举的算法,即Inter-transaction,以从两个方向发现高实用性项目集:可以使用一种现有的算法从网络中寻找短的高实用性项目集。底部,而交互交易可用于从顶部开始寻找长期的高实用性项目集。交互交易充分利用了长交易之间或之间的共同项目很少的特征。通过相交相关交易,新算法可以识别长久的高用途项集,而无需一步一步扩展短项集。此外,我们还开发了新的修剪策略和优化技术来提高交互交易的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号