首页> 外文会议>Artificial intelligence and applications : Machine learning >FREQUENT PATTERN MINING FROM HIGH-DIMENSIONAL DATA USING RECORD SPACE SEARCH
【24h】

FREQUENT PATTERN MINING FROM HIGH-DIMENSIONAL DATA USING RECORD SPACE SEARCH

机译:使用记录空间搜索从高维数据挖掘频繁模式

获取原文
获取原文并翻译 | 示例

摘要

Traditional frequent pattern mining methods have a problem in that the order of calculation exponentially increases with high-dimensional data because of a search using combinations of attributes. The purpose of our work is to develop methods that efficiently extract frequent patterns from very high-dimensional data. We propose HD FPM that can solve the problem using a record space search and a minimum pattern length pruning. The record space search means the search using combinations of records. We can extract frequent patterns from attributes common to the combinations of records. We can also reduce a search space using a minimum pattern length pruning. Several experiments on real microarray datasets show that HD FPM has better performance than previous closed frequent pattern mining algorithms such as FPclose and CHARM in the case that minimum support is low. We also propose parallel HD FPM that can solve the problem using vertical partitioning of a database and parallel processing. Our evaluation of parallel HD FPM performed with a real microarray dataset on 16 PCs has revealed that it is 13 times faster than a sequential one. In conclusion, HD FPM and parallel HD FPM are effective algorithms for frequent pattern mining from high-dimensional data.
机译:传统的频繁模式挖掘方法存在一个问题,即由于使用属性组合进行搜索,因此高维数据的计算顺序呈指数增长。我们工作的目的是开发可从超高维数据中高效提取频繁模式的方法。我们提出了HD FPM,它可以使用记录空间搜索和最小模式长度修剪来解决问题。记录空间搜索是指使用记录组合进行搜索。我们可以从记录组合的公共属性中提取频繁模式。我们还可以使用最小模式长度修剪来减少搜索空间。在真实的微阵列数据集上进行的一些实验表明,在最小支持量较低的情况下,HD FPM具有比以前的封闭频繁模式挖掘算法(例如FPclose和CHARM)更好的性能。我们还提出了并行HD FPM,它可以使用数据库的垂直分区和并行处理来解决该问题。我们对16台PC上的真实微阵列数据集进行的并行HD FPM评估显示,它比顺序PC快13倍。总之,HD FPM和并行HD FPM是从高维数据频繁进行模式挖掘的有效算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号