首页> 外文期刊>Expert Systems with Application >Mining top-k co-occurrence items with sequential pattern
【24h】

Mining top-k co-occurrence items with sequential pattern

机译:使用顺序模式挖掘前k个同现项

获取原文
获取原文并翻译 | 示例
       

摘要

Frequent sequential pattern mining has become one of the most important tasks in data mining. It has many applications, such as sequential analysis, classification, and prediction. How to generate candidates and how to control the combinatorically explosive number of intermediate subsequences are the most difficult problems. Intelligent systems such as recommender systems, expert systems, and business intelligence systems use only a few patterns, namely those that satisfy a number of defined conditions. Challenges include the mining of top-k patterns, top-rank-k patterns, closed patterns, and maximal patterns. In many cases, end users need to find itemsets that occur with a sequential pattern. Therefore, this paper proposes approaches for mining top-k co-occurrence items usually found with a sequential pattern. The Naive Approach Mining (NAM) algorithm discovers top-k co-occurrence items by directly scanning the sequence database to determine the frequency of items. The Vertical Approach Mining (VAM) algorithm is based on vertical database scanning. The Vertical with Index Approach Mining (VIAM) algorithm is based on a vertical database with index scanning. VAM and VIAM use pruning strategies to reduce the search space, thus improving performance. VAM and VIAM are especially effective in mining the co-occurrence items of a long input pattern. The three algorithms were evaluated using real-world databases. The experimental results show that these algorithms perform well, especially VAM and VIAM. (C) 2017 Elsevier Ltd. All rights reserved.
机译:频繁的顺序模式挖掘已成为数据挖掘中最重要的任务之一。它具有许多应用程序,例如顺序分析,分类和预测。如何生成候选对象以及如何控制中间子序列的爆炸性数量是最困难的问题。诸如推荐系统,专家系统和商业智能系统之类的智能系统仅使用一些模式,即那些满足许多已定义条件的模式。挑战包括挖掘前k个模式,前k个模式,闭合模式和最大模式。在许多情况下,最终用户需要查找以顺序模式出现的项目集。因此,本文提出了挖掘通常以顺序模式发现的前k个同现项的方法。天真的方法挖掘(NAM)算法通过直接扫描序列数据库以确定项目的频率来发现前k个同时出现的项目。垂直进近挖掘(VAM)算法基于垂直数据库扫描。带有索引方法的垂直挖掘(VIAM)算法基于带有索引扫描的垂直数据库。 VAM和VIAM使用修剪策略来减少搜索空间,从而提高性能。 VAM和VIAM在挖掘长输入模式的共现项时特别有效。使用实际数据库评估了这三种算法。实验结果表明,这些算法表现良好,尤其是VAM和VIAM。 (C)2017 Elsevier Ltd.保留所有权利。

著录项

  • 来源
    《Expert Systems with Application》 |2017年第11期|123-133|共11页
  • 作者单位

    Univ Sci, VNU HCM, Fac Informat Technol, Ho Chi Minh City, Vietnam;

    Ho Chi Minh City Univ Technol, Fac Informat Technol, Ho Chi Minh City, Vietnam|Sejong Univ, Coll Elect & Informat Engn, Seoul, South Korea;

    Ton Duc Thang Univ, Div Data Sci, Ho Chi Minh City, Vietnam|Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam;

    Peking Univ, Sch Elect Engn & Comp Sci, Minist Educ, Key Lab Machine Percept, Beijing 100871, Peoples R China;

    Univ Sci, VNU HCM, Fac Informat Technol, Ho Chi Minh City, Vietnam;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Top-k mining; Co-occurrence sequential mining; Sequential pattern mining;

    机译:Top-k挖掘;共现顺序挖掘;顺序模式挖掘;
  • 入库时间 2022-08-17 13:29:11

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号