【24h】

TKE: Mining Top-K Frequent Episodes

机译:TKE:挖掘Top-K频繁剧集

获取原文

摘要

Frequent episode mining is a popular data mining task for analyzing a sequence of events. It consists of identifying all subsequences of events that appear at least minsup times. Though traditional episode mining algorithms have many applications, a major problem is that setting the minsup parameter is not intuitive. If set too low, algorithms can have long execution times and find too many episodes, while if set too high, algorithms may find few patterns, and hence miss important information. Choosing minsup to find enough but not too many episodes is typically done by trial and error, which is time-consuming. As a solution, this paper redefines the task of frequent episode mining as top-k frequent episode mining, where the user can directly set the number of episodes k to be found. A fast algorithm named TKE is presented to find the top-k episodes in an event sequence. Experiments on benchmark datasets shows that TKE performs well and that it is a valuable alternative to traditional frequent episode mining algorithms.
机译:频繁情节挖掘是用于分析事件序列的流行数据挖掘任务。它由识别至少出现消极时间的事件的所有子序列组成。尽管传统的情节挖掘算法有许多应用,但主要问题是设置minsup参数不直观。如果设置得太低,算法可能会执行时间长并且找到太多的情节;而如果设置得太高,算法可能会发现很少的模式,因此会丢失重要的信息。选择minsup来查找足够但不太多的情节通常是通过反复试验来完成的,这非常耗时。作为解决方案,本文将频繁情节挖掘的任务重新定义为top-k频繁情节挖掘,用户可以直接设置要查找的情节数k。提出了一种名为TKE的快速算法,以查找事件序列中的前k个情节。在基准数据集上进行的实验表明,TKE表现良好,并且是传统频繁情节挖掘算法的一种有价值的替代方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号