首页> 外文期刊>SIGKDD explorations >Mining High Utility Episodes in Complex Event Sequences
【24h】

Mining High Utility Episodes in Complex Event Sequences

机译:在复杂事件序列中挖掘高实用情节

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Frequent episode mining (FEM) is an interesting research topic in data mining with wide range of applications. However, the traditional framework of FEM treats all events as having the same importance/utility and assumes that a same type of event appears at most once at any time point. These simplifying assumptions do not reflect the characteristics of scenarios in real applications and thus the useful information of episodes in terms of utilities such as profits is lost. Furthermore, most studies on FEM focused on mining episodes in simple event sequences and few considered the scenario of complex event sequences, where different events can occur simultaneously. To address these issues, in this paper, we incorporate the concept of utility into episode mining and address a new problem of mining high utility episodes from complex event sequences, which has not been explored so far. In the proposed framework, the importance/utility of different events is considered and multiple events can appear simultaneously. Several novel features are incorporated into the proposed framework to resolve the challenges raised by this new problem, such as the absence of antimonotone property and the huge set of candidate episodes. Moreover, an efficient algorithm named UP-Span (Utility ePisodes mining by Spanning prefixes) is proposed for mining high utility episodes with several strategies incorporated for pruning the search space to achieve high efficiency. Experimental results on real and synthetic datasets show that UP-Span has excellent performance and serves as an effective solution to the new problem of mining high utility episodes from complex event sequences.
机译:频繁情节挖掘(FEM)是数据挖掘中一个有趣的研究主题,具有广泛的应用范围。但是,FEM的传统框架将所有事件视为具有相同的重要性/效用,并假定同一类型的事件在任何时间点最多出现一次。这些简化的假设并不能反映实际应用中场景的特征,因此会丢失效用方面的有用情节信息,例如利润。此外,大多数关于有限元的研究都集中在简单事件序列中的挖掘事件上,很少考虑复杂事件序列的场景,其中不同事件可以同时发生。为了解决这些问题,在本文中,我们将效用的概念纳入情节挖掘中,并解决了从复杂事件序列中挖掘高效情节的新问题,到目前为止尚未探索。在提出的框架中,考虑了不同事件的重要性/效用,并且多个事件可以同时出现。提议的框架中合并了一些新颖的功能,以解决此新问题所带来的挑战,例如缺少反单调特性和大量候选事件。此外,提出了一种有效的算法UP-Span(通过跨越前缀的实用ePisodes挖掘),用于挖掘高效事件,并结合了几种策略来修剪搜索空间以实现高效率。在真实数据集和合成数据集上的实验结果表明,UP-Span具有出色的性能,可以有效地解决从复杂事件序列中挖掘高效事件的新问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号