Asian Conference on Intelligent Information and Database Systems

A High-Performance Algorithm for Mining Repeating Patterns

Abstract

A repeating pattern is a sequence composed of identical elements that repeats in a regular manner. Many real-life applications, such as musical and medical sequences, contain valuable repeating patterns. Because the repeating patterns hidden in sequences may carry implicit knowledge, retrieving them effectively and efficiently has been a challenging issue in recent years. Although a number of past studies have addressed this issue, their performance still does not satisfy users, especially on large datasets. To address this issue, we propose an efficient algorithm named Fast Mining of Repeating Patterns (FMRP), which achieves high performance in finding repeating patterns by means of a novel index called Quick-Pattern-Index (QPI). Because it records pattern positions, this index provides effective support for the FMRP algorithm. Rather than scanning a given sequence iteratively, FMRP discovers the repeating patterns in only one scan of the sequence. The experimental results reveal that the proposed algorithm outperforms the compared methods in terms of execution time.
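
The abstract does not specify how QPI is laid out, so the following is only a minimal sketch of the general idea it describes: record element positions in one scan, then grow patterns from the recorded positions instead of rescanning the sequence. The function names, the `min_freq` threshold, and the choice to count overlapping occurrences are all assumptions made for illustration, not details of FMRP or QPI.

```python
from collections import defaultdict


def build_position_index(sequence):
    """Scan the sequence once and record where each element occurs.

    Hypothetical stand-in for a position index; not the paper's QPI.
    """
    index = defaultdict(list)
    for pos, element in enumerate(sequence):
        index[element].append(pos)
    return index


def mine_repeating_patterns(sequence, min_freq=2):
    """Return {pattern tuple: occurrence count} for patterns seen >= min_freq times.

    Assumption: occurrences may overlap, and frequency is the number of
    start positions at which the pattern appears.
    """
    index = build_position_index(sequence)
    # Sets give O(1) membership checks when extending a pattern.
    position_sets = {elem: set(pos) for elem, pos in index.items()}

    # Start with single-element patterns that are frequent enough.
    frontier = {
        (elem,): starts
        for elem, starts in index.items()
        if len(starts) >= min_freq
    }
    result = {}
    while frontier:
        for pattern, starts in frontier.items():
            result[pattern] = len(starts)
        next_frontier = {}
        for pattern, starts in frontier.items():
            length = len(pattern)
            for elem, elem_positions in position_sets.items():
                # Keep the start positions where `elem` follows the pattern.
                extended = [s for s in starts if s + length in elem_positions]
                if len(extended) >= min_freq:
                    next_frontier[pattern + (elem,)] = extended
        frontier = next_frontier
    return result
```

Calling `mine_repeating_patterns("ABCABCAB", min_freq=2)` reads the input only while building the index; every later extension step consults the stored positions. A real implementation would likely prune more aggressively and might define frequency differently.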