首页> 外文会议>International conference on advances in computing, communications and informatics >Maximal pattern matching with flexible wildcard gaps and one-off constraint
【24h】

Maximal pattern matching with flexible wildcard gaps and one-off constraint

机译:具有灵活通配符间隙和一次性约束的最大模式匹配

获取原文

摘要

Pattern matching is a fundamental operation in finding knowledge from large amount of biosequence data. Finding patterns help in analyzing the property of a sequence. This paper focuses on the problem of maximal pattern matching with flexible wildcard gaps and length constraints under the one-off condition. The problem is to find the maximum number of occurrences of a pattern P with user specified wildcard gap between every two consecutive letters of P in a biological sequence S under the one-off condition and constraint on the overall length of the matching occurrence. To obtain the optimal solution for this problem is difficult. We propose a heuristic algorithm, MOGO, based on the Nettree data structure to solve this problem. Theoretical analysis and experimental results demonstrate that MOGO performs better than the existing algorithms in most of the cases when tested on real world biological sequences.
机译:模式匹配是从大量生物序列数据中寻找知识的基本操作。查找模式有助于分析序列的属性。本文着眼于一次性条件下具有灵活通配符间隙和长度约束的最大模式匹配问题。问题在于找到一种模式P的最大出现次数,该模式P在一次性条件和匹配发生的总长度的约束下,在生物序列S中用户指定的P的每两个连续字母之间具有用户指定的通配符间隙。要获得针对此问题的最佳解决方案是困难的。我们提出了一种基于Nettree数据结构的启发式算法MOGO,以解决此问题。理论分析和实验结果表明,在真实世界的生物序列上进行测试时,MOGO在大多数情况下的性能均优于现有算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号