...
首页> 外文期刊>Theoretical computer science >Motif matching using gapped patterns
【24h】

Motif matching using gapped patterns

机译:使用间隙图案进行图案匹配

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We present new algorithms for the problem of multiple string matching of gapped patterns, where a gapped pattern is a sequence of strings such that there is a gap of fixed length between each two consecutive strings. The problem has applications in the discovery of transcription factor binding sites in DNA sequences when using generalized versions of the Position Weight Matrix model to describe transcription factor specificities. In these models a motif can be matched as a set of gapped patterns with unit-length keywords. The existing algorithms for matching a set of gapped patterns are worst-case efficient but not practical, or vice versa, in this particular case. The novel algorithms that we present are based on dynamic programming and bit-parallelism, and lie in a middle-ground among the existing algorithms. In fact, their time complexity is close to the best existing bound and, yet, they are also practical. We also provide experimental results which show that the presented algorithms are fast in practice, and preferable if all the strings in the patterns have unit-length.
机译:我们提出了用于间隙模式的多个字符串匹配问题的新算法,其中间隙模式是一串字符串,这样在每两个连续的字符串之间都有固定长度的间隙。当使用位置权重矩阵模型的广义版本描述转录因子特异性时,该问题在发现DNA序列中的转录因子结合位点方面具有应用价值。在这些模型中,主题可以作为带有单位长度关键字的一组空白模式进行匹配。在这种特殊情况下,用于匹配一组间隙模式的现有算法在最坏情况下有效,但不切实际,反之亦然。我们提出的新颖算法基于动态编程和位并行,并且位于现有算法之中。实际上,它们的时间复杂度接近最佳的现有界限,但是它们也是实用的。我们还提供了实验结果,表明所提出的算法在实践中速度很快,如果模式中的所有字符串均具有单位长度,则更可取。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号