【24h】

Motif Matching Using Gapped Patterns

机译:使用采用隐形图案的图案匹配

获取原文

摘要

We consider the problem of matching a set P of gapped patterns against a given text T of length n, where a gapped pattern is a sequence of strings (keywords), over a finite alphabet Σ of size σ, such that there is a gap of fixed length between each two consecutive strings.We assume the RAM model, with words of size w in bits.We are interested in computing the list ofmatching patterns for each position in the text. This problem is a specific instance of the Variable Length Gaps problem [2] (VLG problem) for multiple patterns and has applications in the discovery of transcription factor (TF) binding sites in DNA sequences when using generalized versions of the PositionWeightMatrix (PWM) model to representTF binding specificities. The paper [5] describes howa motif represented as a generalizedPWM can bematched as a set of gapped patternswith unit-length keywords, and presents algorithms for the restricted case of patterns with two unit-length keywords.
机译:我们考虑匹配覆盖图案的集合p对长度n的给定文本t的问题,其中覆盖图案是字符串(关键字),在大小σ的有限字母表中,使得存在间隙每个两个连续字符串之间的固定长度。我们假设RAM模型,具有尺寸W的单词.we对计算文本中每个位置的匹配模式列表有兴趣。该问题是多个模式的可变长度间隙问题[2](VLG问题)的特定实例,并且在使用PositionWightMatrix(PWM)模型的广义版本时,在DNA序列中发现转录因子(TF)结合位点的应用代表结合特异性。本文描述了作为一组GemappedPWM表示的MOTIF作为一组覆盖的图案标志长度关键字,并且为具有两个单位长度关键字的图案的局限性案例提出了算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号