【24h】

Motif Extraction from Weighted Sequences

机译:从加权序列中提取图案

获取原文
获取原文并翻译 | 示例

摘要

We present in this paper three algorithms. The first extracts repeated motifs from a weighted sequence. The motifs correspond to words which occur at least q times and with hamming distance e in a weighted sequence with probability ≥ 1/k each time, where k is a small constant. The second algorithm extracts common motifs from a set of N ≥ 2 weighted sequences with hamming distance e. In the second case, the motifs must occur twice with probability ≥ 1/k, in 1 ≤ q ≤ N distinct sequences of the set. The third algorithm extracts maximal pairs from a weighted sequence. A pair in a sequence is the occurrence of the same substring twice. In addition, the algorithms presented in this paper improve slightly on previous work on these problems.
机译:我们在本文中提出了三种算法。首先从加权序列中提取重复的基序。图案对应于出现至少q次且以加权序列汉明距离e且每次概率≥1 / k的单词,其中k是一个小常数。第二种算法从具有汉明距离e的N≥2个加权序列的集合中提取公共图案。在第二种情况下,主题必须在集合中1≤q≤N个不同的序列中以概率≥1 / k出现两次。第三种算法从加权序列中提取最大对。序列中的一对是同一子串出现两次。此外,本文提出的算法在以前针对这些问题的工作上略有改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号