Motif Extraction from Weighted Sequences

Abstract

This paper presents three algorithms. The first extracts repeated motifs from a weighted sequence: words that occur at least q times, with Hamming distance e, each occurrence having probability ≥ 1/k, where k is a small constant. The second extracts common motifs, with Hamming distance e, from a set of N ≥ 2 weighted sequences; here the motifs must occur twice with probability ≥ 1/k, in q distinct sequences of the set, where 1 ≤ q ≤ N. The third extracts maximal pairs from a weighted sequence, where a pair is two occurrences of the same substring in a sequence. The algorithms also improve slightly on previous work on these problems.
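The repeated-motif condition can be made concrete with a small brute-force sketch. It is not the paper's algorithm, only an illustration under stated assumptions: a weighted sequence is modelled as a list of per-position symbol distributions, the probability of a word at a position is taken to be the product of its symbol probabilities, and "Hamming distance e" is read as "at most e mismatches". The function names and the default alphabet are hypothetical.

```python
from itertools import product
from math import prod


def occurrence_probability(wseq, word, start):
    """Probability that `word` occurs at `start` (assumed: product of symbol probabilities)."""
    if start < 0 or start + len(word) > len(wseq):
        return 0.0
    return prod(wseq[start + j].get(ch, 0.0) for j, ch in enumerate(word))


def hamming(a, b):
    """Number of mismatching positions between two equal-length words."""
    return sum(x != y for x, y in zip(a, b))


def repeated_motifs(wseq, length, q, e, k, alphabet="ACGT"):
    """Brute force: motifs matched at >= q start positions by words with probability >= 1/k."""
    threshold = 1.0 / k
    # Every (start, word) pair whose occurrence probability reaches 1/k.
    solid = [
        (i, "".join(w))
        for i in range(len(wseq) - length + 1)
        for w in product(alphabet, repeat=length)
        if occurrence_probability(wseq, "".join(w), i) >= threshold
    ]
    motifs = {}
    for cand in product(alphabet, repeat=length):
        motif = "".join(cand)
        # Start positions where some sufficiently probable word is within e mismatches.
        positions = sorted({i for i, w in solid if hamming(motif, w) <= e})
        if len(positions) >= q:
            motifs[motif] = positions
    return motifs


# Position 1 is uncertain: C or G, each with probability 0.5.
wseq = [{"A": 1.0}, {"C": 0.5, "G": 0.5}, {"A": 1.0}, {"C": 1.0}, {"A": 1.0}]
print(repeated_motifs(wseq, length=2, q=2, e=0, k=2))
```

The enumeration is exponential in the motif length, so it is only useful for checking the definition on toy inputs.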
