首页> 外文期刊>BMC Bioinformatics >Locating tandem repeats in weighted sequences in proteins
【24h】

Locating tandem repeats in weighted sequences in proteins

机译:在蛋白质的加权序列中定位串联重复序列

获取原文

摘要

A weighted biological sequence is a string in which a set of characters may appear at each position with respective probabilities of occurrence. We attempt to locate all the tandem repeats in a weighted sequence. A repeated substring is called a tandem repeat if each occurrence of the substring is directly adjacent to each other. By introducing the idea of equivalence classes in weighted sequences, we identify the tandem repeats of every possible length using an iterative partitioning technique. We also present the algorithm for recording the tandem repeats, and prove that the problem can be solved in O(n 2) time.
机译:加权生物序列是一个字符串,其中一组字符可能以每个出现的概率出现在每个位置。我们尝试按加权序列定位所有串联重复序列。如果子字符串的每次出现都彼此直接相邻,则重复的子字符串称为串联重复。通过在加权序列中引入等价类的概念,我们使用迭代划分技术来确定每个可能长度的串联重复序列。我们还提出了记录串联重复序列的算法,并证明该问题可以在O(n 2 )时间内解决。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号