首页> 外文会议>IEEE International Symposium on Bioinformatics and Bioengineering >Iterative Refinement of Repeat Sequence Specification Using Constrained Pattern Matching
【24h】

Iterative Refinement of Repeat Sequence Specification Using Constrained Pattern Matching

机译:使用受限模式匹配的重复序列规范的迭代改进

获取原文
获取外文期刊封面目录资料

摘要

Repeated sequences in genome are structures which indicate important biological functions such as protein binding. They are associated with various genetic diseases. We consider the problem of finding a specification for a "significant" repeating pattern in a given sequence. A significant pattern carries high amount of information, and it has many non-overlapping repeats. We propose for this problem, a method that takes as input an initial specification for a repeating pattern. A pattern is specified by a sequence of letters separated by varying length wildcards. The method presents to the user maximal occurrences for the current pattern specification in a way that no text symbol can be shared as a letter by two different pattern occurrences. This reduces the begin-end position-overlaps among different occurrences. The user modifies the specification manually to eliminate overlapping repeats. This process continues until a specification for a significant pattern is obtained.
机译:基因组中的重复序列是表明蛋白质结合等重要生物学功能的结构。它们与各种遗传疾病有关。我们考虑在给定序列中找到关于“重要”重复模式的规范的问题。显着的模式具有大量信息,并且它具有许多非重叠重复。我们提出了这个问题,一种方法,它是输入重复模式的初始规范。模式由由不同长度通配符分隔的一系列字母指定。该方法以用户为由两个不同模式出现作为字母作为字母共享的方式向用户提出最大误差。这减少了不同出现之间的开始端位置重叠。用户手动修改规范以消除重叠重复。该过程继续,直到获得了显着模式的规范。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号