【24h】

Identification of Distinguishing Motifs

机译:辨别主题

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Motivation: Motif identification for sequences has many important applications in biological studies, e.g., diagnostic probe design, locating binding sites and regulatory signals, and potential drug target identification. There are two versions. 1. Single Group: Given a group of n sequences, find a length-l motif that appears in each of the given sequences and those occurrences of the motif are similar. 2. Two Groups: Given two groups of sequences B and G, find a length-l (distinguishing) motif that appears in every sequence in B and does not appear in anywhere of the sequences in G. Here the occurrences of the motif in the given sequences have errors. Currently, most of existing programs can only handle the case of single group. Moreover, it is very difficult to use edit distance (allowing indels and replacements) for motif detection. Results: (1) We propose a randomized algorithm for the one group problem that can handle indels in the occurrences of the motif. (2) We give an algorithm for the two groups problem. (3) Extensive simulations have been done to evaluate the algorithms.
机译:动机:序列的基序鉴定在生物学研究中具有许多重要应用,例如诊断探针设计,定位结合位点和调控信号以及潜在的药物靶标鉴定。有两个版本。 1.单个组:给定一组n个序列,找到出现在每个给定序列中的长度为1的基序,并且这些基序的出现类似。 2.两组:给定两组序列B和G,找到一个长度为1(可区分)的基序,该基序出现在B的每个序列中,而不出现在G的任何序列中。在这里,基序的出现给定的序列有错误。当前,大多数现有程序只能处理单个组的情况。而且,很难将编辑距离(允许插入和替换)用于图案检测。结果:(1)我们针对一组问题提出了一种随机算法,该算法可以处理主题出现中的插入缺失。 (2)给出了两组问题的算法。 (3)已经进行了广泛的仿真以评估算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号