...
【24h】

Finding subtle motifs with variable gaps in unaligned DNA sequences.

机译:在未比对的DNA序列中发现具有可变缺口的细微图案。

获取原文
获取原文并翻译 | 示例

摘要

Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.
机译:生物学家已经确定,基因表达的控制和调节主要由围绕基因的区域中相对较短的序列决定。这些序列的长度,位置,冗余度,方向和碱基都不同。发现这些短序列是分子生物学中具有重要应用的基本问题。尽管存在许多不同的信号发现方法(即短序列),但一些新的研究表明此问题仍然留有很大的改进空间。在2000年,Pevzner和Sze提出了主题检测的质询问题。他们报告说,大多数当前的主题查找算法均无法检测其“挑战问题”中的目标主题。在本文中,我们证明了使用迭代重启设计,我们的新算法可以正确找到目标图案。此外,考虑到某些转录因子形成二聚体或什至更复杂的结构,并且转录过程有时可能涉及多个因子,而它们之间存在可变间隔,因此我们通过解决以下问题将原始问题扩展到一个更具挑战性的问题具有可变长度间隙的组合信号。为了证明我们算法的有效性,我们在一系列新的挑战性问题以及实际规则上对其进行了测试,并将其与一些当前的代表性主题发现算法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号