首页> 外文会议>Annual Symposium on Combinatorial Pattern Matching(CPM 2006); 20060705-07; Barcelona(ES) >A Compact Mathematical Programming Formulation for DNA Motif Finding
【24h】

A Compact Mathematical Programming Formulation for DNA Motif Finding

机译:DNA主题查找的紧凑数学编程公式

获取原文
获取原文并翻译 | 示例

摘要

In the motif finding problem one seeks a set of mutually similar subsequences within a collection of biological sequences. This is an important and widely-studied problem, as such shared motifs in DNA often correspond to regulatory elements. We study a combinatorial framework where the goal is to find subsequences of a given length such that the sum of their pairwise distances is minimized. We describe a novel integer linear program for the problem, which uses the fact that distances between subsequences come from a limited set of possibilities. We show how to tighten its linear programming relaxation by adding an exponential set of constraints and give an efficient separation algorithm that can find violated constraints, thereby showing that the tightened linear program can still be solved in polynomial time. We apply our approach to find optimal solutions for the motif finding problem and show that it is effective in practice in uncovering known transcription factor binding sites.
机译:在发现主题的问题中,人们在一系列生物序列中寻找一组相互相似的子序列。这是一个重要且广泛研究的问题,因为DNA中的此类共享基序通常对应于调控元件。我们研究了一种组合框架,其中的目标是找到给定长度的子序列,以使它们的成对距离之和最小。我们描述了一个新颖的整数线性程序,使用了子序列之间的距离来自有限可能性的事实。我们展示了如何通过添加一组指数约束来加强其线性规划松弛,并给出了一种有效的分离算法,该算法可以找到违反的约束,从而显示出仍然可以在多项式时间内求解该加强的线性程序。我们应用我们的方法来找到针对基序发现问题的最佳解决方案,并表明它在实践中有效地发现了已知的转录因子结合位点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号