首页> 外文会议>International conference on language and automata theory and applications >Duplication-Loss Genome Alignment: Complexity and Algorithm
【24h】

Duplication-Loss Genome Alignment: Complexity and Algorithm

机译:复制-丢失基因组比对:复杂性和算法

获取原文

摘要

Recently, an Alignment approach for the comparison of two genomes, based on an evolutionary model restricted to Duplications and Losses, has been presented. An exact linear programming algorithm has been developed and successfully applied to the Transfer RNA (tRNA) repertoire in Bacteria, leading to interesting observation on tRNA shift of identity. Here, we explore a direct dynamic programming approach for the Duplication-Loss Alignment of two genomes, which proceeds in two steps: (1) (The Dynamic Programming step) Outputs a best candidate alignment between the two genomes and (2) (Minimum Label Alignment problem) Finds an evolutionary scenario of minimum duplication-loss cost that is in agreement with the alignment. We show that the Minimum Label Alignment is APX-hard, even if the number of occurrences of a gene inside a genome is bounded by 5. We then develop a heuristic which is a thousands of times faster than the linear programming algorithm and exhibits a high degree of accuracy on simulated datasets. The heuristic has been implemented in JAVA and is available on request.
机译:近来,已经提出了一种用于比对两个基因组的比较的比对方法,其基于限于复制和损失的进化模型。已开发出一种精确的线性编程算法,并将其成功应用于细菌中的转移RNA(tRNA)库,从而导致对tRNA身份转移的有趣观察。在这里,我们探索了两个基因组的重复-丢失比对的直接动态编程方法,该过程分两个步骤进行:(1)(动态编程步骤)输出两个基因组之间的最佳候选比对和(2)(最小标记)一致性问题)找到与一致性一致的最小重复损失成本的演化方案。我们证明最小标记比对是APX困难的,即使基因组内一个基因的出现次数受5限制。我们然后开发了一种启发式算法,其速度比线性编程算法快数千倍,并且显示出很高的仿真数据集的准确度。该启发式方法已在JAVA中实现,可应要求提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号