Conference: Bioinformatics research and applications

Algorithms for Rapid Error Correction for the Gene Duplication Problem

Abstract

Gene tree-species tree reconciliation problems infer the patterns and processes of gene evolution within the context of an organismal phylogeny. In one example, the gene duplication problem seeks the evolutionary scenario that implies the minimum number of gene duplications needed to reconcile a gene tree and a species tree. While the gene duplication problem can effectively link gene and species evolution, error in gene trees can profoundly bias the results. We describe novel algorithms that rapidly search local Subtree Prune and Regraft (SPR) or Tree Bisection and Reconnection (TBR) neighborhoods of a gene tree to find a topology that implies the fewest duplications. These algorithms improve on the current solutions by a factor of n for searching SPR neighborhoods and n² for searching TBR neighborhoods, where n is the number of vertices in the given gene tree. They provide a fast error correction protocol for gene trees, in which we allow small gene tree rearrangements to improve the reconciliation cost. We tested the SPR tree rearrangement algorithm on a collection of 1201 plant gene trees, and in every case, the SPR algorithm identified an alternate topology that implied at least one fewer duplication. We also demonstrate a simple method to use the gene rearrangement algorithm to improve gene tree parsimony phylogenetic analyses, which infer a species tree based on the gene duplication problem.
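
As a rough illustration of the reconciliation cost the abstract refers to, the sketch below counts duplications with the standard least-common-ancestor (LCA) mapping: each gene tree node is mapped to the smallest species tree clade containing its species, and a node counts as a duplication when it maps to the same clade as one of its children. This is not the paper's fast SPR/TBR search. The nested-tuple tree encoding, the function names (`clades`, `lca`, `duplications`), and the convention that gene leaves are labelled directly with species names are all assumptions made for this example.

```python
# Minimal sketch of the duplication cost under the LCA mapping.
# Assumptions: trees are nested tuples, leaves are strings, and every
# gene-tree leaf is labelled with the name of the species it came from.

def clades(tree):
    """Return the leaf set of every node; the last entry is the tree's own leaf set."""
    if isinstance(tree, str):
        return [frozenset([tree])]
    result = []
    leaves = frozenset()
    for child in tree:
        sub = clades(child)
        result.extend(sub)
        leaves |= sub[-1]  # the child's full leaf set
    result.append(leaves)
    return result


def lca(species_clades, species):
    """Smallest species-tree clade that contains every member of `species`."""
    return min((c for c in species_clades if species <= c), key=len)


def duplications(gene_tree, species_tree):
    """Count gene-tree nodes that map to the same clade as one of their children."""
    species_clades = clades(species_tree)
    count = 0

    def visit(node):
        nonlocal count
        if isinstance(node, str):
            return lca(species_clades, frozenset([node]))
        child_maps = [visit(child) for child in node]
        mapped = lca(species_clades, frozenset().union(*child_maps))
        if mapped in child_maps:
            count += 1  # node and a child share a mapping: duplication
        return mapped

    visit(gene_tree)
    return count


species_tree = (("A", "B"), "C")
print(duplications((("A", "B"), "C"), species_tree))  # 0: gene tree matches species tree
print(duplications((("A", "C"), "B"), species_tree))  # 1: misplaced leaf implies a duplication
```

The second call shows why gene tree error matters: a single misplaced leaf forces one inferred duplication, and a small rearrangement back to `(("A", "B"), "C")` removes it. A naive error-correction pass would rescore every SPR or TBR neighbor with a function like this one, which is the kind of repeated work the algorithms described above are designed to avoid.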