首页> 外文会议>Annual international conference on computing and combinatorics >Genomic Distances under Deletions and Insertions
【24h】

Genomic Distances under Deletions and Insertions

机译:缺失和插入下的基因组距离

获取原文

摘要

As more and more genomes are sequenced, evolutionary biologists are becoming increasingly interested in evolution at the level of whole genomes, in scenarios in which the genome evolves through insertions, deletions, and movements of genes along its chromosomes. In the mathematical model pioneered by Sankoff and others, a unichromosomal genome is represented by a signed permutation of a multi-set of genes; Hannenhalli and Pevzner showed that the edit distance between two signed permutations of the same set can be computed in polynomial time when all operations are inversions. El-Mabrouk extended that result to allow deletions and a limited form of insertions (which forbids duplications). In this paper we extend El-Mabrouk's work to handle duplications as well as insertions and present an alternate framework for computing (near) minimal edit sequences involving insertions, deletions, and inversions. We derive an error bound for our polynomial-time distance computation under various assumptions and present preliminary experimental results that suggest that performance in practice may be excellent, within a few percent of the actual distance.
机译:随着越来越多的基因组被测序,进化生物学家对整个基因组水平的进化越来越感兴趣,在这种情况下,基因组通过涉及其染色体的基因的插入,缺失和运动的运动而发展。在Sankoff和其他人开创的数学模型中,通过多组基因的签名排列来表示单体形式基因组; Hannenhalli和Pevzner显示,当所有操作都是逆时时,可以在多项式时间内计算相同集合的两个签名排列之间的编辑距离。 EL-Mabrouk扩展了该结果,以允许删除和有限的插入形式(禁止重复)。在本文中,我们扩展了EL-Mabrouk的工作,以处理重复以及插入,并呈现涉及插入,删除和逆势的最小编辑序列的替代框架。我们在各种假设下的多项式距离计算界定了一个错误,并提出了初步实验结果,表明实践中的性能可能是优异的,在几个实际距离的几个百分比内。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号