...
首页> 外文期刊>Algorithms for Molecular Biology >Efficient algorithms for analyzing segmental duplications with deletions and inversions in genomes
【24h】

Efficient algorithms for analyzing segmental duplications with deletions and inversions in genomes

机译:高效的算法,用于分析具有基因组中缺失和倒置的片段重复

获取原文

摘要

Background Segmental duplications, or low-copy repeats, are common in mammalian genomes. In the human genome, most segmental duplications are mosaics comprised of multiple duplicated fragments. This complex genomic organization complicates analysis of the evolutionary history of these sequences. One model proposed to explain this mosaic patterns is a model of repeated aggregation and subsequent duplication of genomic sequences. Results We describe a polynomial-time exact algorithm to compute duplication distance, a genomic distance defined as the most parsimonious way to build a target string by repeatedly copying substrings of a fixed source string. This distance models the process of repeated aggregation and duplication. We also describe extensions of this distance to include certain types of substring deletions and inversions. Finally, we provide a description of a sequence of duplication events as a context-free grammar (CFG). Conclusion These new genomic distances will permit more biologically realistic analyses of segmental duplications in genomes.
机译:背景片段重复或低拷贝重复在哺乳动物基因组中很常见。在人类基因组中,大多数节段重复是由多个重复片段组成的镶嵌图。这种复杂的基因组组织使这些序列的进化历史分析变得复杂。提出用来解释这种镶嵌图案的一个模型是重复聚集和随后重复基因组序列的模型。结果我们描述了一种用于计算复制距离的多项式时间精确算法,这是一种基因组距离,被定义为通过重复复制固定源字符串的子字符串来构建目标字符串的最简约方法。此距离模拟了重复聚合和复制的过程。我们还描述了此距离的扩展,以包括某些类型的子字符串删除和倒置。最后,我们将复制事件序列描述为无上下文语法(CFG)。结论这些新的基因组距离将允许对基因组中的节段重复进行更生物学的现实分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号