首页> 外文期刊>Mathematics in Computer Science >Non-overlapping Common Substrings Allowing Mutations
【24h】

Non-overlapping Common Substrings Allowing Mutations

机译:非重叠的通用子字符串,允许突变

获取原文
获取原文并翻译 | 示例
       

摘要

This paper studies several combinatorial problems arising from finding the conserved genes of two genomes (i.e., the entire DNA of two species). The input is a collection of n maximal common substrings of the two genomes. The problem is to find, based on different criteria, a subset of such common substrings with maximum total length. The most basic criterion requires that the common substrings selected have the same ordering in the two genomes and they do not overlap among themselves in either genome. To capture mutations (transpositions and reversals) between the genomes, we do not insist the substrings selected to have the same ordering. Conceptually, we allow one ordering to go through some mutations to become the other ordering. If arbitrary mutations are allowed, the problem of finding a maximum-length, non-overlapping subset of substrings is found to be NP-hard. However, arbitrary mutations probably overmodel the problem and are likely to find more noise than conserved genes. We consider two criteria that attempt to model sparse and non-overlapping mutations. We show that both can be solved in polynomial time using dynamic programming.
机译:本文研究了由于发现两个基因组的保守基因(即两个物种的整个DNA)而引起的几个组合问题。输入是两个基因组的n个最大公共子串的集合。问题是要根据不同的标准找到具有最大总长度的此类常见子串的子集。最基本的标准要求选择的公共子串在两个基因组中具有相同的顺序,并且它们在任何一个基因组中都不相互重叠。为了捕获基因组之间的突变(转座和逆转),我们不坚持选择的子串具有相同的顺序。从概念上讲,我们允许一个命令经历一些突变而变成另一个命令。如果允许任意突变,则发现找到最大长度,不重叠的子串子集的问题被认为是NP困难的。但是,任意突变都可能使问题过大,并且可能比保守基因发现更多的噪音。我们考虑尝试对稀疏和非重叠突变建模的两个标准。我们证明了两者都可以使用动态规划在多项式时间内求解。

著录项

  • 来源
    《Mathematics in Computer Science》 |2008年第4期|543-555|共13页
  • 作者单位

    Department of Computer Science University of Pittsburgh Room 6406 Sennott Square Building 210 South Bouquet Street Pittsburgh PA 15260 USA;

    Department of Computer Science The University of Hong Kong Chow Yei Ching Building Pokfulam Road Hong Kong China;

    Department of Computer Science School of Computing National University of Singapore Computing 1 Law Link Singapore 117590 Republic of Singapore;

    Department of Computer Science The University of Liverpool Ashton Building Ashton Street Liverpool L69 3BX United Kingdom;

    Department of Computer Science The University of Hong Kong Chow Yei Ching Building Pokfulam Road Hong Kong China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Primary 92-08; Secondary 68W05; Whole genome alignment; conserved genes; mutations; algorithms;

    机译:初级92-08;次级68W05;整个基因组比对;保守基因;突变;算法;
  • 入库时间 2022-08-18 02:27:41

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号