...
首页> 外文期刊>Journal of computational biology: A journal of computational molecular cell biology >Gapped permutation pattern discovery for gene order comparisons
【24h】

Gapped permutation pattern discovery for gene order comparisons

机译:缺口排列模式发现,用于基因顺序比较

获取原文
获取原文并翻译 | 示例
           

摘要

The list of species whose complete DNA sequence have been read is growing steadily, and it is believed that comparative genomics is in its early days. Permutations patterns (groups of genes in some "close" proximity) on gene sequences of genomes across species is being studied under different models, to cope with this explosion of data. The challenge is to (intelligently and efficiently) analyze the genomes in the context of other genomes. In this paper, we present a generalized model that uses three notions, gapped permutation patterns (with gap g), genome clusters, via quorum, K > 1, parameter, and, possible multiplicity in the patterns. The task is to automatically discover all permutation patterns (with possible multiplicity), that occur with gap g in at least K of the givenmgenomes. We present O(log mN(I) + vertical bar Sigma vertical bar log vertical bar Sigma vertical bar N-O) time algorithm where m is the number of sequences, each defined on Sigma, N-I is the size of the input and N-O is the size of the maximal gene clusters that appear in at least K of them genomes.
机译:完整的DNA序列已被阅读的物种列表正在稳步增长,据信比较基因组学处于早期。为了应对这种爆炸性的数据发展,正在不同的模型下研究物种间基因组基因序列上的排列模式(一些“紧密”接近的基因组)。挑战在于(智能高效地)在其他基因组的背景下分析基因组。在本文中,我们提出了一个通用模型,该模型使用三个概念,带间隙的排列模式(带间隙g),基因组簇(通过定额,K> 1,参数)以及模式中的可能多重性。任务是自动发现所有排列模式(可能具有多重性),这些排列模式至少在给定基因组的K个间隔g中出现。我们提出O(log mN(I)+垂直条Sigma垂直条log对数垂直条Sigma垂直条NO)时间算法,其中m是序列数,每条序列在Sigma上定义,NI是输入的大小,NO是大小至少有K个基因组中出现的最大基因簇中的至少一个。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号