首页> 外文期刊>Journal of physics, A. Mathematical and theoretical >Symbolic complexity for nucleotide sequences: a sign of the genome structure
【24h】

Symbolic complexity for nucleotide sequences: a sign of the genome structure

机译:核苷酸序列的符号复杂性:基因组结构的标志

获取原文
获取原文并翻译 | 示例
           

摘要

We introduce a method for estimating the complexity function (which counts the number of observable words of a given length) of a finite symbolic sequence, which we use to estimate the complexity function of coding DNA sequences for several species of the Hominidae family. In all cases, the obtained symbolic complexities show the same characteristic behavior: exponential growth for small word lengths, followed by linear growth for larger word lengths. The symbolic complexities of the species we consider exhibit a systematic trend in correspondence with the phylogenetic tree. Using our method, we estimate the complexity function of sequences obtained by some known evolution models, and in some cases we observe the characteristic exponential-linear growth of the Hominidae coding DNA complexity. Analysis of the symbolic complexity of sequences obtained from a specific evolution model points to the following conclusion: linear growth arises from the random duplication of large segments during the evolution of the genome, while the decrease in the overall complexity from one species to another is due to a difference in the speed of accumulation of point mutations.
机译:我们介绍了一种估计有限符号序列的复杂度函数(计算给定长度的可观察单词数)的方法,该方法用于估计人科几类物种的DNA序列的复杂度函数。在所有情况下,所获得的符号复杂度都表现出相同的特征行为:较小字长的指数增长,然后较大字长的线性增长。我们认为该物种的象征复杂性表现出与系统发生树相对应的系统趋势。使用我们的方法,我们估计通过某些已知的进化模型获得的序列的复杂度函数,并且在某些情况下,我们观察到了编码DNA复杂性的人科的特征性指数线性增长。对从特定进化模型获得的序列的符号复杂性的分析得出以下结论:线性增长是由于基因组进化过程中大片段的随机复制而引起的,而整体复杂性的降低是由于一种物种向另一种物种的降低点突变积累速度的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号