
Genetic signal representation and analysis

Abstract

An original tetrahedral representation of the Genetic Code (GC) is defined that better captures its structure, degeneracy and evolutionary trends. The possibility of reducing the dimensionality of the description by projecting the GC tetrahedron onto a suitably oriented plane is also considered, leading to complex representations of the GC. On this basis, optimal symbolic-to-digital mappings of the linear, one-dimensional and one-directional strands of nucleic acids into real or complex genetic signals are derived at the nucleotide, codon and amino acid levels. By converting sequences of nucleotides and polypeptides into digital genetic signals, this approach makes it possible to apply a wide variety of signal processing methods to their analysis. It is also shown that some essential features of nucleotide sequences can be extracted more effectively using this representation. Some preliminary results from a comparative analysis of the statistical properties of intragenic vs. intergenic genetic signals are also presented. The use of Independent Component Analysis (ICA) to search for control sequences in intergenic DNA, i.e., the part of the genome that does not encode proteins, is suggested.
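
As a rough illustration of the nucleotide-level mapping described above, the following sketch places the four nucleotides at the vertices of a regular tetrahedron and then projects them onto a plane to obtain a complex signal. The abstract does not give the vertex coordinates or the orientation of the projection plane, so the particular assignment in `TETRA`, the choice of the x-y plane, and the function names `to_tetrahedral_signal` and `to_complex_signal` are all assumptions made for illustration only, not the mapping derived in the paper.

```python
import numpy as np

# Assumed vertex assignment (not taken from the paper): four alternating
# corners of a cube, which form a regular tetrahedron centred at the origin.
TETRA = {
    "A": (1.0, 1.0, 1.0),
    "C": (-1.0, 1.0, -1.0),
    "G": (-1.0, -1.0, 1.0),
    "T": (1.0, -1.0, -1.0),
}


def to_tetrahedral_signal(seq):
    """Map a nucleotide string to an (N, 3) array of tetrahedron vertices."""
    points = [TETRA[base] for base in seq.upper()]
    return np.array(points, dtype=float).reshape(-1, 3)


def to_complex_signal(seq):
    """Project the 3-D signal onto the x-y plane (an assumed orientation)
    and read the two remaining coordinates as real and imaginary parts."""
    vectors = to_tetrahedral_signal(seq)
    return vectors[:, 0] + 1j * vectors[:, 1]


if __name__ == "__main__":
    signal = to_complex_signal("ACGTTGCA")
    # Once the sequence is numeric, standard tools such as the DFT apply.
    spectrum = np.fft.fft(signal)
    print(signal)
    print(np.abs(spectrum))
```

With this particular projection the four nucleotides land on four distinct points of the complex plane, one per quadrant, so no information is lost at the nucleotide level; a different plane orientation would give a different, possibly degenerate, mapping.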
