【24h】

Genomic Signatures from DNA Word Graphs

机译:DNA字图的基因组签名

获取原文
获取原文并翻译 | 示例

摘要

Genomes have both deterministic and random aspects, with the underlying DNA sequences exhibiting features at numerous scales, from codons and cis-elements through genes and on to regions of conserved or divergent gene order. The DNA Words program aims to identify mathematical structures that characterize genomes at multiple scales. The focus of this work is the fine structure of genomic sequences, the manner in which short nucleotide sequences fit together to comprise the genome as an abstract sequence, within a graph-theoretic setting. A DNA word graph is a generalization of a de Bruijn graph that records the occurrence counts of node and edges in a genomic sequence. A DNA word graph can be derived from a genomic sequence generated by a finite Markov chain or a subsequence of a sequenced genome. Both theoretically and empirically, DNA word graphs give rise to genomic signatures. Several genomic signatures are derived from the structure of a DNA word graph, including an information-rich and visually appealing genomic bar code. Application of genomic signatures to several genomes demonstrate their practical value in identifying and distinguishing genomic sequences.
机译:基因组既具有确定性又具有随机性,其基本的DNA序列在众多规模上都具有特征,从密码子和顺式元件到基因,再到保守或发散的基因顺序区域。 DNA Words程序旨在识别可在多个尺度上表征基因组的数学结构。这项工作的重点是基因组序列的精细结构,即在图论理论环境中短核苷酸序列相互配合以组成基因组作为抽象序列的方式。 DNA字图是de Bruijn图的概括,它记录了基因组序列中节点和边的出现次数。 DNA字图可以从有限马尔可夫链或测序基因组的子序列产生的基因组序列中得出。在理论上和经验上,DNA字图都产生了基因组特征。 DNA字图的结构衍生出几个基因组特征,包括信息量丰富且视觉上吸引人的基因组条形码。将基因组签名应用于多个基因组证明了其在鉴定和区分基因组序列中的实用价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号