Source: EURASIP Journal on Applied Signal Processing

Segmentation of DNA into Coding and Noncoding Regions Based on Recursive Entropic Segmentation and Stop-Codon Statistics

Abstract

Heterogeneous DNA sequences can be partitioned into homogeneous domains that are composed of the four nucleotides A, C, G, and T and the stop codons. We recursively apply a new entropic segmentation method to DNA sequences, using the Jensen-Shannon and Jensen-Renyi divergences, in order to find the borders between coding and noncoding DNA regions. We have chosen 12- and 18-symbol alphabets that capture (i) the differential nucleotide composition in codons and (ii) the differential stop-codon composition along all three phases in both strands of the DNA. The new segmentation method is based on the Jensen-Renyi divergence measure, nucleotide statistics, and stop-codon statistics in both DNA strands. The recursive segmentation process requires no prior training on known datasets. For three entire bacterial genomes, we find that the use of nucleotide composition, stop-codon composition, and the Jensen-Renyi divergence improves the accuracy of finding the borders between coding and noncoding regions in DNA sequences.
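
As a rough illustration of the general idea of recursive entropic segmentation, the sketch below cuts a symbol sequence at the position that maximizes a length-weighted Jensen-Shannon divergence between the two sides, then recurses on each half. It is not the authors' implementation. The function names, the plain four-letter alphabet, the fixed `threshold` stopping rule, and the `min_len` guard are all assumptions made for illustration; the abstract does not specify how the 12- and 18-symbol alphabets are built or when recursion stops. Passing a Renyi entropy in place of the Shannon entropy gives a Jensen-Renyi variant of the same measure.

```python
import math
from collections import Counter

def shannon_entropy(counts, total):
    """Shannon entropy (bits) of a table of symbol counts."""
    return -sum((c / total) * math.log2(c / total)
                for c in counts.values() if c > 0)

def renyi_entropy(counts, total, alpha=0.5):
    """Renyi entropy (bits) of order alpha (alpha != 1) of a count table."""
    s = sum((c / total) ** alpha for c in counts.values() if c > 0)
    return math.log2(s) / (1.0 - alpha)

def divergence(left, right, entropy=shannon_entropy):
    """Length-weighted Jensen-type divergence between two count tables.

    With shannon_entropy this is the Jensen-Shannon divergence; with
    renyi_entropy it is a Jensen-Renyi divergence.
    """
    n_left, n_right = sum(left.values()), sum(right.values())
    n = n_left + n_right
    merged = left + right  # Counter addition keeps only positive counts
    return (entropy(merged, n)
            - (n_left / n) * entropy(left, n_left)
            - (n_right / n) * entropy(right, n_right))

def best_split(seq, entropy=shannon_entropy):
    """Return (cut position, divergence) for the most divergent single cut."""
    left, right = Counter(), Counter(seq)
    best_pos, best_div = None, 0.0
    for i in range(1, len(seq)):
        # Move one symbol from the right segment to the left segment.
        left[seq[i - 1]] += 1
        right[seq[i - 1]] -= 1
        d = divergence(left, right, entropy)
        if d > best_div:
            best_pos, best_div = i, d
    return best_pos, best_div

def segment(seq, offset=0, threshold=0.1, min_len=10, entropy=shannon_entropy):
    """Recursively cut seq; return sorted cut positions in original coordinates.

    threshold and min_len are illustrative stopping rules (assumptions).
    """
    if len(seq) < 2 * min_len:
        return []
    pos, div = best_split(seq, entropy)
    if pos is None or div < threshold:
        return []
    return (segment(seq[:pos], offset, threshold, min_len, entropy)
            + [offset + pos]
            + segment(seq[pos:], offset + pos, threshold, min_len, entropy))

# Toy usage: two compositionally distinct halves.
borders = segment("A" * 50 + "C" * 50)
```

On this toy input the two halves have disjoint compositions, so the only cut found lies at their junction. Real genomic data would need the richer alphabets described in the abstract and a principled stopping criterion rather than a hand-picked threshold.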