首页> 外文会议>International conference on bioinformatics and computational biology >Algorithm for finding coding signal using homogeneous Markov chains independently for three codon positions
【24h】

Algorithm for finding coding signal using homogeneous Markov chains independently for three codon positions

机译:用于三个密码子位置的均质马尔可夫链独立查找编码信号的算法

获取原文

摘要

Many currently used algorithms for protein coding sequences require large learning sets of true genes to estimate sensible values for used parameters which are necessary to make the prediction reasonable. They also fail in recognition of short genes which usually contain weak coding signal. To avoid these problems, we worked out a new algorithm for finding protein coding potential in prokaryotic genomes. This algorithm uses homogeneous Markov chain for modeling nucleotide transition between fixed positions in codons thereby reduces order of Markov chain retaining simultaneously information on dependence between nucleotides in sequence on relatively long distances. We tested performance of this algorithm in relationship to size of the learning set with true and false positive rates for different model orders. We also made some comparisons between our algorithm and commonly used GeneMark. The presented algorithm works better especially for smaller learning sets.
机译:当前许多用于蛋白质编码序列的算法需要大量的真实基因学习集,以估计所用参数的合理值,这对于使预测变得合理是必不可少的。它们也不能识别通常包含弱编码信号的短基因。为了避免这些问题,我们设计了一种新的算法来寻找原核生物基因组中的蛋白质编码潜力。该算法使用均质马尔可夫链对密码子固定位置之间的核苷酸过渡进行建模,从而降低了马尔可夫链的顺序,同时保留了相对较长距离上核苷酸之间依存关系的信息。我们测试了该算法的性能与学习集大小之间的关系,以及不同模型阶次的正确率和错误率。我们还对算法和常用的GeneMark进行了比较。提出的算法尤其适用于较小的学习集,效果更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号