首页> 外文期刊>Nucleic Acids Research >GeneMark.hmm: new solutions for gene finding.
【24h】

GeneMark.hmm: new solutions for gene finding.

机译:GeneMark.hmm:基因发现的新解决方案。

获取原文
获取原文并翻译 | 示例
           

摘要

The number of completely sequenced bacterial genomes has been growing fast. There are computer methods available for finding genes but yet there is a need for more accurate algorithms. The GeneMark. hmm algorithm presented here was designed to improve the gene prediction quality in terms of finding exact gene boundaries. The idea was to embed the GeneMark models into naturally derived hidden Markov model framework with gene boundaries modeled as transitions between hidden states. We also used the specially derived ribosome binding site pattern to refine predictions of translation initiation codons. The algorithm was evaluated on several test sets including 10 complete bacterial genomes. It was shown that the new algorithm is significantly more accurate than GeneMark in exact gene prediction. Interestingly, the high gene finding accuracy was observed even in the case when Markov models of order zero, one and two were used. We present the analysis of false positive and false negative predictions with the caution that these categories are not precisely defined if the public database annotation is used as a control.
机译:完全测序的细菌基因组的数量一直在快速增长。有用于发现基因的计算机方法,但是仍需要更精确的算法。 GeneMark。本文介绍的hmm算法旨在根据精确的基因边界来提高基因预测质量。想法是将GeneMark模型嵌入到自然派生的隐式马尔可夫模型框架中,其基因边界被建模为隐性状态之间的转换。我们还使用了特殊衍生的核糖体结合位点模式来完善翻译起始密码子的预测。在包括10个完整细菌基因组的几个测试集上评估了该算法。结果表明,在精确的基因预测中,新算法比GeneMark准确得多。有趣的是,即使在使用零阶,一阶和二阶的马尔可夫模型的情况下,也观察到了很高的基因发现准确性。我们在对错误肯定和错误否定预测进行分析时要谨慎,如果将公共数据库注释用作控件,则无法准确定义这些类别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号