首页> 外文期刊>Nucleic Acids Research >DETECTION OF NEW GENES IN A BACTERIAL GENOME USING MARKOV MODELS FOR THREE GENE CLASSES
【24h】

DETECTION OF NEW GENES IN A BACTERIAL GENOME USING MARKOV MODELS FOR THREE GENE CLASSES

机译:使用马尔可夫模型对三种基因类别的细菌基因组中的新基因进行检测

获取原文
获取原文并翻译 | 示例
           

摘要

We further investigated the statistical features of the three classes of Escherichia coli genes that have been previously delineated by factorial correspondence analysis and dynamic clustering methods, A phased Markov model for a nucleotide sequence of each gene class was developed and employed for gene prediction using the GeneMark program, The protein-coding region prediction accuracy was determined for class-specific Markov models of different orders when the programs implementing these models were applied to gene sequences from the same or other classes. It is shown that at least two training sets and two program versions derived for different classes of E.coli genes are necessary in order to achieve a high accuracy of coding region prediction for uncharacterized sequences, Some annotated E.coli genes from Class I and Class III are shown to be spurious, whereas many open reading frames (ORFs) that have not been annotated in GenBank as genes are predicted to encode proteins, The amino acid sequences of the putative products of these ORFs initially did not show similarity to already known proteins. However, conserved regions have been identified in several of them by screening the latest entries in protein sequence databases and applying methods for motif search, while some other of these new genes have been identified in independent experiments.
机译:我们进一步调查了三类大肠杆菌基因的统计特征,这些特征先前已通过因子对应分析和动态聚类方法进行了描述,针对每个基因类别的核苷酸序列建立了阶段性马尔可夫模型,并使用GeneMark进行了基因预测当将实现这些模型的程序应用于来自相同或其他类别的基因序列时,确定了针对不同阶次的特定类马尔可夫模型的蛋白质编码区预测准确性。结果表明,至少有两个针对不同类别的大肠杆菌基因而衍生的训练集和两个程序版本是必需的,以实现对未表征序列的编码区域预测的高精度。来自I类和Class的一些带注释的E.coli基因III被证明是伪造的,而许多开放阅读框(ORF)尚未在GenBank中注释,因为预测基因会编码蛋白质。这些ORF的推定产物的氨基酸序列最初并未显示出与已知蛋白质的相似性。但是,通过筛选蛋白质序列数据库中的最新条目并应用基序搜索方法,已经在其中几个中鉴定了保守区,而这些新基因中的其他一些则是在独立实验中鉴定出来的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号