首页> 美国卫生研究院文献>Applied and Environmental Microbiology >Operon Prediction for Sequenced Bacterial Genomes without Experimental Information
【2h】

Operon Prediction for Sequenced Bacterial Genomes without Experimental Information

机译:没有实验信息的细菌序列基因组的Operon预测。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Various computational approaches have been proposed for operon prediction, but most algorithms rely on experimental or functional data that are only available for a small subset of sequenced genomes. In this study, we explored the possibility of using phylogenetic information to aid in operon prediction, and we constructed a Bayesian hidden Markov model that incorporates comparative genomic data with traditional predictors, such as intergenic distances. The prediction algorithm performs as well as the best previously reported method, with several significant advantages. It uses fewer data sources and so it is easier to implement, and the method is more broadly applicable than previous methods—it can be applied to essentially every gene in any sequenced bacterial genome. Furthermore, we show that near-optimal performance is easily reached with a generic set of comparative genomes and does not depend on a specific relationship between the subject genome and the comparative set. We applied the algorithm to the Bacillus anthracis genome and found that it successfully predicted all previously verified B. anthracis operons. To further test its performance, we chose a predicted operon (BA1489-92) containing several genes with little apparent functional relatedness and tested their cotranscriptional nature. Experimental evidence shows that these genes are cotranscribed, and the data have interesting implications for B. anthracis biology. Overall, our findings show that this algorithm is capable of highly sensitive and accurate operon prediction in a wide range of bacterial genomes and that these predictions can lead to the rapid discovery of new functional relationships among genes.
机译:已经提出了各种计算方法用于操纵子预测,但是大多数算法依赖于实验或功能数据,这些数据仅可用于一小部分测序的基因组。在这项研究中,我们探索了使用系统发育信息来辅助操纵子预测的可能性,并构建了贝叶斯隐马尔可夫模型,该模型将比较基因组数据与传统预测因子(例如基因间距离)相结合。预测算法的性能与以前报告的最佳方法一样好,具有许多明显的优点。它使用较少的数据源,因此更易于实现,并且该方法比以前的方法具有更广泛的适用性-它基本上可以应用于任何已测序细菌基因组中的每个基因。此外,我们表明,用比较基因组的通用集很容易达到近乎最佳的性能,而不取决于主题基因组和比较基因组之间的特定关系。我们将该算法应用于炭疽芽孢杆菌基因组,发现它成功地预测了所有先前验证过的炭疽芽孢杆菌操纵子。为了进一步测试其性能,我们选择了一个预测的操纵子(BA1489-92),该操纵子包含几个几乎没有明显的功能相关性的基因,并测试了它们的共转录性质。实验证据表明这些基因是共转录的,这些数据对炭疽芽孢杆菌生物学具有重要意义。总体而言,我们的发现表明,该算法能够在多种细菌基因组中进行高度敏感和准确的操纵子预测,并且这些预测可以导致快速发现基因之间的新功能关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号