...
首页> 外文期刊>Genome research >Inferring genome-wide functional linkages in E. coli by combining improved genome context methods: comparison with high-throughput experimental data.
【24h】

Inferring genome-wide functional linkages in E. coli by combining improved genome context methods: comparison with high-throughput experimental data.

机译:通过结合改进的基因组上下文方法来推断大肠杆菌中的全基因组功能连接:与高通量实验数据进行比较。

获取原文
获取原文并翻译 | 示例

摘要

Cellular functions are determined by interactions among proteins in the cells. Recognition of these interactions forms an important step in understanding biology at the systems level. Here, we report an interaction network of Escherichia coli, obtained by training a Support Vector Machine on the high quality of interactions in the EcoCyc database, and with the assumption that the periplasmic and cytoplasmic proteins may not interact with each other. The data features included correlation coefficient between bit score phylogenetic profiles, frequency of their co-occurrence in predicted operons, and a new measure--the distance between translational start sites of the genes. The combined genome context methods show a high accuracy of prediction on the test data and predict a total of 78,122 binary interactions. The majority of the interactions identified by high-throughput experimental methods correspond to indirect interaction (interactions through neighbors) in the predicted network. Correlation of the predicted network with the gene essentiality data shows that the essential genes in E. coli exhibit a high linking number, whereas the nonessential genes exhibit a low linking number. Furthermore, our predicted protein-protein interaction network shows that the proteins involved in replication, DNA repair, transcription, translation, and cell wall synthesis are highly connected. We therefore believe that our predicted network will serve as a useful resource in understanding prokaryotic biology.
机译:细胞功能由细胞中蛋白质之间的相互作用决定。对这些相互作用的认识是在系统层面理解生物学的重要一步。在这里,我们报告了大肠杆菌的相互作用网络,该网络是通过在EcoCyc数据库中对相互作用的高质量进行训练的支持向量机而获得的,并且假设周质和细胞质蛋白可能不会相互作用。数据特征包括位评分系统发育谱之间的相关系数,在预测操纵子中它们的共现频率以及一种新的度量-基因翻译起始位点之间的距离。组合的基因组上下文方法显示了对测试数据的高精度预测,并预测了总计78,122个二元相互作用。通过高通量实验方法确定的大多数交互对应于预测网络中的间接交互(通过邻居的交互)。预测网络与基因必需性数据的相关性表明,大肠杆菌中的必需基因表现出高的连接数,而非必需基因表现出低的连接数。此外,我们预测的蛋白质-蛋白质相互作用网络显示,参与复制,DNA修复,转录,翻译和细胞壁合成的蛋白质高度相关。因此,我们认为我们的预测网络将成为了解原核生物学的有用资源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号