首页> 外文会议>Asia-Pacific Bioinformatics Conference >Integrating experimental and literature protein-protein interaction data for protein complex prediction
【24h】

Integrating experimental and literature protein-protein interaction data for protein complex prediction

机译:整合蛋白质复杂预测的实验和文献蛋白质 - 蛋白质相互作用数据

获取原文

摘要

Background: Accurate determination of protein complexes is crucial for understanding cellular organization and function. High-throughput experimental technigues have generated a large amount of protein-protein interaction (PPI) data, allowing prediction of protein complexes from PPI networks. However, the high-throughput data often includes false positives and false negatives, making accurate prediction of protein complexes difficult.Method: The biomedical literature contains large guantities of PPI data that, along with high-throughput experimental PPI data, are valuable for protein complex prediction. In this study, we employ a natural language processing technigue to extract PPI data from the biomedical literature. This data is subseguently integrated with high-throughput PPI and gene ontology data by constructing attributed PPI networks, and a novel method for predicting protein complexes from the attributed PPI networks is proposed. This method allows calculation of the relative contribution of high-throughput and biomedical literature PPI data.Results: Many well-characterized protein complexes are accurately predicted by this method when apply to two different yeast PPI datasets. The results show that (i) biomedical literature PPI data can effectively improve the performance of protein complexprediction; (ii) our method makes good use of high-throughput and biomedical literature PPI data along with gene ontology data to achieve state-of-the-art protein complex prediction capabilities.
机译:背景:精确测定蛋白质复合物对于理解细胞组织和功能至关重要。高通量实验技术产生了大量的蛋白质 - 蛋白质相互作用(PPI)数据,从而允许预测来自PPI网络的蛋白质复合物。然而,高通量数据通常包括误报和假阴性,使得精确预测蛋白质复合物困难。方法:生物医学文献含有大规模的PPI数据,以及高通量实验PPI数据,对蛋白质复合物有价值预言。在本研究中,我们采用自然语言处理技术来从生物医学文献中提取PPI数据。通过构建归属PPI网络,该数据具有高通量PPI和基因本体数据,提出了一种从归属PPI网络预测蛋白质复合物的新方法。该方法允许计算高通量和生物医学文献PPI Data的相对贡献结果表明,(i)生物医学文献PPI数据可以有效提高蛋白质复合规范的性能; (ii)我们的方法良好地利用高通量和生物医学文献PPI数据以及基因本体数据,以实现最先进的蛋白质复杂预测能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号