...
首页> 外文期刊>Journal of computational and theoretical nanoscience >Sequence and Functional Annotations-Based Prediction of Protein-Protein Interactions
【24h】

Sequence and Functional Annotations-Based Prediction of Protein-Protein Interactions

机译:基于序列和函数注释的蛋白质 - 蛋白质相互作用的预测

获取原文
获取原文并翻译 | 示例

摘要

Protein-protein interactions (PPIs) are the basis of most cellular processes and biological functions, and many computational methods complementary to experimental methods have been proposed for predicting PPIs. However, two kinds of typical computational methods, namely, sequence-based and prior-knowledge-driven ones, are mainly studied separately. Consequently, the latent trait that links and complements protein sequence and prior knowledge is overlooked. Most methods also suitably perform in small genomes like S. cerevisiae but perform less suitably in large genomes such as Homo sapiens. These models have poor generalization capability as well. Therefore, a novel, effective, robust, and highly reliable method of predicting PPIs must be developed. In this paper, a protein functional annotation-based method (FNM) was presented for the first time. The method incorporated protein essentiality, domain-domain interactions, gene ontology annotations, and sequence annotations. Different fusion strategies were then used to establish three methods (JFM, ELM, and WELM) that combined protein sequence features with functional annotation features in FNM. On both datasets of the small-genome S. cerevisiae and the large-genome Homo sapiens, fusion methods demonstrated a high success rate of classification than other sequence-based methods. Finally, a yeast and human protein interaction network was obtained, and cluster analysis showed that the PPI networks were biologically meaningful, indicating that they can be used for network mining research, such as detecting protein complexes and hub proteins, searching for disease-causing proteins and drug targets, etc.
机译:蛋白质 - 蛋白质相互作用(PPI)是大多数细胞过程和生物学功能的基础,并且已经提出了许多与实验方法互补的计算方法用于预测PPI。然而,两种典型的计算方法,即基于序列和先前知识驱动的方法,主要研究。因此,潜在的潜在特征是忽视蛋白质序列和先验知识的潜在特征。大多数方法也适当地在S.酿酒酵母等小基因组中进行,但在大型基因组(如Homo Sapiens)中表现不太合适。这些型号也具有较差的泛化能力。因此,必须开发一种新颖的,有效,稳健和高度可靠的预测PPI方法。本文首次提出了一种基于蛋白质功能注释的方法(FNM)。该方法掺入蛋白质基质,结构域域相互作用,基因本体学注释和序列注释。然后使用不同的融合策略来建立三种方法(JFM,ELM和WELM),其组合蛋白质序列特征在FNM中具有功能性注释特征。在小基因组S.酿酒酵母和大基因组同源SAPIENS的两个数据集上,融合方法表明了比其他基于序列的方法的高成功率。最后,获得了酵母和人蛋白质相互作用网络,并且聚类分析表明,PPI网络在生物学上有意义,表明它们可用于网络采矿研究,例如检测蛋白质复合物和轮毂蛋白,寻找疾病导致蛋白质的蛋白质和药物目标等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号