...
首页> 外文期刊>BMC Medical Genomics >HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks
【24h】

HPOAnnotator: improving large-scale prediction of HPO annotations by low-rank approximation with HPO semantic similarities and multiple PPI networks

机译:HPoannotator:通过使用HPO语义相似性和多个PPI网络,通过低秩近似提高HPO注释的大规模预测

获取原文
           

摘要

As a standardized vocabulary of phenotypic abnormalities associated with human diseases, the Human Phenotype Ontology (HPO) has been widely used by researchers to annotate phenotypes of genes/proteins. For saving the cost and time spent on experiments, many computational approaches have been proposed. They are able to alleviate the problem to some extent, but their performances are still far from satisfactory. For inferring large-scale protein-phenotype associations, we propose HPOAnnotator that incorporates multiple Protein-Protein Interaction (PPI) information and the hierarchical structure of HPO. Specifically, we use a dual graph to regularize Non-negative Matrix Factorization (NMF) in a way that the information from different sources can be seamlessly integrated. In essence, HPOAnnotator solves the sparsity problem of a protein-phenotype association matrix by using a low-rank approximation. By combining the hierarchical structure of HPO and co-annotations of proteins, our model can well capture the HPO semantic similarities. Moreover, graph Laplacian regularizations are imposed in the latent space so as to utilize multiple PPI networks. The performance of HPOAnnotator has been validated under cross-validation and independent test. Experimental results have shown that HPOAnnotator outperforms the competing methods significantly. Through extensive comparisons with the state-of-the-art methods, we conclude that the proposed HPOAnnotator is able to achieve the superior performance as a result of using a low-rank approximation with a graph regularization. It is promising in that our approach can be considered as a starting point to study more efficient matrix factorization-based algorithms.
机译:作为与人类疾病相关的表型异常的标准化词汇,研究人员已广泛使用人类表型本体(HPO)来注释基因/蛋白的表型。为了节省实验所花费的成本和时间,已经提出了许多计算方法。他们能够在一定程度上缓解问题,但他们的表现仍然远非令人满意。对于推断大规模蛋白质表型关联,我们提出了含有多种蛋白质 - 蛋白质相互作用(PPI)信息和HPO的层次结构的HPoandator。具体地,我们使用双图来规范非负矩阵分解(NMF),以便可以无缝地集成来自不同源的信息。实质上,HPoannotator通过使用低秩近似来解决蛋白质表型关联矩阵的稀疏问题。通过组合HPO的分层结构和蛋白质的共同注释,我们的模型可以捕获HPO语义相似之处。此外,图表Laplacian规范化施加在潜在空间中,以便利用多个PPI网络。 HPoannotator的性能已在交叉验证和独立测试下验证。实验结果表明,HPoannotator显着优于竞争方法。通过与最先进的方法进行广泛的比较,我们得出结论,所提出的HPoandator能够通过使用较低级别的近似值来实现优越的性能。有希望的是,我们的方法可以被认为是研究更有效的基于矩阵分子的算法的起点。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号