首页> 美国卫生研究院文献>ACS Omega >Function Prediction for G Protein-Coupled Receptorsthrough Text Mining and Induction Matrix Completion
【2h】

Function Prediction for G Protein-Coupled Receptorsthrough Text Mining and Induction Matrix Completion

机译:G蛋白偶联受体的功能预测通过文本挖掘和归纳矩阵完成

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

G protein-coupled receptors (GPCRs) constitute the key component of cellular signal transduction. Accurately annotating the biological functions of GPCR proteins is vital to the understanding of the physiological processes they involve in. With the rapid development of text mining technologies and the exponential growth of biomedical literature, it becomes urgent to explore biological functional information from various literature for systematically and reliably annotating these known GPCRs. We design a novel three-stage approach, TM–IMC, using text mining and inductive matrix completion, for automated prediction of the gene ontology (GO) terms of the GPCR proteins. Large-scale benchmark tests show that inductive matrix completion models contribute to GPCR-GO association prediction for both molecular function and biological process aspects. Moreover, our detailed data analysis shows that information extracted from GPCR-associated literature indeed contributes to the prediction of GPCR–GO associations. The study demonstrated a new avenue to enhance the accuracy of GPCR functionannotation through the combination of text mining and induction matrixcompletion over baseline methods in critical assessment of proteinfunction annotation algorithms and literature-based GO annotationmethods. Source codes of TM–IMC and the involved datasets canbe freely downloaded from for academic purposes.
机译:G蛋白偶联受体(GPCR)构成细胞信号转导的关键组成部分。准确注释GPCR蛋白质的生物学功能对于理解它们所涉及的生理过程至关重要。随着文本挖掘技术的迅速发展和生物医学文献的指数增长,迫切需要从各种文献中系统地探索生物学功能信息并可靠地注释这些已知的GPCR。我们使用文本挖掘和归纳矩阵完成设计了一种新颖的三阶段方法TM–IMC,用于自动预测GPCR蛋白质的基因本体(GO)术语。大规模基准测试表明,归纳矩阵完成模型有助于分子功能和生物学过程方面的GPCR-GO关联预测。此外,我们的详细数据分析表明,从GPCR相关文献中提取的信息确实有助于GPCR-GO关联的预测。该研究表明了提高GPCR功能准确性的新途径通过文本挖掘和归纳矩阵相结合的注释蛋白质关键评估中超过基线方法的完成函数注释算法和基于文献的GO注释方法。 TM–IMC的源代码和涉及的数据集可以可从学术目的免费下载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号