首页> 外文期刊>Methods: A Companion to Methods in Enzymology >Literature mining of protein phosphorylation using dependency parse trees
【24h】

Literature mining of protein phosphorylation using dependency parse trees

机译:依赖解析树木蛋白质磷酸化的文献挖掘

获取原文
获取原文并翻译 | 示例
           

摘要

As one of the most common post-translational modifications (PTMs), protein phosphorylation plays an important role in various biological processes, such as signaling transduction, cellular metabolism, differentiation, growth, regulation and apoptosis. Protein phosphorylation is of great value not only in illustrating the underlying molecular mechanisms but also in treatment of diseases and design of new drugs. Recently, there is an increasing interest in automatically extracting phosphorylation information from biomedical literatures. However, it still remains a challenging task due to the tremendous volume of literature and circuitous modes of expression for protein phosphorylation. To address this issue, we propose a novel text-mining method for efficiently retrieving and extracting protein phosphorylation information from literature. By employing natural language processing (NLP) technologies, this method transforms each sentence into dependency parse trees that can precisely reflect the intrinsic relationship of phosphorylation-related key words, from which detailed information of substrates, kinases and phosphorylation sites is extracted based on syntactic patterns. Compared with other existing approaches, the proposed method demonstrates significantly improved performance, suggesting it is a powerful bioinformatics approach to retrieving phosphorylation information from a large amount of literature. A web server for the proposed method is freely available at http://bioinformatics.ustc.edu.cn/pptm/.
机译:作为最常见的翻译后修饰(PTMS)之一,蛋白质磷酸化在各种生物过程中起重要作用,例如信号传递转导,细胞代谢,分化,生长,调控和凋亡。蛋白质磷酸化不仅具有很大的价值,不仅是说明潜在的分子机制,而且具有重要的分子机制,而且具有重要的分子机制,而且是治疗新药的疾病和设计。最近,对自动从生物医学文献中提取磷酸化信息的兴趣越来越令人兴趣。然而,由于蛋白质磷酸化的巨大文献和迂回模式,它仍然是一个具有挑战性的任务。为了解决这个问题,我们提出了一种新的文本挖掘方法,用于从文献中有效地检索和提取蛋白质磷酸化信息。通过采用自然语言处理(NLP)技术,该方法将每个句子转变为依赖性解析树,可以精确地反映磷酸化相关关键词的内在关系,基于句法模式提取底物,激酶和磷酸化位点的详细信息。与其他现有方法相比,该方法的性能显着提高,表明它是一种强大的生物信息学方法,可以从大量文献中检索磷酸化信息。用于该方法的Web服务器在http://bioinformatics.ustc.edu.cn/pptm/上自由使用。

著录项

  • 来源
  • 作者

    WangM.; XiaH.; SunD.; ChenZ.; LiA.;

  • 作者单位

    School of Information Science and Technology University of Science and Technology of China Hefei;

    School of Information Science and Technology University of Science and Technology of China Hefei;

    School of Information Science and Technology University of Science and Technology of China Hefei;

    School of Life Sciences University of Science and Technology of China Hefei AH230027 China;

    School of Information Science and Technology University of Science and Technology of China Hefei;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 生物化学;
  • 关键词

    Dependency parse tree; Phosphorylation; Systems biology; Text mining;

    机译:依赖解析树;磷酸化;系统生物学;文本挖掘;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号