首页> 外文期刊>Methods: A Companion to Methods in Enzymology >Literature mining of protein phosphorylation using dependency parse trees
【24h】

Literature mining of protein phosphorylation using dependency parse trees

机译:使用依赖解析树进行蛋白质磷酸化的文献挖掘

获取原文
获取原文并翻译 | 示例
           

摘要

As one of the most common post-translational modifications (PTMs), protein phosphorylation plays an important role in various biological processes, such as signaling transduction, cellular metabolism, differentiation, growth, regulation and apoptosis. Protein phosphorylation is of great value not only in illustrating the underlying molecular mechanisms but also in treatment of diseases and design of new drugs. Recently, there is an increasing interest in automatically extracting phosphorylation information from biomedical literatures. However, it still remains a challenging task due to the tremendous volume of literature and circuitous modes of expression for protein phosphorylation. To address this issue, we propose a novel text-mining method for efficiently retrieving and extracting protein phosphorylation information from literature. By employing natural language processing (NLP) technologies, this method transforms each sentence into dependency parse trees that can precisely reflect the intrinsic relationship of phosphorylation-related key words, from which detailed information of substrates, kinases and phosphorylation sites is extracted based on syntactic patterns. Compared with other existing approaches, the proposed method demonstrates significantly improved performance, suggesting it is a powerful bioinformatics approach to retrieving phosphorylation information from a large amount of literature. A web server for the proposed method is freely available at http://bioinformatics.ustc.edu.cn/pptm/.
机译:作为最常见的翻译后修饰(PTM)之一,蛋白质磷酸化在各种生物过程中发挥重要作用,例如信号转导,细胞代谢,分化,生长,调节和凋亡。蛋白质磷酸化不仅在阐明潜在的分子机制方面具有重要价值,而且在疾病治疗和新药设计中也具有重要价值。最近,人们越来越感兴趣从生物医学文献中自动提取磷酸化信息。然而,由于大量文献和蛋白质磷酸化的modes回表达方式,它仍然是一项艰巨的任务。为了解决这个问题,我们提出了一种新颖的文本挖掘方法,可以有效地从文献中检索和提取蛋白质磷酸化信息。通过使用自然语言处理(NLP)技术,该方法将每个句子转换为依存分析树,这些树可以准确反映磷酸化相关关键字的内在关系,并根据句法模式从中提取底物,激酶和磷酸化位点的详细信息。与其他现有方法相比,所提出的方法显示出显着改善的性能,表明它是从大量文献中检索磷酸化信息的强大生物信息学方法。可以从http://bioinformatics.ustc.edu.cn/pptm/免费获得用于该方法的Web服务器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号