首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >A hybrid protein-protein interaction triple extraction method for biomedical literature
【24h】

A hybrid protein-protein interaction triple extraction method for biomedical literature

机译:生物医学文献中一种蛋白质-蛋白质相互作用的三重提取方法

获取原文

摘要

Protein-protein interaction extraction research can be widely applied to the field of life science research. However, most of the machine learning based methods focus on binary PPI relation extraction, which loses rich relationship type information that is critical to the PPIs study. The rule based open information extraction methods can extract the PPI triple (i.e. “protein1, interaction word, protein2”), but suffers from low recall rate problem. In this paper, we propose a hybrid protein-protein interaction triple extraction method. In this method, firstly, machine learning techniques are used to recognize protein entities and extract relational protein pairs. Then, the syntactic patterns and a dictionary are employed to find out corresponding interaction words that represent the relationships between two proteins. This method obtains an F-score of 40.18% on the AImed corpus, which is much higher than the result achieved by the rule based Stanford open information extraction method.
机译:蛋白质-蛋白质相互作用提取研究可广泛应用于生命科学研究领域。但是,大多数基于机器学习的方法都专注于二进制PPI关系提取,这会丢失对PPI研究至关重要的丰富的关系类型信息。基于规则的开放信息提取方法可以提取PPI三元组(即“ protein1,interact word,protein2”),但存在召回率低的问题。在本文中,我们提出了一种混合蛋白-蛋白相互作用的三重提取方法。在这种方法中,首先,机器学习技术用于识别蛋白质实体并提取相关的蛋白质对。然后,使用句法模式和字典来找出代表两个蛋白质之间关系的对应交互词。该方法在AImed语料库上获得40.18%的F分数,这比基于规则的斯坦福公开信息提取方法所获得的结果要高得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号