首页> 外文会议>16th workshop on biomedical natural language processing >Deep learning for extracting protein-protein interactions from biomedical literature
【24h】

Deep learning for extracting protein-protein interactions from biomedical literature

机译:从生物医学文献中提取蛋白质相互作用的深度学习

获取原文
获取原文并翻译 | 示例

摘要

State-of-the-art methods for protein-protein interaction (PPI) extraction are primarily feature-based or kernel-based by leveraging lexical and syntactic information. But how to incorporate such knowledge in the recent deep learning methods remains an open question. In this paper, we propose a multichannel dependency-based convolutional neural network model (McDepCNN). It applies one channel to the embedding vector of each word in the sentence, and another channel to the embedding vector of the head of the corresponding word. Therefore, the model can use richer information obtained from different channels. Experiments on two public benchmarking datasets, AIMed and Biolnfer, demonstrate that McDepCNN compares favorably to the state-of-the-art rich-feature and single-kernel based methods. In addition, McDepCNN achieves 24.4% relative improvement in Fl-score over the state-of-the-art methods on cross-corpus evaluation and 12% improvement in Fl-score over kernel-based methods on "difficult" instances. These results suggest that McDepCNN generalizes more easily over different corpora, and is capable of capturing long distance features in the sentences.
机译:蛋白质和蛋白质相互作用(PPI)提取的最新方法主要是通过利用词汇和句法信息来基于特征或基于核的。但是如何将这些知识整合到最近的深度学习方法中仍然是一个悬而未决的问题。在本文中,我们提出了一个基于多通道依赖的卷积神经网络模型(McDepCNN)。它将一个通道应用于句子中每个单词的嵌入向量,将另一个通道应用于相应单词的头部的嵌入向量。因此,模型可以使用从不同渠道获得的更丰富的信息。在两个公开的基准数据集AIMed和Biolnfer上进行的实验表明,McDepCNN与先进的基于功能丰富和基于单内核的方法相比具有优势。此外,McDepCNN在跨语料库评估方面,F1分数相对于最新方法实现了24.4%的相对改进,在“困难”情况下,与基于内核的方法相比,McDepCNN相对于基于内核的方法实现了Fl得分的12%改善。这些结果表明,McDepCNN可以更轻松地在不同语料库上进行泛化,并且能够捕获句子中的长距离特征。

著录项

  • 来源
  • 会议地点 Vancouver(CA)
  • 作者

    Yifan Peng; Zhiyong Lu;

  • 作者单位

    National Center for Biotechnology Information National Library of Medicine National Institutes of Health Bethesda, MD 20894;

    National Center for Biotechnology Information National Library of Medicine National Institutes of Health Bethesda, MD 20894;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号