首页> 外文期刊>BMC Medical Informatics and Decision Making >Hierarchical sequence labeling for extracting BEL statements from biomedical literature
【24h】

Hierarchical sequence labeling for extracting BEL statements from biomedical literature

机译:从生物医学文献中提取BEL陈述的分层序列标记

获取原文
           

摘要

Extracting relations between bio-entities from biomedical literature is often a challenging task and also an essential step towards biomedical knowledge expansion. The BioCreative community has organized a shared task to evaluate the robustness of the causal relationship extraction algorithms in Biological Expression Language (BEL) from biomedical literature. We first map the sentence-level BEL statements in the BC-V training corpus to the corresponding text segments, thus generating hierarchically tagged training instances. A hierarchical sequence labeling model was afterwards induced from these training instances and applied to the test sentences in order to construct the BEL statements. The experimental results on extracting BEL statements from BioCreative V Track 4 test corpus show that our method achieves promising performance with an overall F-measure of 31.6%. Furthermore, it has the potential to be enhanced by adopting more advanced machine learning approaches. We propose a framework for hierarchical relation extraction using hierarchical sequence labeling on the instance-level training corpus derived from the original sentence-level corpus via word alignment. Its main advantage is that we can make full use of the original training corpus to induce the sequence labelers and then apply them to the test corpus.
机译:从生物医学文献中提取生物实体之间的关系通常是一项艰巨的任务,同时也是迈向生物医学知识扩展的重要步骤。 BioCreative社区组织了一项共享任务,以评估生物医学文献中生物表达语言(BEL)中因果关系提取算法的鲁棒性。我们首先将BC-V训练语料库中的句子级BEL语句映射到相应的文本段,从而生成分层标记的训练实例。然后,从这些训练实例中引入层次序列标签模型,并将其应用于测试语句,以构建BEL语句。从BioCreative V Track 4测试语料库中提取BEL语句的实验结果表明,我们的方法以31.6%的整体F值实现了有希望的性能。此外,它有可能通过采用更高级的机器学习方法而得到增强。我们提出了一个框架,该框架使用实例序列训练语料库上通过单词对齐从原始句子级语料库导出的层次序列标签来进行层次关系提取。它的主要优点是我们可以充分利用原始的训练语料库来诱导序列标记,然后将其应用于测试语料库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号