首页> 外文OA文献 >From POS tagging to dependency parsing for biomedical event extraction
【2h】

From POS tagging to dependency parsing for biomedical event extraction

机译:从POS标记到依赖解析生物医学事件提取的解析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Abstract Background Given the importance of relation or event extraction from biomedical research publications to support knowledge capture and synthesis, and the strong dependency of approaches to this information extraction task on syntactic information, it is valuable to understand which approaches to syntactic processing of biomedical text have the highest performance. Results We perform an empirical study comparing state-of-the-art traditional feature-based and neural network-based models for two core natural language processing tasks of part-of-speech (POS) tagging and dependency parsing on two benchmark biomedical corpora, GENIA and CRAFT. To the best of our knowledge, there is no recent work making such comparisons in the biomedical context; specifically no detailed analysis of neural models on this data is available. Experimental results show that in general, the neural models outperform the feature-based models on two benchmark biomedical corpora GENIA and CRAFT. We also perform a task-oriented evaluation to investigate the influences of these models in a downstream application on biomedical event extraction, and show that better intrinsic parsing performance does not always imply better extrinsic event extraction performance. Conclusion We have presented a detailed empirical study comparing traditional feature-based and neural network-based models for POS tagging and dependency parsing in the biomedical context, and also investigated the influence of parser selection for a biomedical event extraction downstream task. Availability of data and materials We make the retrained models available at https://github.com/datquocnguyen/BioPosDep.
机译:摘要背景鉴于生物医学研究出版物的关系或事件提取的重要性,支持知识捕获和综合,以及对该信息提取任务的方法对句法信息的强大依赖性,了解生物医学文本的句法处理的方法是有价值的最高的性能。结果我们对比较了最先进的传统特征和神经网络的模型进行了实证研究,用于两个核心自然语言处理任务的两种核心自然语言处理任务,在两个基准生物医学的基础上进行语音(POS)标记和依赖性解析, Genia和Craft。据我们所知,最近没有在生物医学背景下进行比较的工作;特别是没有对这些数据的神经模型进行详细分析。实验结果表明,一般来说,神经模型在两个基准生物医学Corpia和Craft上表现出基于特征的模型。我们还执行面向任务的评估,以调查这些模型在生物医学事件提取的下游应用中的影响,并表明更好的内在解析性能并不总是意味着更好的外在事件提取性能。结论我们介绍了一种详细的实证研究,比较了传统的基于特征和神经网络的基于神经网络的模型,用于在生物医学背景下进行POS标签和依赖性解析,并研究了解析器选择对生物医学事件提取下游任务的影响。数据和材料的可用性我们使HTTPS://github.com/datquocnguyen/bioposdep可获得培训型号。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号