首页> 外文会议>Annual Meeting of the Association for Computational Linguistics;International Joint Conference on natural Language Processing >A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization
【24h】

A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization

机译:基于神经转换的疾病联合模型,称为实体识别和归一化

获取原文

摘要

Disease is one of the fundamental entities in biomedical research. Recognizing such entities from biomedical text and then normalizing them to a standardized disease vocabulary offer a tremendous opportunity for many downstream applications. Previous studies have demonstrated that joint modeling of the two sub-tasks has superior performance than the pipelined counterpart. Although the neural joint model based on multi-task learning framework has achieved state-of-the-art performance, it suffers from the boundary inconsistency problem due to the separate decoding procedures. Moreover, it ignores the rich information (e.g., the text surface form) of each candidate concept in the vocabulary, which is quite essential for entity normalization. In this work, we propose a neural transition-based joint model to alleviate these two issues. We transform the end-to-end disease recognition and normalization task as an action sequence prediction task, which not only jointly learns the model with shared representations of the input, but also jointly searches the output by state transitions in one search space. Moreover, we introduce attention mechanisms to take advantage of the text surface form of each candidate concept for better normalization performance. Experimental results conducted on two publicly available datasets show the effectiveness of the proposed method.
机译:疾病是生物医学研究中的基本实体之一。从生物医学文本中识别这些实体,然后将它们标准化为标准化的疾病词汇,为许多下游应用提供巨大的机会。以前的研究表明,两个子任务的联合建模具有优于流水线对应物的优越性。尽管基于多任务学习框架的神经联合模型已经实现了最先进的性能,但由于单独的解码程序,它遭受了边界不一致问题。此外,它忽略了词汇表中每个候选概念的丰富的信息(例如,文本表面形式),这对于实体归一化是至关重要的。在这项工作中,我们提出了一种基于神经转换的联合模型来缓解这两个问题。我们将端到端的疾病识别和归一化任务转换为动作序列预测任务,这不仅共同了解了输入的共享表示,而且还共同搜索了一个搜索空间中的状态转换。此外,我们引入了注意机制,利用每个候选概念的文本表面形式,以获得更好的归一化性能。在两个公共数据集中进行的实验结果表明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号