首页> 外文会议>International joint conference on natural language processing >Combining Spans into Entities: A Neural Two-Stage Approach for Recognizing Discontiguous Entities
【24h】

Combining Spans into Entities: A Neural Two-Stage Approach for Recognizing Discontiguous Entities

机译:将跨度与实体结合起来:一种识别不连续实体的神经两阶段方法

获取原文

摘要

In medical documents, it is possible that an entity of interest not only contains a discontiguous sequence of words but also overlaps with another entity. Entities of such structures are intrinsically hard to recognize due to the large space of possible entity combinations. In this work, we propose a neural two-stage approach to recognizing discontiguous and overlapping entities by decomposing this problem into two subtasks: 1) it first detects all the overlapping spans that either form entities on their own or present as segments of discontiguous entities, based on the representation of segmental hypergraph, 2) next it learns to combine these segments into discontiguous entities with a classilier, which fillers out other incorrect combinations of segments. Two neural components are designed for these subtasks respectively and they are learned jointly using a shared encoder for text. Our model achieves the state-of-the-art performance in a standard dataset, even in the absence of external features that previous methods used.
机译:在医疗文件,有可能感兴趣的实体不仅包含单词的不连续的顺序也与其他实体重叠。这种结构的实体是本质上难以识别由于可能的实体组合大的空间。在这项工作中,我们提出了一种神经两阶段的方法来识别不连续的并通过分解这一问题重叠实体成两个子任务:1)它首先检测所有重叠跨度上他们自己的或本任一形式的实体作为不连续的实体区段基于超图节段性的表示,2)下它学会这些段组合成具有不连续classilier实体,填料出区段的其它不正确组合。两个神经元件分别设计为这些子任务,他们正在对文本使用编码器共享共同的经验教训。我们的模型实现了标准数据集的状态的最先进的性能,即使在不存在外部特征,以往的方法中使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号