...
首页> 外文期刊>Knowledge-Based Systems >ASRNN: A recurrent neural network with an attention model for sequence labeling
【24h】

ASRNN: A recurrent neural network with an attention model for sequence labeling

机译:Asrnn:具有序列标记的注意力模型的经常性神经网络

获取原文
获取原文并翻译 | 示例

摘要

Natural language processing (NLP) is useful for handling text and speech, and sequence labeling plays an important role by automatically analyzing a sequence (text) to assign category labels to each part. However, the performance of these conventional models depends greatly on hand-crafted features and task-specific knowledge, which is a time consuming task. Several conditional random fields (CRF)-based models for sequence labeling have been presented, but the major limitation is how to use neural networks for extracting useful representations for each unit or segment in the input sequence. In this paper, we propose an attention segmental recurrent neural network (ASRNN) that relies on a hierarchical attention neural semi-Markov conditional random fields (semi-CRF) model for the task of sequence labeling. Our model uses a hierarchical structure to incorporate character-level and word-level information and applies an attention mechanism to both levels. This enables our method to differentiate more important information from less important information when constructing the segmental representation. We evaluated our model on three sequence labeling tasks, including named entity recognition (NER), chunking, and reference parsing. Experimental results show that the proposed model benefited from the hierarchical structure, and it achieved competitive and robust performance on all three sequence labeling tasks. (C) 2020 Elsevier B.V. All rights reserved.Y
机译:自然语言处理(NLP)对于处理文本和语音很有用,并且序列标记通过自动分析序列(文本)来为每个部分分配类别标签来播放重要作用。然而,这些传统模型的性能很大程度上取决于手工制作的特征和特定的特定知识,这是一个耗时的任务。已经提出了几种条件随机字段(CRF)的序列标签模型,但主要限制是如何使用神经网络来提取输入序列中每个单元或段的有用表示。在本文中,我们提出了一种注意力复发性神经网络(ASRNN),其依赖于序列标记任务任务​​的分层关注神经半马尔可夫条件随机字段(半CRF)模型。我们的模型使用分层结构来包含字符级和字级信息,并将注意力机制应用于两个级别。这使我们的方法能够在构建分段表示时从不太重要的信息中区分更重要的信息。我们在三个序列标签任务中评估了我们的模型,包括命名实体识别(ner),块和引用解析。实验结果表明,拟议的模型受益于层次结构,并在所有三个序列标签任务中取得了竞争力和强大的性能。 (c)2020 Elsevier B.v.保留所有权利.Y

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号