ASRNN: A recurrent neural network with an attention model for sequence labeling

Lin Jerry Chun-Wei; Shao Yinan; Djenouri Youcef; Yun Unil

首页> 外文期刊>Knowledge-Based Systems >ASRNN: A recurrent neural network with an attention model for sequence labeling

【24h】

ASRNN: A recurrent neural network with an attention model for sequence labeling

机译：Asrnn：具有序列标记的注意力模型的经常性神经网络

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Natural language processing (NLP) is useful for handling text and speech, and sequence labeling plays an important role by automatically analyzing a sequence (text) to assign category labels to each part. However, the performance of these conventional models depends greatly on hand-crafted features and task-specific knowledge, which is a time consuming task. Several conditional random fields (CRF)-based models for sequence labeling have been presented, but the major limitation is how to use neural networks for extracting useful representations for each unit or segment in the input sequence. In this paper, we propose an attention segmental recurrent neural network (ASRNN) that relies on a hierarchical attention neural semi-Markov conditional random fields (semi-CRF) model for the task of sequence labeling. Our model uses a hierarchical structure to incorporate character-level and word-level information and applies an attention mechanism to both levels. This enables our method to differentiate more important information from less important information when constructing the segmental representation. We evaluated our model on three sequence labeling tasks, including named entity recognition (NER), chunking, and reference parsing. Experimental results show that the proposed model benefited from the hierarchical structure, and it achieved competitive and robust performance on all three sequence labeling tasks. (C) 2020 Elsevier B.V. All rights reserved.Y

机译：自然语言处理（NLP）对于处理文本和语音很有用，并且序列标记通过自动分析序列（文本）来为每个部分分配类别标签来播放重要作用。然而，这些传统模型的性能很大程度上取决于手工制作的特征和特定的特定知识，这是一个耗时的任务。已经提出了几种条件随机字段（CRF）的序列标签模型，但主要限制是如何使用神经网络来提取输入序列中每个单元或段的有用表示。在本文中，我们提出了一种注意力复发性神经网络（ASRNN），其依赖于序列标记任务任务的分层关注神经半马尔可夫条件随机字段（半CRF）模型。我们的模型使用分层结构来包含字符级和字级信息，并将注意力机制应用于两个级别。这使我们的方法能够在构建分段表示时从不太重要的信息中区分更重要的信息。我们在三个序列标签任务中评估了我们的模型，包括命名实体识别（ner），块和引用解析。实验结果表明，拟议的模型受益于层次结构，并在所有三个序列标签任务中取得了竞争力和强大的性能。（c）2020 Elsevier B.v.保留所有权利.Y

著录项

来源
《Knowledge-Based Systems》 |2021年第5期|106548.1-106548.11|共11页
作者
Lin Jerry Chun-Wei; Shao Yinan; Djenouri Youcef; Yun Unil;
展开▼
作者单位

Qingdao Univ Technol Sch Informat & Control Engn Qingdao Peoples R China|Western Norway Univ Appl Sci Dept Comp Sci Elect Engn & Math Sci Bergen Norway;

Alibaba Inc Hangzhou Zhejiang Peoples R China;

SINTEF Digital Oslo Norway;

Sejong Univ Dept Comp Engn Seoul South Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Semi-CRF; Attention mechanism; Sequence labeling; Neural network;

机译：半CRF;注意机制;序列标记;神经网络;

相似文献

外文文献
中文文献
专利

1. Deep Learning for Load Forecasting: Sequence to Sequence Recurrent Neural Networks With Attention [J] . Sehovac Ljubisa, Grolinger Katarina Quality Control, Transactions . 2020,第期

机译：对负荷预测的深度学习：序列以序列复发性神经网络的注意力
2. Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks [J] . Lyu Fan, Wu Qi, Hu Fuyuan, IEEE transactions on multimedia . 2019,第8期

机译：参加和想象：具有视觉注意力和递归神经网络的多标签图像分类
3. 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition [J] . Mingyi Chen, Xuanji He, Jing Yang, IEEE signal processing letters . 2018,第10期

机译：具有注意力模型的3-D卷积递归神经网络用于语音情感识别
4. Attention-Based Recurrent Neural Network for Sequence Labeling [C] . Bofang Li, Tao Liu, Zhe Zhao, Aisa-Pacific web and web-age information management joint conference on web and big data . 2018

机译：基于注意力的递归神经网络的序列标记
5. Convolutional Recurrent Neural Networks and Attention Mechanisms for Robust Deep Learning [D] . Zheng, Jian . 2019

机译：坚固深度学习的卷积经常性神经网络和注意力机制
6. Predicting the Outcome of Patient-Provider Communication Sequences using Recurrent Neural Networks and Probabilistic Models [O] . Mehedi Hasan, Alexander Kotov, April Idalski Carcone, 2018

机译：使用递归神经网络和概率模型预测患者与提供者的交流序列的结果
7. Deep Learning for Load Forecasting: Sequence to Sequence Recurrent Neural Networks With Attention [O] . Ljubisa Sehovac, Katarina Grolinger 2020

机译：对负荷预测的深度学习：序列以序列复发性神经网络的注意力

ASRNN: A recurrent neural network with an attention model for sequence labeling

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅