Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Learning Context Using Segment-Level LSTM for Neural Sequence Labeling


Abstract

This article introduces an approach that learns segment-level context for sequence labeling in natural language processing (NLP). Previous approaches limit their basic unit for feature extraction to the word, because sequence labeling is a token-level task in which labels are annotated word by word. However, the text segment is the ultimate unit of labeling, and segment information can easily be obtained from annotated labels in the IOB/IOBES format. Most neural sequence labeling models expand their learning capacity by employing additional layers, such as a character-level layer, or by jointly training NLP tasks with common knowledge. The architecture of our model is based on the charLSTM-BiLSTM-CRF model, which we extend with an additional segment-level layer called segLSTM. We therefore propose a sequence labeling algorithm called charLSTM-BiLSTM-CRF-segLSTM$^{sLM}$, which employs an additional segment-level long short-term memory (LSTM) that trains features by learning adjacent context within a segment. We demonstrate the performance of our model on four sequence labeling datasets, namely, Penn Treebank, CoNLL 2000, CoNLL 2003, and OntoNotes 5.0. Experimental results show that our model performs better than state-of-the-art variants of BiLSTM-CRF. In particular, the proposed model improves performance on tasks that require finding appropriate labels for multi-token segments.
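The abstract notes that segment information can be recovered directly from token-level annotations in the IOB/IOBES format. A minimal sketch of that extraction step, assuming IOB tags of the form `B-TYPE`/`I-TYPE`/`O` (the helper `iob_segments` is illustrative, not from the paper):

```python
def iob_segments(labels):
    """Extract (start, end, type) segments from a sequence of IOB tags.

    `start` is inclusive, `end` is exclusive, and `type` is the label
    suffix after "B-"/"I-" (e.g. "PER" in "B-PER").
    """
    segments = []
    start, seg_type = None, None
    for i, label in enumerate(labels):
        # A "B-" tag, or an "I-" tag whose type differs from the open
        # segment, starts a new segment; close any open segment first.
        if label.startswith("B-") or (label.startswith("I-") and seg_type != label[2:]):
            if seg_type is not None:
                segments.append((start, i, seg_type))
            start, seg_type = i, label[2:]
        elif label == "O":
            if seg_type is not None:
                segments.append((start, i, seg_type))
            start, seg_type = None, None
    if seg_type is not None:  # close a segment that runs to the end
        segments.append((start, len(labels), seg_type))
    return segments
```

For example, the tag sequence `["B-PER", "I-PER", "O", "B-LOC", "O"]` yields the two segments `(0, 2, "PER")` and `(3, 4, "LOC")`, which is the segment-level view the proposed segLSTM layer operates over.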
