International Conference on Computational Linguistics

Does Higher Order LSTM Have Better Accuracy for Segmenting and Labeling Sequence Data?



Abstract

Existing neural models usually predict the tag of the current token independently of the neighboring tags. The popular LSTM-CRF model considers dependencies only between consecutive tags, and it is hard for existing neural models to take longer-distance tag dependencies into account: scalability is limited mainly by complex model structures and the cost of dynamic programming during training. In our work, we first design a new model, called "high-order LSTM", which predicts multiple tags for the current token, covering not only the current tag but also the previous several tags. We call the number of tags in one prediction the "order". We then propose a new method, Multi-Order BiLSTM (MO-BiLSTM), which combines low-order and high-order LSTMs. MO-BiLSTM remains scalable to high-order models through a pruning technique. We evaluate MO-BiLSTM on all-phrase chunking and NER datasets. Experimental results show that MO-BiLSTM achieves the state-of-the-art result on chunking and highly competitive results on two NER datasets.
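The order-k labeling scheme described above can be illustrated with a minimal sketch: each order-k label bundles the current tag with the previous k-1 tags, so k=1 recovers ordinary tagging. The function name and the `<s>` padding symbol are our own illustration, not taken from the paper:

```python
def to_order_k(tags, k):
    """Convert a gold tag sequence into order-k composite labels.

    Each order-k label joins the current tag with the previous k-1
    tags, padding with "<s>" at the sentence start. For k=1 this
    reduces to the ordinary one-tag-per-token setup.
    """
    padded = ["<s>"] * (k - 1) + list(tags)
    return ["|".join(padded[i:i + k]) for i in range(len(tags))]

# Example: a BIO chunk-tag sequence at orders 1 and 2.
tags = ["B-NP", "I-NP", "O", "B-VP"]
print(to_order_k(tags, 1))  # ['B-NP', 'I-NP', 'O', 'B-VP']
print(to_order_k(tags, 2))  # ['<s>|B-NP', 'B-NP|I-NP', 'I-NP|O', 'O|B-VP']
```

A model trained on such composite labels predicts several consecutive tags at once, which is what allows longer-distance tag dependencies to be captured without CRF-style dynamic programming; the label space grows with k, which is why pruning matters.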
