Neurocomputing

Bidirectional LSTM with attention mechanism and convolutional layer for text classification

Abstract

Neural network models have been widely used in the field of natural language processing (NLP). Recurrent neural networks (RNNs), which can process sequences of arbitrary length, are a common choice for sequence modeling tasks. Long short-term memory (LSTM), one kind of RNN, has achieved remarkable performance in text classification. However, the high dimensionality and sparsity of text data, together with the complex semantics of natural language, make text classification challenging. To address these problems, this paper proposes a novel, unified architecture that combines a bidirectional LSTM (BiLSTM), an attention mechanism, and a convolutional layer, called attention-based bidirectional long short-term memory with convolution layer (AC-BiLSTM). In AC-BiLSTM, the convolutional layer extracts higher-level phrase representations from the word embedding vectors, and the BiLSTM accesses both the preceding and succeeding context representations. An attention mechanism assigns different degrees of focus to the information output by the hidden layers of the BiLSTM. Finally, a softmax classifier classifies the processed context information. AC-BiLSTM captures both the local features of phrases and the global sentence semantics. Experiments are conducted on six sentiment classification datasets and a question classification dataset, including a detailed analysis of AC-BiLSTM. The results clearly show that AC-BiLSTM outperforms other state-of-the-art text classification methods in terms of classification accuracy. (C) 2019 Elsevier B.V. All rights reserved.
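
To make the pipeline the abstract describes concrete (convolution over word embeddings, then BiLSTM, then attention, then a softmax classifier), here is a minimal PyTorch sketch. All hyperparameters (embedding size, kernel width, hidden size) and all class and variable names are illustrative assumptions, not values taken from the paper.

    # A minimal sketch of the AC-BiLSTM pipeline described in the abstract.
    # Hyperparameters below are assumptions for illustration only.
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class ACBiLSTM(nn.Module):
        def __init__(self, vocab_size, num_classes,
                     embed_dim=300, conv_channels=100, kernel_size=3,
                     hidden_dim=150):
            super().__init__()
            self.embedding = nn.Embedding(vocab_size, embed_dim)
            # Convolutional layer: extracts higher-level phrase representations
            # from the word embedding vectors (padding preserves sequence length).
            self.conv = nn.Conv1d(embed_dim, conv_channels, kernel_size,
                                  padding=kernel_size // 2)
            # BiLSTM: accesses both preceding and succeeding context.
            self.bilstm = nn.LSTM(conv_channels, hidden_dim,
                                  batch_first=True, bidirectional=True)
            # Attention: one learned score per time step over BiLSTM outputs.
            self.attn = nn.Linear(2 * hidden_dim, 1)
            self.classifier = nn.Linear(2 * hidden_dim, num_classes)

        def forward(self, token_ids):                  # (batch, seq_len)
            x = self.embedding(token_ids)              # (batch, seq_len, embed_dim)
            x = F.relu(self.conv(x.transpose(1, 2)))   # (batch, channels, seq_len)
            x = x.transpose(1, 2)                      # (batch, seq_len, channels)
            h, _ = self.bilstm(x)                      # (batch, seq_len, 2*hidden)
            # Attention weights give different focus to each hidden state.
            w = torch.softmax(self.attn(h).squeeze(-1), dim=1)   # (batch, seq_len)
            context = torch.bmm(w.unsqueeze(1), h).squeeze(1)    # (batch, 2*hidden)
            # Returns logits; the softmax is applied by the loss
            # (e.g. nn.CrossEntropyLoss) or at prediction time.
            return self.classifier(context)

    if __name__ == "__main__":
        model = ACBiLSTM(vocab_size=20000, num_classes=5)
        tokens = torch.randint(0, 20000, (8, 40))  # batch of 8 sequences, 40 tokens
        print(model(tokens).shape)                 # torch.Size([8, 5])

The attention here is a single learned scoring layer followed by a softmax over time steps, which matches the abstract's description of weighting the BiLSTM hidden-layer outputs; the paper may use a different attention parameterization.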