In text classification, bidirectional recurrent neural networks based on word-level attention suffer from a weakness: generating the text representation directly from weighted words loses a large amount of information, which makes the network hard to train on small-scale datasets. Moreover, a word acquires a clear meaning only when combined with its context into a phrase, and the semantics of a text is often determined by a few key phrases, so a text representation composed by learning the weights of phrases can be more accurate than one composed by learning the weights of words. This paper therefore proposes NN-PA, a neural network architecture based on a phrase-level attention mechanism. In this architecture, a convolutional layer is added after the word embedding layer to extract representations of N-gram phrases, and a bidirectional recurrent neural network with attention then learns the text representation. Five kinds of attention mechanisms are tested. Experiments show that the NN-PA models based on the different attention mechanisms not only clearly improve classification accuracy on both small- and large-scale datasets but also converge faster. In particular, NN-PA1 and NN-PA2 outperform state-of-the-art deep learning models, and NN-PA2 reaches 53.35% accuracy on the five-class task of the Stanford Sentiment Treebank, the best result to our knowledge.
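To make the described pipeline concrete, the following is a minimal PyTorch sketch of an NN-PA-style model: embedding, then a convolution over word windows to form N-gram phrase representations, then a bidirectional recurrent encoder whose states are pooled by attention. The kernel size, hidden sizes, the choice of a GRU, and the additive attention scoring are all illustrative assumptions; the paper's five attention variants and exact hyperparameters are not reproduced here.

    # Minimal sketch of an NN-PA-style model (assumptions: kernel size 3 for
    # trigram phrases, a BiGRU encoder, additive attention; not the paper's
    # exact configuration).
    import torch
    import torch.nn as nn

    class NNPASketch(nn.Module):
        def __init__(self, vocab_size, emb_dim=300, conv_dim=256,
                     hid_dim=128, n_classes=5, kernel_size=3):
            super().__init__()
            self.emb = nn.Embedding(vocab_size, emb_dim)
            # Convolution over word windows yields N-gram phrase representations.
            self.conv = nn.Conv1d(emb_dim, conv_dim, kernel_size,
                                  padding=kernel_size // 2)
            # Bidirectional recurrent encoder over the phrase sequence.
            self.rnn = nn.GRU(conv_dim, hid_dim, bidirectional=True,
                              batch_first=True)
            # One illustrative attention variant: additive scoring against a
            # learned context vector.
            self.att_proj = nn.Linear(2 * hid_dim, 2 * hid_dim)
            self.att_ctx = nn.Linear(2 * hid_dim, 1, bias=False)
            self.out = nn.Linear(2 * hid_dim, n_classes)

        def forward(self, token_ids):            # (batch, seq_len)
            x = self.emb(token_ids)              # (batch, seq_len, emb_dim)
            x = torch.relu(self.conv(x.transpose(1, 2))).transpose(1, 2)
            h, _ = self.rnn(x)                   # (batch, seq_len, 2*hid_dim)
            scores = self.att_ctx(torch.tanh(self.att_proj(h)))
            alpha = torch.softmax(scores, dim=1) # phrase-level weights
            text_repr = (alpha * h).sum(dim=1)   # weighted sum of phrase states
            return self.out(text_repr)           # class logits

Within this frame, swapping the scoring function (for example, a dot product against the final hidden state instead of the additive form above) would yield other attention variants of the kind the paper compares.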