【24h】

A hybrid method for bilingual text sentiment classification based on deep learning

机译:基于深度学习的双语文本情感分类混合方法

获取原文

摘要

Text sentiment classification has occupied a pivotal position in sentiment analysis research, it offers important opinion mining functions. Nowadays, with explosion of information, many researchers are focusing on sentiment classification research on massive amounts of data. However, the traditional machine learning methods cannot acquire text semantic information and most research achievements are about single language, in this paper, a hybrid method which integrates the deep learning features and shallow learning features is proposed. The hybrid method can not only realize single language text sentiment classification but realize bilingual text sentiment classification as well. Models such as recurrent neural networks (RNNs) with long short term memory(LSTM), Naïve Bayes Support Vector Machine (NB-SVM), word vectors and bag-of-words are explored. Firstly, these models are studied separately in sentiment classification task. The paper then integrates the above methods as a whole to complete the task. Different combination strategies are discussed regarding the contribution of each method. The experiments show that the accuracy can reach 89% and the hybrid method performs much better than any other method individually. The proposed method achieves a performance close to the state-of-the-art methods based on the had-engineered features. What's more, the hybrid model can learn more linguistic phenomena with the growth of the accuracy of emotional tendency discrimination when more background knowledge is available.
机译:文本情感分类在情感分析研究中占有举足轻重的地位,具有重要的观点挖掘功能。如今,随着信息的爆炸式增长,许多研究人员正在集中精力对大量数据进行情感分类研究。然而,传统的机器学习方法无法获取文本语义信息,并且大多数研究成果都是关于单一语言的,因此,本文提出了一种融合了深度学习特征和浅层学习特征的混合方法。混合方法不仅可以实现单语言文本情感分类,还可以实现双语文本情感分类。探索了具有长期短期记忆(LSTM)的递归神经网络(RNN),朴素贝叶斯支持向量机(NB-SVM),词向量和词袋等模型。首先,在情感分类任务中分别研究了这些模型。然后,本文将上述方法作为一个整体进行了整合,以完成任务。关于每种方法的贡献,讨论了不同的组合策略。实验表明,该方法的准确率可以达到89%,并且混合方法的性能要比其他任何一种方法都要好得多。所提出的方法实现了与基于已设计特征的最新技术相近的性能。此外,当可获得更多背景知识时,随着情感倾向辨别准确性的提高,混合模型可以学习更多的语言现象。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号