首页> 中文期刊> 《计算机科学技术学报:英文版》 >A New Method for Sentiment Analysis Using Contextual Auto-Encoders

A New Method for Sentiment Analysis Using Contextual Auto-Encoders

         

摘要

Sentiment analysis, a hot research topic, presents new challenges for understanding users' opinions and judg-ments expressed online. They aim to classify the subjective texts by assigning them a polarity label. In this paper, weintroduce a novel machine learning framework using auto-encoders network to predict the sentiment polarity label at theword level and the sentence level. Inspired by the dimensionality reduction and the feature extraction capabilities of theauto-encoders, we propose a new model for distributed word vector representation "PMI-SA" using as input pointwise-mutual-information "PMI" word vectors. The resulted continuous word vectors are combined to represent a sentence. Anunsupervised sentence embedding method, called Contextual Recursive Auto-Encoders "CoRAE", is also developed forlearning sentence representation. Indeed, CoRAE follows the basic idea of the recursive auto-encoders to deeply composethe vectors of words constituting the sentence, but without relying on any syntactic parse tree. The CoRAE model consistsin combining recursively each word with its context words (neighbors' words: previous and next) by considering the wordorder. A support vector machine classifier with fine-tuning technique is also used to show that our deep compositionalrepresentation model CoRAE improves significantly the accuracy of sentiment analysis task. Experimental results demon-strate that CoRAE remarkably outperforms several competitive baseline methods on two databases, namely, Sanders twittercorpus and Facebook comments corpus. The CoRAE model achieves an efficiency of 83.28% with the Facebook dataset and97.57% with the Sanders dataset.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号