...
首页> 外文期刊>Journal of Information Science >Semisupervised sentiment analysis method for online text reviews
【24h】

Semisupervised sentiment analysis method for online text reviews

机译:关于在线文本评论的半质化情绪分析方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Sentiment analysis plays an important role in understanding individual opinions expressed in websites such as social media and product review sites. The common approaches to sentiment analysis use the sentiments carried by words that express opinions and are based on either supervised or unsupervised learning techniques. The unsupervised learning approach builds a word-sentiment dictionary, but it requires lengthy time periods and high costs to build a reliable dictionary. The supervised learning approach uses machine learning models to learn the sentiment scores of words; however, training a classifier model requires large amounts of labelled text data to achieve a good performance. In this article, we propose a semisupervised approach that performs well despite having only small amounts of labelled data available for training. The proposed method builds a base sentiment dictionary from a small training dataset using a lasso-based ensemble model with minimal human effort. The scores of words not in the training dataset are estimated using an adaptive instance-based learning model. In a pretrained word2vec model space, the sentiment values of the words in the dictionary are propagated to the words that did not exist in the training dataset. Through two experiments, we demonstrate that the performance of the proposed method is comparable to that of supervised learning models trained on large datasets.
机译:情感分析在了解在社交媒体和产品审查网站等网站中的个人观点中发挥着重要作用。情绪分析的常见方法使用表达意见的词语携带的情绪,并基于监督或无监督的学习技术。无监督的学习方法构建了一个词情词典,但它需要冗长的时间段和高成本来构建可靠的字典。监督学习方法使用机器学习模型来学习词语的情绪;但是,培训分类器模型需要大量标记的文本数据来实现良好的性能。在本文中,我们提出了一种半熟的方法,尽管只有少量标记的数据可用于培训,但仍能表现良好。所提出的方法使用基于卢斯的集合模型从小型训练数据集构建基本情绪词典,具有最小的人力努力。使用基于自适应实例的学习模型估计不在训练数据集中的单词的分数。在预先训练的Word2Vec模型空间中,字典中单词的情绪值传播到训练数据集中不存在的单词。通过两个实验,我们证明了所提出的方法的性能与大型数据集培训的监督学习模型的性能相当。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号