首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >A Sentiment Classification Model Using Group Characteristics of Writing Style Features
【24h】

A Sentiment Classification Model Using Group Characteristics of Writing Style Features

机译:利用写作风格特征的群体特征建立情感分类模型

获取原文
获取原文并翻译 | 示例
       

摘要

Sentiment analysis is becoming increasingly important mainly because of the growth of web comments. Sentiment polarity classification is a popular process in this field. Writing style features, such as lexical and word-based features, are often used in the authorship identification and gender classification of online messages. However, writing style features were only used in feature selection for sentiment classification. This research presents an exploratory study of the group characteristics of writing style features on the Internet Movie Database (IMDb) movie sentiment data set. Furthermore, this study utilizes the specific group characteristics of writing style in improving the performance of sentiment classification. We determine the optimum clustering number of user reviews based on writing style features distribution. According to the classification model trained on a training subset with specific writing style clustering tags, we determine that the model trained on the data set of a specific writing style group has an optimal e r ect on the classification accuracy, which is better than the model trained on the entire data set in a particular positive or negative polarity. Through the polarity characteristics of specific writing style groups, we propose a general model in improving the performance of the existing classification approach. Results of the experiments on sentiment classification using the IMDb data set demonstrate that the proposed model improves the performance in terms of classification accuracy.
机译:情绪分析变得越来越重要,这主要是由于Web评论的增长。情感极性分类是该领域中流行的过程。写作风格功能(例如词汇和基于单词的功能)经常用于在线邮件的作者身份识别和性别分类。但是,写作风格特征仅用于情感分类的特征选择。这项研究提出了对互联网电影数据库(IMDb)电影情感数据集上的写作风格特征的群体特征的探索性研究。此外,本研究利用写作风格的特定群体特征来改善情感分类的性能。我们根据写作风格特征分布确定最佳的用户评论聚类数量。根据在具有特定写作风格聚类标签的训练子集上训练的分类模型,我们确定在特定写作风格组的数据集上训练的模型对分类精度有最佳影响,这比训练后的模型更好在整个数据集上具有特定的正极性或负极性。通过特定写作风格组的极性特征,我们提出了一个通用模型来改善现有分类方法的性能。使用IMDb数据集进行情感分类的实验结果表明,该模型在分类准确度方面提高了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号