首页> 外文会议>International Conference on Audio, Language and Image Processing >Hybrid model based sentiment classification of Chinese micro-blog
【24h】

Hybrid model based sentiment classification of Chinese micro-blog

机译:基于混合模型的中国微博情感分类

获取原文

摘要

Through analysis and study of emotional characteristics in Chinese micro-blog, such as Sina Weibo, this paper proposed a multidimensional sentiment classification method based on micro-blog emoticon by dividing micro-blog into 7 types of emotions categories: happiness, fondness, sorrow, anger, fear, detestation and surprise. We used predefined micro-blog emoticon sets to initial screen large-scale unmarked data, and automatically labeled them, then used this emotional corpus as training set to train the emotion classifier, which divided micro-blog data into multiple emotion categories. The experimental results show that accuracy rate of using unigram model for each class can reach 63.7%. And the adoption of different feature selection methods for Support Vector Machines and Naive Bias classifier experiment, by which the obtained accuracy rate and recall rate has reached higher than 71%.
机译:通过分析和研究中国微博的情绪特征,如新浪微博,本文提出了一种基于微博学表情的多维情绪分类方法,通过将微博客分成7种类型的情绪类别:幸福,喜爱,悲伤,愤怒,恐惧,厌恶和惊喜。我们使用预定义的微博表情符号集到初始屏幕大规模未标记的数据,并自动标记为它们,然后使用这种情绪语料库作为培训设置,以培训情绪分类器,将微博数据分为多种情感类别。实验结果表明,每个班级使用UNIGRAM模型的准确率可达63.7%。并采用不同的特征选择方法,用于支持向量机和天真偏置分类器实验,通过该方法获得的精度率和召回率达到高于71%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号