...
首页> 外文期刊>Mathematical Problems in Engineering: Theory, Methods and Applications >A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
【24h】

A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs

机译:中国微博情感分类的自适应隐马尔可夫模型

获取原文
           

摘要

Microblogging is increasingly becoming one of the most popular online social media for people to express ideas and emotions. The amount of socially generated content from this medium is enormous. Text mining techniques have been intensively applied to discover the hidden knowledge and emotions from this huge dataset. In this paper,we propose a modified version of hidden Markov model (HMM) classifier, called self-adaptive HMM, whose parameters are optimized by Particle Swarm Optimization algorithms. Since manually labeling large-scale dataset is difficult, we also employ the entropy to decide whether a new unlabeled tweet shall be contained in the training dataset after being assigned an emotion using our HMM-based approach. In the experiment, we collected about 200,000 Chinese tweets from Sina Weibo. The results show that theF-score of our approach gets 76% on happiness and fear and 65% on anger, surprise, and sadness. In addition, the self-adaptive HMM classifier outperforms Naive Bayes and Support Vector Machine on recognition of happiness, anger, and sadness.
机译:微博正逐渐成为人们表达思想和情感的最受欢迎的在线社交媒体之一。从这种媒介社交产生的内容数量巨大。文本挖掘技术已被广泛应用于从这个庞大的数据集中发现隐藏的知识和情感。本文提出了一种改进的隐马尔可夫模型分类器,称为自适应HMM,其参数通过粒子群优化算法进行了优化。由于手动标记大型数据集很困难,因此我们还使用熵来确定使用基于HMM的方法分配了情感后,训练数据集中是否应包含新的未标记推文。在实验中,我们从新浪微博收集了约20万条中国推文。结果表明,我们的方法的F得分在幸福和恐惧中获得76%,在愤怒,惊奇和悲伤中获得65%。此外,自适应HMM分类器在识别幸福,愤怒和悲伤方面优于朴素贝叶斯和支持向量机。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号