...
首页> 外文期刊>Annals. Computer Science Series >A Predictive Model for Tweet Sentiment Analysis and Classification
【24h】

A Predictive Model for Tweet Sentiment Analysis and Classification

机译:推特情感分析和分类的预测模型

获取原文

摘要

Sentiment analysis over Twitter offers organisations and users a fast and effective way to monitor publics’ feelings towards events especially during crises, hence, motivated much work on twitter data. In this study, predictions on positive, negative and neutral sentiment based on security are analysed. A polarity classification of tweet messages was done with VADER algorithm considering contextual analysis. The analysis was performed by removing stop words in the tweet along with Wordnet lemmatiser for the morphological analysis of words in the features sets. As well as subjected to word sense disambiguation to consider contextual usages of words using a path length corpus based lexicon. Term Frequency (TF) and Term Frequency Inverse Document Frequency (TFIDF) are used as feature extraction from tweets and evaluation of the features reduction was carried out by calculating the accuracy of the predictions on sentiment and tweet messages with Chi-Square to explore the possibly useful features. Finally, validations are done with machine learning models at different sequence to compare the performance between each model.
机译:通过Twitter进行的情绪分析为组织和用户提供了一种快速有效的方法来监视公众对事件的感觉,尤其是在危机期间,因此激发了有关Twitter数据的大量工作。在这项研究中,分析了基于安全性对积极,消极和中立情绪的预测。考虑到上下文分析,使用VADER算法对推文消息进行了极性分类。分析是通过删除推文中的停用词以及Wordnet lemmatiser进行的,以对功能集中的词语进行形态分析。除了使用基于路径长度的语料库的词义消除歧义以考虑词的上下文用法。术语频率(TF)和术语频率逆文档频率(TFIDF)用作推文的特征提取,并通过使用Chi-Square计算情感和推文消息的预测准确性来进行特征缩减的评估,以探索可能的有用的功能。最后,使用不同顺序的机器学习模型进行验证,以比较每个模型之间的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号