【24h】

Sentiment analysis for Arabizi text

机译:阿拉伯朱杉中文本的情感分析

获取原文

摘要

This paper has used supervised learning to assign sentiment or polarity labels to tweets written in Arabizi. Arabizi is a form of writing Arabic text which relies on using Latin letters rather than Arabic letters. This form of writing is common with the Arab youth. A rule-based converter was designed and applied on the tweets to convert them from Arabizi to Arabic. Subsequently, the resultant tweets were annotated with their respective sentiment labels using crowdsourcing. This ArabiziDataset consists of 3206 tweets. Results obtained by this work reveal that SVM accuracies are higher than Naive Bayes accuracies. Secondly, removal of stopwords and mapping emoticons to their corresponding words did not greatly improve the accuracies for Arabizi data. Thirdly, eliminating neutral tweets at early stage in the classification improves Precision for both Naive Bayes and SVM. However, Recall values fluctuated, sometimes they got improved; on other times they did not improve.
机译:本文使用了监督学习,将情绪或极性标签分配给raprizi编写的推文。 Arabizi是一种编写阿拉伯语文本的形式,它依赖于使用拉丁语字母而不是阿拉伯语字母。这种形式与阿拉伯青年常见。基于规则的转换器设计并应用于推文上,以将它们从阿拉伯人转换为阿拉伯语。随后,使用众包用它们各自的情绪标签用它们的各自情绪标签进行注释。此raveridataset由3206推文组成。通过这项工作获得的结果表明SVM精度高于幼稚贝叶斯精度。其次,将止动件和映射表情界定到其对应的单词中的映射并没有大大提高阿拉伯提升数据的准确性。第三,在分类的早期阶段消除中性推特提高了幼稚贝叶斯和SVM的精度。但是,回忆值波动,有时它们得到改善;在其他时候,他们没有改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号