首页> 外文会议>IEEE/ACS International Conference on Computer Systems and Applications >Sentiment Analysis for Arabic Text using Ensemble Learning
【24h】

Sentiment Analysis for Arabic Text using Ensemble Learning

机译:使用集成学习对阿拉伯语文本进行情感分析

获取原文

摘要

In this paper, an ensemble of machine learning classifiers approach is used to classify the sentiment polarity of Arabic text. This approach is based on the majority voting algorithm in conjunction with four classifiers, namely Naive Bayes, Support Vector Machines, Decision Trees and K-Nearest Neighbor algorithms. Four combinations of these classifiers are formed and three classifiers are chosen for each voting combination. The performance of each classifier is evaluated and compared to ensemble voting combination performance. Different experiments have been performed to evaluate unigram and bigram features. Three datasets with different sizes are used in our experiments. The first dataset contains 500 movie reviews, the second one contains 2000 Arabic tweets and the third one contains 16448 of Arabic book reviews. The experimental results show that the ensemble of the classifiers comparatively gives better results than individual classifiers. They also reveal that the support vector machine classifier outperforms the other individual classifiers. Moreover, the results of the bigram feature are better than the results of the unigram feature.
机译:本文采用机器学习分类器集成方法对阿拉伯文本的情感极性进行分类。该方法基于多数投票算法以及四个分类器,即朴素贝叶斯,支持向量机,决策树和K最近邻居算法。形成这些分类器的四个组合,并为每个投票组合选择三个分类器。评估每个分类器的性能,并将其与整体投票组合性能进行比较。已经执行了不同的实验来评估unigram和bigram特征。我们的实验中使用了三个大小不同的数据集。第一个数据集包含500条电影评论,第二个数据集包含2000条阿拉伯语推文,第三个数据集包含16448条阿拉伯语书评。实验结果表明,与单个分类器相比,分类器的集成给出了更好的结果。他们还揭示了支持向量机分类器优于其他单个分类器。此外,双字组特征的结果比单字组特征的结果更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号