首页> 外文会议>International conference on pattern recognition >Sentiment analysis of Arabic tweets using text mining techniques
【24h】

Sentiment analysis of Arabic tweets using text mining techniques

机译:使用文本挖掘技术分析阿拉伯文推文的情感

获取原文

摘要

Sentiment analysis has become a flourishing field of text mining and natural language processing. Sentiment analysis aims to determine whether the text is written to express positive, negative, or neutral emotions about a certain domain. Most sentiment analysis researchers focus on English texts, with very limited resources available for other complex languages, such as Arabic. In this study, the target was to develop an initial model that performs satisfactorily and measures Arabic Twitter sentiment by using machine learning approach, Naieve Bayes and Decision Tree for classification algorithms. The datasets used contains more than 2,000 Arabic tweets collected from Twitter. We performed several experiments to check the performance of the two algorithms classifiers using different combinations of text-processing functions. We found that available facilities for Arabic text processing need to be made from scratch or improved to develop accurate classifiers. The small functionalities developed by us in a Python language environment helped improve the results and proved that sentiment analysis in the Arabic domain needs lot of work on the lexicon side.
机译:情感分析已成为文本挖掘和自然语言处理的蓬勃发展领域。情感分析旨在确定文字是否表达对某个领域的正面,负面或中性情绪。大多数情绪分析研究人员专注于英语文本,而其他复杂语言(如阿拉伯语)的可用资源却非常有限。在这项研究中,目标是通过使用机器学习方法,Naieve Bayes和决策树作为分类算法,开发出令人满意的性能并测量阿拉伯语Twitter情感的初始模型。所使用的数据集包含从Twitter收集的2,000多条阿拉伯语推文。我们执行了一些实验,以使用文本处理功能的不同组合来检查两个算法分类器的性能。我们发现,需要从头开始制作或改进阿拉伯语文本处理的可用工具,以开发准确的分类器。我们在Python语言环境中开发的小功能有助于改善结果,并证明阿拉伯语领域的情感分析需要在词典方面进行大量工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号