【24h】

PREPROCESSING ARABIC DIALECT FOR SENTIMENT MINING: STATE OF ART

机译:预处理的阿拉伯语方言为情绪挖掘:艺术状态

获取原文
           

摘要

Sentiment Analysis concerns the analysis of ideas, emotions, evaluations, values, attitudes and feelings about products, services, companies, individuals, tasks, events, titles and their characteristics. With the increase in applications on the Internet and social networks, Sentiment Analysis has become more crucial in the field of text mining research and has since been used to explore users’ opinions on various products or topics discussed on the Internet. Developments in the fields of Natural Language Processing and Computational Linguistics have contributed positively to Sentiment Analysis studies, especially for sentiments written in non-structured or semi-structured languages. In this paper, we present a literature review on the pre-processing task on the field of sentiment analysis and an analytical and comparative study of different researches conducted in Arabic social networks. This study allowed as concluding that several works have dealt with the generation of stop words dictionary. In this context, two approaches are adopted: first, the manual one, which gives rise to a limited list, and second, the automatic, where the list of stop words is extracted from social networks based on defined rules. For stemming two, algorithms have been proposed to isolate prefixes and suffixes from words in dialects. However, few works have been interested in dialects directly without translation. The Moroccan dialect in particular is considered as the 5th dialect studied among Arabic dialects after Jordanian, Egyptian, Tunisian and Algerian dialects. Despite the significant lack in studies carried out on Arabic dialects, we were able to extract several conclusions about the difficulties and challenges encountered through this comparative study, as well as the possible ways and tracks to study in any dialects sentiment analysis pre-processing solution.
机译:情绪分析涉及分析产品,服务,公司,个人,任务,活动,标题及其特征的产品,服务,公司,个人,任务,事件,标题及其特征的思想,情感,评估,价值观,态度,态度,态度。随着互联网和社交网络的应用程序增加,情绪分析在文本挖掘研究领域变得更加重要,因为已被用于探索用户对互联网上讨论的各种产品或主题的意见。自然语言处理和计算语言学领域的发展对情绪分析研究产生了积极的贡献,特别是对于以非结构化或半结构化语方式编写的情绪。在本文中,我们对海洋社交网络中的不同研究进行了对情感分析的预处理任务的文献综述,以及在阿拉伯社会网络中进行的不同研究。这项研究允许得出结论,即几项工程处理了止损字典的产生。在此上下文中,采用了两种方法:首先,手动一个,它引起限制列表,第二个是自动的,其中基于定义的规则从社交网络中提取停止单词列表。对于STEMMING SIZ2,已提出算法将前缀和后缀从方言中的单词隔离。但是,很少有效在没有翻译的情况下直接对方言感兴趣。特别是摩洛哥方言特别被认为是约旦,埃及,突尼斯和阿尔及利亚方言之后阿拉伯语方言研究的第五方言。尽管在阿拉伯语方言进行的研究中缺乏巨大缺乏,我们能够提取关于通过这一比较研究遇到的困难和挑战的若干结论,以及在任何方言情绪分析预处理解决方案中进行研究的可能方法和轨道。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号