首页> 外文会议>International Conference on Computer and Information Sciences >SSA-SDA: Subjectivity and Sentiment Analysis of Sudanese Dialect Arabic
【24h】

SSA-SDA: Subjectivity and Sentiment Analysis of Sudanese Dialect Arabic

机译:SSA-SDA:苏丹语方言阿拉伯语的主观性和情绪分析

获取原文

摘要

In the past few years, the Subjectivity and Sentiment Analysis (SSA) has garnered a lot of attention, and many unsuccessful attempts were made for developing an SSA model for the morphologically-rich languages like Arabic. In this study, we aimed to fill the gap by designing a new manual and auto-annotated corpus of the Sudanese-Dialect Arabic (SDA) along with a novel polarity lexicon. The corpus is a collection of political tweets, which were annotated by Twitter on a sentence-level. We have also described some automated, online SSA-tagging tools, which could explore all annotated data. Furthermore, we have investigated the effect of various pre-processing techniques on the Sudanese SSA process. Here, we presented a novel SSA technique for the Sudanese dialect, which displayed a good subjectivity classification result and showed an 83.5% accuracy for the Decision Tree performance.
机译:在过去的几年里,主体性和情感分析(SSA)已经获得了很多关注,并且为制定了许多不成功的尝试,用于为阿拉伯语等形态学的语言开发SSA模型。在这项研究中,我们旨在通过设计苏丹方言阿拉伯语(SDA)的新手动和自动注释的语料库以及新颖的极性词典来填补差距。语料库是一系列政治推文,由Twitter在句子级别注释。我们还描述了一些自动的在线SSA标记工具,可以探索所有注释数据。此外,我们研究了各种预处理技术对苏丹SSA过程的影响。在这里,我们为苏丹语方言提出了一种新的SSA技术,它显示出良好的主观性分类结果,并为决策树性能显示了83.5%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号