首页> 外文会议>International Conference on Advanced Intelligent Systems and Informatics >Pre-trained Word Embeddings for Arabic Aspect-Based Sentiment Analysis of Airline Tweets
【24h】

Pre-trained Word Embeddings for Arabic Aspect-Based Sentiment Analysis of Airline Tweets

机译:用于阿拉伯语基于方面的媒体的预先训练的Word Embeddings的航空公司推文

获取原文

摘要

Recently, the use of word embeddings has become one of the most significant advancements in natural language processing (NLP). In this paper, we compared two word embedding models for aspect-based sentiment analysis (ABSA) of Arabic tweets. The ABSA problem was formulated as a two step process of aspect detection followed by sentiment polarity classification of the detected aspects. The compared embeddings models include fastText Arabic Wikipedia and AraVec-Web, both available as pre-trained models. Our corpus consisted of 5K airline service related tweets in Arabic, manually labeled for ABSA with imbalanced aspect categories. For classification, we used a support vector machine classifier for both, aspect detection, and sentiment polarity classification. Our results indicated that fastText Arabic Wikipedia word embeddings performed slightly better than AraVec-Web.
机译:最近,Word Embeddings的使用已成为自然语言处理(NLP)中最重要的进步之一。 在本文中,我们比较了阿拉伯语推文的基于方面情感分析(ABSA)的两个字嵌入模型。 将ABSA问题标配制为方面检测的两个步骤过程,然后是检测到的方面的情感分类。 比较的嵌入式模型包括FastText阿拉伯维披巾和Aravec-Web,既可用作预先训练的型号。 我们的语料库由5K航空服务相关的Tweets,以阿拉伯语,手动标记为ABSA,具有不平衡的宽容类别。 对于分类,我们使用了支持向量机分类器,用于两者,方向检测和情感极性分类。 我们的结果表明,FastText阿拉伯维基亚语Word Embeddings略微比Aravec-Web更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号