首页> 外文期刊>Computer speech and language >SAMAR: Subjectivity and sentiment analysis for Arabic social media
【24h】

SAMAR: Subjectivity and sentiment analysis for Arabic social media

机译:SAMAR:阿拉伯社交媒体的主观性和情感分析

获取原文
获取原文并翻译 | 示例

摘要

SAMAR is a system for subjectivity and sentiment analysis (SSA) for Arabic social media genres. Arabic is a morphologically rich language, which presents significant complexities for standard approaches to building SSA systems designed for the English language. Apart from the difficulties presented by the social media genres processing, the Arabic language inherently has a high number of variable word forms leading to data sparsity. In this context, we address the following 4 pertinent issues: how to best represent lexical information; whether standard features used for English are useful for Arabic; how to handle Arabic dialects; and, whether genre specific features have a measurable impact on performance. Our results show that using either lemma or lexeme information is helpful, as well as using the two part of speech tagsets (RTS and ERTS). However, the results show that we need individualized solutions for each genre and task, but that lemmatization and the ERTS POS tagset are present in a majority of the settings.
机译:SAMAR是阿拉伯社交媒体类型的主观性和情感分析(SSA)系统。阿拉伯语是一种形态丰富的语言,它为构建为英语设计的SSA系统的标准方法带来了极大的复杂性。除了社交媒体体裁处理带来的困难外,阿拉伯语固有地具有大量可变单词形式,从而导致数据稀疏。在这种情况下,我们解决以下四个相关问题:如何最好地表示词汇信息;用于英语的标准功能是否对阿拉伯语有用;如何处理阿拉伯语;以及特定类型的功能是否会对性能产生可衡量的影响。我们的结果表明,使用引理或词素信息以及使用语音标记集的两个部分(RTS和ERTS)都是有帮助的。但是,结果表明,我们需要为每种体裁和任务提供个性化的解决方案,但是在大多数设置中都存在去词性化和ERTS POS标签集。

著录项

  • 来源
    《Computer speech and language》 |2014年第1期|20-37|共18页
  • 作者单位

    Department of Linguistics, Indiana University, 1021 E 3rd. St., Bloomington, IN 47405, USA,School of Library and Information Science, 1320 East 10th Street, Bloomington, IN 47405, USA;

    Department of Computer Science, School of Engineering & Applied Science, The George Washington University, Washington, DC, USA;

    Department of Linguistics, Indiana University, 1021 E 3rd. St., Bloomington, IN 47405, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Subjectivity and sentiment analysis; Morphologically rich language; Arabic; Social media data;

    机译:主观性和情感分析;形态丰富的语言;阿拉伯;社交媒体数据;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号