首页> 外文会议>9th International conference on language resources and evaluation >The USAGE review corpus for fine-grained, multi-lingual opinion analysis
【24h】

The USAGE review corpus for fine-grained, multi-lingual opinion analysis

机译:用于细粒度,多语言意见分析的使用审查语料库

获取原文

摘要

Opinion mining has received wide attention in recent years. Models for this task are typically trained or evaluated with a manually annotated dataset. However, fine-grained annotation of sentiments including information about aspects and their evaluation is very labour-intensive. The data available so far is limited. Contributing to this situation, this paper describes the Bielefeld University Sentiment Analysis Corpus for German and English (USAGE), which we offer freely to the community and which contains the annotation of product reviews from Amazon with both aspects and subjective phrases. It provides information on segments in the text which denote an aspect or a subjective evaluative phrase which refers to the aspect. Relations and coreferences are explicitly annotated. This dataset contains 622 English and 611 German reviews, allowing to investigate how to port sentiment analysis systems across languages and domains. We describe the methodology how the corpus was created and provide statistics including inter-annotator agreement. We further provide figures for a baseline system and results for German and English as well as in a cross-domain setting. The results are encouraging in that they show that aspects and phrases can be extracted robustly without the need of tuning to a particular type of products.
机译:意见挖掘已经得到了广泛的重视,近年来。此任务类型的典型培训或评估与手动注释数据集。然而,情绪包括有关的方面和他们的评价信息的细粒度的注释是劳动密集型的。提供的数据,到目前为止是有限的。造成这种情况,本文介绍了德语和英语(使用),这是我们自由的社会,其中包含来自亚马逊的产品评论与这两个方面和主观短语注释提供比勒费尔德大学情感分析语料库。它提供了在其中表示一个方面或一种主观评价短语是指一个方面的文本段的信息。关系和coreferences明确注释。此数据集包含622英语和德语611个评测,让研究如何跨越语言和域名端口情感分析系统。我们描述胼是如何创建的方法,并提供统计数据,包括-注释间协议。我们还提供了一种基线系统和数字结果德语和英语,以及在跨域设置。结果是令人鼓舞的,因为它们表明方面和短语可以稳健地抽出,而没有特定类型的产品需要调整的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号