首页> 外文会议>9th International conference on language resources and evaluation >The USAGE review corpus for fine-grained, multi-lingual opinion analysis
【24h】

The USAGE review corpus for fine-grained, multi-lingual opinion analysis

机译:USAGE审查语料库,可进行细粒度的多语言意见分析

获取原文

摘要

Opinion mining has received wide attention in recent years. Models for this task are typically trained or evaluated with a manually annotated dataset. However, fine-grained annotation of sentiments including information about aspects and their evaluation is very labour-intensive. The data available so far is limited. Contributing to this situation, this paper describes the Bielefeld University Sentiment Analysis Corpus for German and English (USAGE), which we offer freely to the community and which contains the annotation of product reviews from Amazon with both aspects and subjective phrases. It provides information on segments in the text which denote an aspect or a subjective evaluative phrase which refers to the aspect. Relations and coreferences are explicitly annotated. This dataset contains 622 English and 611 German reviews, allowing to investigate how to port sentiment analysis systems across languages and domains. We describe the methodology how the corpus was created and provide statistics including inter-annotator agreement. We further provide figures for a baseline system and results for German and English as well as in a cross-domain setting. The results are encouraging in that they show that aspects and phrases can be extracted robustly without the need of tuning to a particular type of products.
机译:近年来,观点挖掘已受到广泛关注。通常使用人工注释的数据集来训练或评估用于此任务的模型。但是,对情感进行细粒度的注释(包括有关方面及其评估的信息)非常费力。到目前为止,可用数据有限。针对这种情况,本文介绍了比勒费尔德大学德语和英语情感分析语料库(USAGE),我们向社区免费提供该语料库,其中包含来自亚马逊的产品评论注释,包括方面和主观短语。它提供了有关文本中表示方面的信息或代表该方面的主观评估短语的信息。关系和共指被明确注释。该数据集包含622篇英语评论和611篇德国评论,可以研究如何跨语言和跨域移植情感分析系统。我们描述了语料库是如何创建的,并提供了包括注释者之间的协议在内的统计信息。我们还将提供基准系统的数字以及德语和英语以及跨域设置的结果。结果令人鼓舞,因为它们表明可以稳定地提取方面和短语,而无需调整为特定类型的产品。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号