A Multi-lingual Annotated Dataset for Aspect-Oriented Opinion Mining

机译：面向方面的观点挖掘的多语言注释数据集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present the Trip-MAML dataset, a Multi-Lingual dataset of hotel reviews that have been manually annotated at the sentence-level with Multi-Aspect sentiment labels. This dataset has been built as an extension of an existent English-only dataset, adding documents written in Italian and Spanish. We detail the dataset construction process, covering the data gathering, selection, and annotation. We present inter-annotator agreement figures and baseline experimental results, comparing the three languages. Trip-MAML is a multi-lingual dataset for aspect-oriented opinion mining that enables researchers (ⅰ) to face the problem on languages other than English and (ⅱ) to the experiment the application of cross-lingual learning methods to the task.

机译：我们介绍了Trip-MAML数据集，这是一个酒店评论的多语言数据集，该数据集已在句子级别使用多方面情感标签进行了手动注释。该数据集已被构建为现有的仅英语数据集的扩展，增加了以意大利语和西班牙语编写的文档。我们详细介绍了数据集的构建过程，涵盖了数据收集，选择和注释。我们提供了注释者之间的协议数字和基准实验结果，比较了这三种语言。 Trip-MAML是面向方面的观点挖掘的多语言数据集，使研究人员（ⅰ）可以使用英语以外的其他语言来面对问题，并且（ⅱ）可以尝试将跨语言学习方法应用于任务。

著录项

来源
《Conference on empirical methods in natural language processing》|2015年|2533-2538|共6页
会议地点
作者
Salud Maria Jimenez Zafra; Giacomo Berardi; Andrea Esuli; Diego Marcheggiani; Maria Teresa Martin-Valdivia; Alejandro Moreo Fernandez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Multi-lingual opinion mining on YouTube [J] . Aliaksei Severyn, Alessandro Moschitti, Olga Uryupina, Information Processing & Management . 2016,第1期

机译：YouTube上的多语言意见挖掘
2. Employing a data mining approach for identification of mobile opinion leaders and their content usage patterns in large telecommunications datasets [J] . Chen Chih Ping, Weng Ju-Yin, Yang Chin-Sheng, Technological forecasting and social change . 2018,第MAY期

机译：采用数据挖掘方法来识别大型电信数据集中的移动意见领袖及其内容使用模式
3. A NOVEL MODEL FOR TEXT DOCUMENT REPRESENTATION: APPLICATION ON OPINION MINING DATASETS [J] . ASMAA MOUNTASSIR, HOUDA BENBRAHIM, ILHAM BERRADA Journal of computer science engineering and information technology research . 2014,第2期

机译：文本文档表示的新模型：在意见挖掘数据集上的应用
4. A Multi-lingual Annotated Dataset for Aspect-Oriented Opinion Mining [C] . Salud Maria Jimenez Zafra, Giacomo Berardi, Andrea Esuli, Conference on empirical methods in natural language processing . 2015

机译：面向方面意见采矿的多语言注释数据集
5. Mining massive moving object datasets from RFID flow analysis to traffic mining [D] . Gonzalez, Hector 2008

机译：从RFID流量分析到流量挖掘，挖掘海量移动物体数据集
6. Annotated real and synthetic datasets for non-invasive foetal electrocardiography post-processing benchmarking [O] . Giulia Baldazzi, Eleonora Sulas, Monica Urru, 2020

机译：用于非侵入性胎儿心电图后处理基准的注释真实和合成数据集
7. A Multi-lingual Annotated Dataset for Aspect-Oriented Opinion Mining [O] . Salud M. Jiménez-Zafra, Giacomo Berardi, Andrea Esuli, 2015

机译：面向方面意见采矿的多语言注释数据集

A Multi-lingual Annotated Dataset for Aspect-Oriented Opinion Mining

摘要

著录项

相似文献

相关主题

期刊订阅