首页> 外文会议>RIVF International Conference on Computing and Communication Technologies >Leveraging Foreign Language Labeled Data for Aspect-Based Opinion Mining
【24h】

Leveraging Foreign Language Labeled Data for Aspect-Based Opinion Mining

机译:利用外语标签数据进行基于方面的意见挖掘

获取原文

摘要

Aspect-based opinion mining is the task of identifying sentiment at the aspect level in opinionated text, which consists of two subtasks: aspect category extraction and sentiment polarity classification. While aspect category extraction aims to detect and categorize opinion targets such as product features, sentiment polarity classification assigns a sentiment label, i.e. positive, negative, or neutral, to each identified aspect. Supervised learning methods have been shown to deliver better accuracy for this task but they require labeled data, which is costly to obtain, especially for resource-poor languages like Vietnamese. To address this problem, we present a supervised aspect-based opinion mining method that utilizes labeled data from a foreign language (English in this case), which is translated to Vietnamese by an automated translation tool (Google Translate). Because aspects and opinions in different languages may be expressed by different words, we propose using word embeddings, in addition to other features, to reduce the vocabulary difference between the original and translated texts, thus improving the effectiveness of aspect category extraction and sentiment polarity classification processes. We also introduce an annotated corpus of aspect categories and sentiment polarities extracted from restaurant reviews in Vietnamese, and conduct a series of experiments on the corpus. Experimental results demonstrate the effectiveness of the proposed approach.
机译:基于方面的观点挖掘是在有观点的文本中在方面级别上识别情感的任务,该任务包括两个子任务:方面类别提取和情感极性分类。方面类别提取旨在检测并分类意见目标(例如产品功能),而情感极性分类则将情感标签(即正面,负面或中立)分配给每个已识别的方面。有监督的学习方法已经显示出可以更好地完成此任务的准确性,但是它们需要标记的数据,这是昂贵的,尤其是对于越南这样的资源匮乏的语言而言。为了解决这个问题,我们提出了一种基于方面的监督式观点挖掘方法,该方法利用来自外语(在这种情况下为英语)的带标签数据,通过自动翻译工具(Google翻译)将其翻译成越南语。由于不同语言中的方面和观点可能由不同的单词表达,因此我们建议使用单词嵌入功能以及其他功能来减少原始文本和翻译文本之间的词汇差异,从而提高方面类别提取和情感极性分类的有效性流程。我们还介绍了从越南餐馆评论中提取的带有方面类别和情感极性的带注释的语料库,并对该语料库进行了一系列实验。实验结果证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号