Conference: International Workshop on Computational Processing of the Portuguese Language

Characterizing Opinion Mining: A Systematic Mapping Study of the Portuguese Language



Abstract

The growth of social media and user-generated content (UGC) on the Internet provides a huge quantity of information from which the experiences, opinions, and feelings of users or customers can be discovered. Opinion Mining (OM) is a sub-field of text mining whose main task is to extract opinions from UGC. Given that Portuguese is one of the most widely spoken languages in the world and the second most frequent on Twitter, the goal of this work is to map the landscape of current studies that apply OM to Portuguese. A systematic mapping review (SMR) method was applied to search for, select, and extract data from the included studies. Manual and automated searches retrieved 6075 studies up to the year 2014, of which 25 articles were included. Almost 70% of all approaches focus on the Brazilian Portuguese variant. Naive Bayes and Support Vector Machine were the main classifiers, and SentiLex-PT was the most used lexical resource. Portugal and Brazil are the main contributors to processing the Portuguese language.
