...
首页> 外文期刊>Knowledge-Based Systems >Enriching semantic knowledge bases for opinion mining in big data applications
【24h】

Enriching semantic knowledge bases for opinion mining in big data applications

机译:丰富语义知识库,用于大数据应用中的意见挖掘

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper presents a novel method for contextualizing and enriching large semantic knowledge bases for opinion mining with a focus on Web intelligence platforms and other high-throughput big data applications. The method is not only applicable to traditional sentiment lexicons, but also to more comprehensive, multi-dimensional affective resources such as SenticNet. It comprises the following steps: (ⅰ) identify ambiguous sentiment terms, (ⅱ) provide context information extracted from a domain-specific training corpus, and (ⅲ) ground this contextual information to structured background knowledge sources such as ConceptNet and WordNet. A quantitative evaluation shows a significant improvement when using an enriched version of SenticNet for polarity classification. Crowdsourced gold standard data in conjunction with a qualitative evaluation sheds light on the strengths and weaknesses of the concept grounding, and on the quality of the enrichment process.
机译:本文提出了一种新颖的方法,用于集中化和丰富大型语义知识库以进行意见挖掘,重点是Web智能平台和其他高吞吐量大数据应用程序。该方法不仅适用于传统情感词典,还适用于更全面的多维情感资源,例如SenticNet。它包括以下步骤:(ⅰ)识别不明确的情感术语,(ⅱ)提供从特定领域的训练语料库中提取的上下文信息,并且(ⅲ)将此上下文信息基于诸如ConceptNet和WordNet之类的结构化背景知识源。当使用丰富版本的SenticNet进行极性分类时,定量评估显示出显着改进。众包的金标准数据与定性评估相结合,阐明了概念基础的优点和缺点以及浓缩过程的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号