Opinion and entity mining on web content.

Abstract

One important type of information on the Web is the opinions expressed in user-generated content, e.g., customer reviews of products, forum posts, and blogs. We study the problem of determining the semantic orientation (positive, negative, or neutral) of opinions expressed on product features in reviews. We propose a holistic lexicon-based approach that solves the problem by exploiting external evidence and the linguistic conventions of natural language expressions. This approach allows the system to handle context-dependent opinion words, which cause major difficulties for existing algorithms.

This thesis also addresses the problem of determining which entities each sentence talks about. If a sentence contains product names, they need to be identified; we call this problem entity discovery. If the product names are not explicitly mentioned in the sentence but are implied through pronouns and language conventions, the products need to be inferred; we call this problem entity assignment. We propose two effective methods to solve these problems: entity discovery is based on pattern discovery, and entity assignment is based on mining comparative sentences. We also discuss another project on object and attribute coreference resolution, and show that some important opinion-related features can be exploited to perform that task more accurately. Experimental results using blog posts demonstrate the effectiveness of the technique.
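To make the idea of lexicon-based orientation concrete, the following is a minimal sketch, not the method of the thesis. It assumes a tiny hand-written lexicon (`OPINION_LEXICON`), a simple negation rule, and a distance-weighted sum of opinion-word polarities around a product feature; all names and the weighting scheme are hypothetical choices made for illustration. It does not handle the context-dependent opinion words that the abstract highlights.

```python
import re

# Hypothetical toy lexicon (assumption): +1 for positive words, -1 for negative.
OPINION_LEXICON = {
    "good": 1, "great": 1, "excellent": 1,
    "bad": -1, "poor": -1, "terrible": -1,
}
# Hypothetical negation cues (assumption): flip the polarity of the next word.
NEGATIONS = {"not", "no", "never"}


def tokenize(sentence):
    """Lowercase the sentence and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def feature_orientation(sentence, feature):
    """Return 'positive', 'negative', or 'neutral' for `feature` in `sentence`.

    Each opinion word contributes its polarity divided by its token distance
    to the feature, so nearer opinion words count more (an assumed weighting).
    """
    tokens = tokenize(sentence)
    if feature not in tokens:
        return "neutral"
    feature_index = tokens.index(feature)

    score = 0.0
    for i, token in enumerate(tokens):
        polarity = OPINION_LEXICON.get(token)
        if polarity is None or i == feature_index:
            continue
        # Flip polarity when the opinion word directly follows a negation cue.
        if i > 0 and tokens[i - 1] in NEGATIONS:
            polarity = -polarity
        score += polarity / abs(i - feature_index)

    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    review = "The battery is great but the screen is terrible."
    print(feature_orientation(review, "battery"))
    print(feature_orientation(review, "screen"))
```

In this sketch, a sentence that mentions two features with opposite opinions gets a separate orientation for each feature, because the opinion word closest to a feature dominates its score.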

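Bibliographic record

  • Author

    Ding, Xiaowen

  • Author affiliation

    University of Illinois at Chicago

  • Degree-granting institution: University of Illinois at Chicago
  • Subject: Computer Science
  • Degree: Ph.D.
  • Year: 2010
  • Pagination: 124 p.
  • Total pages: 124
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification: Remote sensing technology
  • Keywords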
