首页> 外文会议>ACM international conference on information and knowledge management >On the Design of LDA Models for Aspect-based Opinion Mining
【24h】

On the Design of LDA Models for Aspect-based Opinion Mining

机译:基于宽高的思想矿业LDA模型设计

获取原文

摘要

Aspect-based opinion mining, which aims to extract aspects and their corresponding ratings from customers reviews, provides very useful information for customers to make purchase decisions. In the past few years several probabilistic graphical models have been proposed to address this problem, most of them based on Latent Dirichlet Allocation (LDA). While these models have a lot in common, there are some characteristics that distinguish them from each other. These fundamental differences correspond to major decisions that have been made in the design of the LDA models. While research papers typically claim that a new model outperforms the existing ones, there is normally no "one-size-fits-all" model. In this paper, we present a set of design guidelines for aspect-based opinion mining by discussing a series of increasingly sophisticated LDA models. We argue that these models represent the essence of the major published methods and allow us to distinguish the impact of various design decisions. We conduct extensive experiments on a very large real life dataset from Epinions.com (500K reviews) and compare the performance of different models in terms of the likelihood of the held-out test set and in terms of the accuracy of aspect identification and rating prediction.
机译:基于Aspect的意见挖掘,其目的是提取方面和客户评论其相应的收视率,为客户提供了做出购买决策非常有用的信息。在过去的几年中几个概率图模型已经被提出来解决这个问题,他们大多基于潜在狄利克雷分配(LDA)。虽然这些模型有很多共同点,也有一些特点使其区别于对方。这些根本的差异对应于已经在LDA模型的设计已取得重大决策。虽然研究论文通常声称,一个新的模式优于现有的,通常没有“一个尺寸适合所有人”的模式。在本文中,我们通过讨论一系列日益复杂的LDA模型的提出一套基于方面,意见挖掘设计指南。我们认为,这些模型代表的主要发布方法的实质内容,让我们能够区分各种设计决策的影响。我们进行一个非常大的现实生活中的数据集大量的实验从Epinions.com(500K评论)和保留检验组的可能性方面和纵横鉴定和评级预测的准确度方面比较不同车型的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号