首页> 外文会议>ACM international conference on information and knowledge management >On the Design of LDA Models for Aspect-based Opinion Mining
【24h】

On the Design of LDA Models for Aspect-based Opinion Mining

机译:基于方面的观点挖掘的LDA模型设计

获取原文

摘要

Aspect-based opinion mining, which aims to extract aspects and their corresponding ratings from customers reviews, provides very useful information for customers to make purchase decisions. In the past few years several probabilistic graphical models have been proposed to address this problem, most of them based on Latent Dirichlet Allocation (LDA). While these models have a lot in common, there are some characteristics that distinguish them from each other. These fundamental differences correspond to major decisions that have been made in the design of the LDA models. While research papers typically claim that a new model outperforms the existing ones, there is normally no "one-size-fits-all" model. In this paper, we present a set of design guidelines for aspect-based opinion mining by discussing a series of increasingly sophisticated LDA models. We argue that these models represent the essence of the major published methods and allow us to distinguish the impact of various design decisions. We conduct extensive experiments on a very large real life dataset from Epinions.com (500K reviews) and compare the performance of different models in terms of the likelihood of the held-out test set and in terms of the accuracy of aspect identification and rating prediction.
机译:基于方面的观点挖掘旨在从客户评论中提取方面及其相应的评分,为客户做出购买决策提供非常有用的信息。在过去的几年中,已经提出了几种概率图形模型来解决该问题,其中大多数是基于潜在狄利克雷分配(LDA)的。虽然这些模型有很多共同点,但是有一些特征使它们彼此区别。这些基本差异对应于LDA模型设计中已做出的主要决策。尽管研究论文通常声称新模型要优于现有模型,但通常没有“一刀切”的模型。在本文中,我们通过讨论一系列日益复杂的LDA模型,提出了一套基于方面的观点挖掘的设计准则。我们认为这些模型代表了主要公开方法的本质,并允许我们区分各种设计决策的影响。我们对Epinions.com上的一个非常大的现实数据集进行了广泛的实验(500K条评论),并根据保留的测试集的可能性以及方面识别和评级预测的准确性来比较不同模型的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号