首页> 外文会议>International Symposium on Dependable Computing and Internet of Things >Aspect Extraction in Product Reviews via an Improved Unsupervised Method
【24h】

Aspect Extraction in Product Reviews via an Improved Unsupervised Method

机译:通过改进的无监督方法,在产品评论中提取产品评论

获取原文

摘要

This paper studied on aspects extraction from product reviews by unsupervised topic model, which is an important subtask of opinion mining. The topic distribution of topic model, such as LDA, leans to the high-frequency words since the words in the document comply with the characteristics of power law distribution, which leads to that most of the words that can represent topics are overwhelmed by a small number of high-frequency words, and consequently, the topic expressive ability is reduced. To solve these problems, an unsupervised method is proposed by us in this paper, and on the basis of Sentence-LDA topic model, a new Unsupervised Weighted LDA model (UW-LDA) based on the weighted topic model is obtained through weighting on feature words by a improved Gaussian function. Finally, the experiments on two aspects of the review corpus of several products from different domains show that our proposed model has made a satisfactory result.
机译:本文研究了由无人监督的主题模型提取产品评论的方面,这是意见矿业的重要子任务。主题模型的主题分布,如LDA,因为文档中的单词符合电力法分布的特征,因此符合电力法分布的特征,这导致了可以代表主题的大多数单词被一个小型淹没高频词数,因此,主题表达能力减少。为了解决这些问题,我们在本文中提出了一种无人监督的方法,并在句子-LDA主题模型的基础上,通过对特征进行加权来获得基于加权主题模型的新的无监督加权LDA模型(UW-LDA)通过改进的高斯函数来单词。最后,关于来自不同域的几种产品的审查语料库的两个方面的实验表明,我们的拟议模型取得了令人满意的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号