首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >SSAM: toward Supervised Sentiment and Aspect Modeling on different levels of labeling
【24h】

SSAM: toward Supervised Sentiment and Aspect Modeling on different levels of labeling

机译:SSAM:对不同级别的标签的监督情绪和方面建模

获取原文
获取原文并翻译 | 示例
           

摘要

In recent years, people want to express their opinion on every online service or product, and there are now a huge number of opinions on the social media, online stores and blogs. However, most of the opinions are presented in plain text and thus require a powerful method to analyze this volume of unlabeled reviews to obtain information about relevant details in minimum time and with a high accuracy. In this paper, we propose a supervised model to analyze large unlabeled opinion data sets. This model has two phases: preprocessing and a Supervised Sentiment and Aspect Model (SSAM) which is an extended version of Latent Dirichlet Allocation Model. In the preprocessing phase, we input thousands of unlabeled opinions and received a set of (key, value) pairs in which a key holds a word or an opinion and a value holds supervised information such as a sentiment label of this word or opinion. After that we give these pairs to the proposed SSAM algorithm, which incorporates different levels of supervised information such as (document and sentence) levels or (document and term) levels of supervised information, to extract and cluster aspects related to a sentiment label and also classify opinions based on their sentiments. We applied SSAM to reviews of electronic devices and books from Amazon. The experiments show that the aspects found by SSAM capture more important aspects that are closely coupled with a sentiment label, and also in sentiment classification SSAM outperforms other topic models and comes close to supervised methods.
机译:近年来,人们希望对每个在线服务或产品表达他们的意见,现在对社交媒体,网上商店和博客有很多意见。但是,大多数意见都以纯文本呈现,因此需要强大的方法来分析这一体积的未标记的审查,以便在最短时间和高精度中获取有关相关细节的信息。在本文中,我们提出了一个监督模型来分析大型未标记的意见数据集。该模型具有两个阶段:预处理和监督情绪和方面模型(SSAM),它是潜在的Dirichlet分配模型的扩展版本。在预处理阶段,我们输入了数千个未标记的意见,并收到了一组(键,值)对,其中一个密钥保持一个单词或意见,并且值持有受监管信息,例如这个词或意见的情绪标签。之后我们将这些对提供给所提出的SSAM算法,该算法包含不同级别的监督信息,例如(文件和句子)级别或(文件和术语)水平的监督信息,以提取与情绪标签相关的群集方面根据他们的情绪分类意见。我们应用SSAM从亚马逊的电子设备和书籍的评论。实验表明,SSAM发现的方面捕获了与情绪标签紧密耦合的更重要的方面,并且在情绪分类SSAM中优于其他主题模型,并靠近监督方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号