首页> 外文期刊>International journal of machine learning and cybernetics >Cross-domain sentiment aware word embeddings for review sentiment analysis
【24h】

Cross-domain sentiment aware word embeddings for review sentiment analysis

机译:跨域情绪意识到Word Embeddings进行审查情绪分析

获取原文
获取原文并翻译 | 示例
       

摘要

Learning low-dimensional vector representations of words from a large corpus is one of the basic tasks in natural language processing (NLP). The existing universal word embedding model learns word vectors mainly through grammar and semantic information from the context, while ignoring the sentiment information contained in the words. Some approaches, although they model sentiment information in the reviews, do not consider certain words in different domains. In a case where the emotion changes, if the general word vector is directly applied to the review sentiment analysis task, then this will inevitably affect the performance of the sentiment classification. To solve this problem, this paper extends the CBoW (continuous bag-of-words) word vector model and proposes a cross-domain sentiment aware word embedding learning model, which can capture the sentiment information and domain relevance of a word at the same time. This paper conducts several experiments on Amazon user review data in different domains to evaluate the performance of the model. The experimental results show that the proposed model can obtain a nearly 2% accuracy improvement compared with the general word vector when modeling only the sentiment information of the context. At the same time, when the domain information and the sentiment information are both included, the accuracy and Macro-F1 value of the sentiment classification tasks are significantly improved compared with existing sentiment word embeddings.
机译:学习大型语料库中的单词的低维矢量表示是自然语言处理中的基本任务之一(NLP)。现有的通用Word嵌入模型主要通过来自上下文的语法和语义信息来学习字向量,同时忽略单词中包含的情绪信息。一些方法,虽然它们在评论中模拟了情绪信息,但不要在不同域中考虑某些单词。在情绪变化的情况下,如果一般单词矢量直接应用于审查情绪分析任务,那么这将不可避免地影响情绪分类的性能。为了解决这个问题,本文延伸了Cow(连续袋)字矢量模型,并提出了一个跨域情绪意识的词嵌入学习模型,它可以同时捕获一个单词的情绪信息和域相关性。本文对亚马逊用户审查数据进行了多个实验,在不同的域中进行了评估模型的性能。实验结果表明,当仅在建模上下文的情感信息时,所提出的模型可以获得近2%的准确性改进。同时,当域信息和情绪信息都包括包括时,与现有情感词嵌入品相比,情感分类任务的精度和宏-F1值显着提高。

著录项

  • 来源
  • 作者单位

    Chongqing Univ Posts & Telecommun Sch Software Engn Chongqing Peoples R China;

    Chongqing Univ Posts & Telecommun Sch Software Engn Chongqing Peoples R China;

    Chongqing Univ Posts & Telecommun Sch Software Engn Chongqing Peoples R China|Chongqing Univ Posts & Telecommun Chongqing Key Lab Cyberspace & Informat Secur Chongqing 400065 Peoples R China|Chongqing Univ Informat & Commun Engn Postdoctoral Res Stn Chongqing Peoples R China;

    Fujian Normal Univ Coll Math & Informat Fuzhou 350117 Fujian Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Word embeddings; Sentiment analysis; Deep learning; Domain adaptation;

    机译:Word Embeddings;情绪分析;深入学习;域适应;
  • 入库时间 2022-08-18 21:31:50

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号