首页> 外文会议>International conference on computational linguistics >Sentiment Classification with Graph Co-Regularization
【24h】

Sentiment Classification with Graph Co-Regularization

机译:图共正则化的情感分类

获取原文

摘要

Sentiment classification aims to automatically predict sentiment polarity (e.g., positive or negative) of user-generated sentiment data (e.g., reviews, blogs). To obtain sentiment classification with high accuracy, supervised techniques require a large amount of manually labeled data. The labeling work can be time-consuming and expensive, which makes unsupervised (or semi-supervised) sentiment analysis essential for this application. In this paper, we propose a novel algorithm, called graph co-regularized non-negative matrix tri-factorization (GNMTF), from the geometric perspective. GNMTF assumes that if two words (or documents) are sufficiently close to each other, they tend to share the same sentiment polarity. To achieve this, we encode the geometric information by constructing the nearest neighbor graphs, in conjunction with a non-negative matrix tri-factorization framework. We derive an efficient algorithm for learning the factorization, analyze its complexity, and provide proof of convergence. Our empirical study on two open data sets validates that GNMTF can consistently improve the sentiment classification accuracy in comparison to the state-of-the-art methods.
机译:情感分类旨在自动预测用户生成的情感数据(例如评论,博客)的情感极性(例如,正面或负面)。为了获得高精度的情感分类,监督技术需要大量的手动标记数据。标记工作可能既耗时又昂贵,这使得无监督(或半监督)的情绪分析对于此应用程序至关重要。在本文中,我们从几何学角度提出了一种新颖的算法,称为图形共正则化非负矩阵三因子分解(GNMTF)。 GNMTF假设,如果两个单词(或文档)彼此足够接近,则它们倾向于共享相同的情感极性。为了实现这一点,我们结合非负矩阵三因子分解框架,通过构造最近的邻居图来对几何信息进行编码。我们推导了一种有效的算法,用于学习因式分解,分析其复杂性并提供收敛性证明。我们对两个开放数据集的经验研究证实,与最新方法相比,GNMTF可以持续提高情感分类的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号