首页> 外文会议>China National Conference on Computational Linguistics >Incorporating Sample Filtering into Subject-Based Ensemble Model for Cross-Domain Sentiment Classification
【24h】

Incorporating Sample Filtering into Subject-Based Ensemble Model for Cross-Domain Sentiment Classification

机译:将样本滤波结合到基于主题的集合模型,用于跨域情感分类

获取原文

摘要

Recently, cross-domain sentiment classification is becoming popular owing to its potential applications, such as marketing et al. It seeks to generalize a model, which is trained on a source domain and using it to label samples in the target domain. However, the source and target distributions differ substantially in many cases. To address this issue, we propose a comprehensive model, which takes sample filtering and labeling adaptation into account simultaneously, named joint Sample Filtering with Subject-based Ensemble Model (SF-SE). Firstly, a sentence level Latent Dirichlet Allocation (LDA) model, which incorporates topic and sentiment together (SS-LDA) is introduced. Under this model, a high-quality training dataset is constructed in an unsupervised way. Secondly, inspired by the distribution variance of domain-independent and domain-specific features related to the subject of a sentence, we introduce a Subject-based Ensemble model to efficiently improve the classification performance. Experimental results show that the proposed model is effective for cross-domain sentiment classification.
机译:最近,由于其潜在的应用,跨域情绪分类正在变得流行,例如营销等。它试图概括一个模型,该模型在源域上培训并使用它来标记目标域中的样本。然而,源和目标分布在许多情况下大大不同。为了解决这个问题,我们提出了一个全面的模型,它同时将样本过滤和标记适应,命名为具有基于主题的集合模型(SF-SE)的联合样本滤波。首先,引入了将主题和情绪结合在一起(SS-LDA)的句子级潜在Dirichlet分配(LDA)模型。在此模型下,以无人监督的方式构建了高质量的培训数据集。其次,通过与句子主题相关的域独立和域特定功能的分布方差启发,我们介绍了基于主题的集合模型,以有效地提高分类性能。实验结果表明,该模型对跨域情绪分类有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号