首页> 外文会议>International conference on computational linguistics >Leveraging Multiple Domains for Sentiment Classification
【24h】

Leveraging Multiple Domains for Sentiment Classification

机译:利用多个域进行情感分类

获取原文

摘要

Sentiment classification becomes more and more important with the rapid growth of user-generated content. However, sentiment classification task usually comes with two challenges: first, sentiment classification is highly domain-dependent and training sentiment classifier for every domain is inefficient and often impractical; second, since the quantity of labeled data is important for assessing the quality of classifier, it is hard to evaluate classifiers when labeled data is limited for certain domains. To address the challenges mentioned above, we focus on learning high-level features that are able to generalize across domains, so a global classifier can benefit with a simple combination of documents from multiple domains. In this paper, the proposed model incorporates both labeled and unlabeled data from multiple domains and learns new feature representations. Our model doesn't require labels from every domain, which means the learned feature representation can be generalized for sentiment domain adaptation. In addition, the learned feature representation can be used as classifier since our model defines the meaning of feature value and arranges high-level features in a prefixed order, so it is not necessary to train another classifier on top of the new features. Empirical evaluations demonstrate our model outperforms baselines and yields competitive results to other state-of-the-art works on the benchmark dataset.
机译:随着用户生成内容的快速增长,情感分类变得越来越重要。然而,情感分类任务通常面临两个挑战:首先,情感分类高度依赖于领域;针对每个领域训练情感分类器效率低下,而且通常不切实际。第二,由于标记数据的数量对于评估分类器的质量很重要,因此当标记数据在某些领域受到限制时,很难评估分类器。为了解决上述挑战,我们专注于学习能够跨域通用化的高级功能,因此全局分类器可以受益于来自多个域的文档的简单组合。在本文中,提出的模型结合了来自多个域的标记和未标记数据,并学习了新的特征表示。我们的模型不需要每个域的标签,这意味着可以将学习到的特征表示形式进行泛化以适应情感域。此外,由于我们的模型定义了特征值的含义并按前缀顺序排列了高级特征,因此学习到的特征表示可以用作分类器,因此无需在新特征之上训练另一个分类器。经验评估表明,我们的模型优于基准,并且与基准数据集上的其他最新技术相比,具有竞争优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号