首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Domain Adapted Word Embeddings for Improved Sentiment Classification
【24h】

Domain Adapted Word Embeddings for Improved Sentiment Classification

机译:领域自适应词嵌入,用于改进情感分类

获取原文

摘要

Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the specificity of domain specific embeddings. The resulting embeddings, called Domain Adapted (DA) word embeddings, are formed by aligning corresponding word vectors using Canonical Correlation Analysis (CCA) or the related nonlinear Kernel CCA. Evaluation results on sentiment classification tasks show that the DA embeddings substantially outperform both generic and DS embeddings when used as input features to standard or state-of-the-art sentence encoding algorithms for classification.
机译:通用词嵌入在大型通用语料库上进行训练;特定于域(DS)的词嵌入仅针对来自感兴趣域的数据进行训练。本文提出了一种将泛型嵌入的广度与特定领域嵌入的特殊性相结合的方法。通过使用规范相关分析(CCA)或相关的非线性内核CCA对齐相应的词向量,可以形成称为域自适应(DA)词嵌入的结果嵌入。对情感分类任务的评估结果表明,当DA嵌入用作标准或最新句子编码算法的输入特征时,其性能大大优于通用嵌入和DS嵌入。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号