【24h】

Domain Adaptation Using Domain Similarity- and Domain Complexity-Based Instance Selection for Cross-Domain Sentiment Analysis

机译:域适应使用域名相似性和基于域复杂性的实例选择进行跨域情绪分析

获取原文

摘要

We propose an approach to domain adaptation that selects instances from a source domain training set, which are most similar to a target domain. The factor by which the original source domain training set size is reduced is determined automatically by measuring domain similarity between source and target domain as well as their domain complexity variance. Domain similarity is measured as divergence between term unigram distributions. Domain complexity is measured as homogeneity, i.e. self-similarity. We evaluate our approach in a semi-supervised cross-domain document-level polarity classification experiment. Thereby we show, that it yields small but statistically significant improvements over several natural baselines and achieves results competitive to other state-of-the-art domain adaptation schemes.
机译:我们提出了一种域制自适应的方法,可以从源域训练集中选择实例,这与目标域最相似。 通过测量源域和目标域之间的域相似以及它们的域复杂度方差,自动确定原始源域训练集大小的因素。 域名相似度在术语UNIGRAM分布之间测量分歧。 域复杂度被测量为均匀性,即自相似性。 我们在半监督跨域文档级极性分类实验中评估我们的方法。 因此,我们展示了它在几种自然基线上产生的小而统计上显着的改进,并且实现了对其他最先进的域适应方案的结果。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号