首页> 外文会议>Pacific Asia Conference on Language, Information and Computation >Hybrid Method of Semi-supervised Learning and Feature Weighted Learning for Domain Adaptation of Document Classification
【24h】

Hybrid Method of Semi-supervised Learning and Feature Weighted Learning for Domain Adaptation of Document Classification

机译:基于半监督学习和特征加权学习的混合分类方法

获取原文

摘要

In regard to document classification, semi-supervised learning using the Naive Bayes method and EM algorithm was a great success, and we refer to this method as NBEM in this paper. Although NBEM is also effective for domain adaption of document classification, there is still room for improvement because NBEM does not employ valuable information for this task, that is the difference between source domain and target domain. Here, according to the similarity between the label distribution of the feature on source domain and the estimated label distribution of the feature on target domain, we set the weight on the features to reconstruct the training data. We use this reconstructed training data to perform document classification by NBEM. As a result of experiment by using a part of 20 Newsgroups, the effect of this method was confirmed.
机译:关于文档分类,使用朴素贝叶斯方法和EM算法的半监督学习取得了很大的成功,在本文中我们将该方法称为NBEM。尽管NBEM对于文档分类的域适应也很有效,但仍有改进空间,因为NBEM并未使用有价值的信息来完成此任务,这就是源域与目标域之间的差异。在此,根据源域上特征的标签分布与目标域上特征的估计标签分布之间的相似性,我们对特征进行权重设置以重建训练数据。我们使用此重构的训练数据来执行NBEM的文档分类。通过使用20个新闻组的一部分进行实验的结果,证实了此方法的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号