首页> 外文会议>2010 WASE International Conference on Information Engineering >A Semi-supervised Text Classification Method Based on Incremental EM Algorithm
【24h】

A Semi-supervised Text Classification Method Based on Incremental EM Algorithm

机译:基于增量EM算法的半监督文本分类方法

获取原文

摘要

In the standard EM-based semi-supervised text classification, the classification performance is not well when the initial labeled samples are a few. How to improve the performance is an important issue. In view of this, a semi-supervised method based on incremental EM algorithm is proposed. This method makes full use of the useful information of intermediate classifier. On the one hand, this method verifies the feasibility of division existed in unlabeled samples, and uses the division mechanism to enhance the reliability of new incremental samples by dividing the unlabeled samples scientifically; on the other hand, a feedback learning mechanism is proposed, and it is used to decrease the probability of adding misclassified samples. Experimental results show that the classification performance is improved in our method.
机译:在标准的基于EM的半监督文本分类中,当初始标记的样本很少时,分类性能不佳。如何提高性能是一个重要的问题。鉴于此,提出了一种基于增量EM算法的半监督方法。该方法充分利用了中间分类器的有用信息。一方面,该方法验证了未标记样品存在划分的可行性,并通过科学地对未标记样品进行划分,利用划分机制提高了新增量样本的可靠性。另一方面,提出了一种反馈学习机制,用于减少添加分类错误的样本的可能性。实验结果表明,该方法提高了分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号