【24h】

Co-training from an Incremental EM Perspective

机译:从增量EM角度进行联合培训

获取原文
获取原文并翻译 | 示例

摘要

We study classification when the majority of data is unlabeled, and only a small fraction is labeled: the so-called semi-supervised learning situation. Blum and Mitchell's co-training is a popular semi-supervised algorithm to use when we have multiple independent views of the entities to classify. An example of a multi-view situation is classifying web pages: one view may describe the pages by the words that occur on mem, another view describes the pages by the words in the hyperlinks that point to them. In co-training two learners each form a model from the labeled data and then incrementally label small subsets of the unlabeled data for each other. The learners then re-estimate their model from the labeled data and the psuedo-labels provided by the learners. Though some analysis of the algorithm's performance exists the computation performed is still not well understood. We propose that each view in co-training is effectively performing incremental EM as postulated by Neal and Hinton, combined with a Bayesian classifier. This analysis suggests improvements over the core co-training algorithm. We introduce variations, which result in faster convergence to the maximum possible accuracy of classification than the core co-training algorithm, and therefore increase the learning efficiency. We empirically verify our claim for a number of data sets in the context of belief network learning.
机译:当大多数数据没有标签,而只有一小部分被标记时,我们研究分类:所谓的半监督学习情况。 Blum和Mitchell的协同训练是一种流行的半监督算法,当我们对实体有多个独立的视图进行分类时可以使用。多视图情况的一个示例是对网页进行分类:一个视图可以通过出现在mem上的单词来描述页面,另一视图可以通过指向它们的超链接中的单词来描述页面。在共同训练中,两个学习者各自从标记的数据中形成一个模型,然后彼此递增地标记未标记数据的一小部分。然后,学习者根据学习者提供的标签数据和伪标签重新估计其模型。尽管已经对该算法的性能进行了一些分析,但是所执行的计算仍然不是很了解。我们建议,在联合训练中,每种观点都有效地执行了由Neal和Hinton提出的结合贝叶斯分类器的新兴EM。该分析表明对核心协同训练算法进行了改进。我们引入了变异,与核心协同训练算法相比,变异可以更快地收敛到最大的分类精度,从而提高学习效率。我们在信念网络学习的背景下凭经验验证了我们对许多数据集的主张。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号