...
首页> 外文期刊>International journal of machine learning and cybernetics >A multiclass boosting algorithm to labeled and unlabeled data
【24h】

A multiclass boosting algorithm to labeled and unlabeled data

机译:用于标记和未标记数据的多类提升算法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this article we focus on the semi-supervised learning. Semi-supervised learning typically is a learning task from both labeled and unlabeled data. We especially consider the multiclass semi-supervised classification problem. To solve the multiclass semi-supervised classification problem we propose a new multiclass loss function using new codewords. In the proposed loss function, we combine the classifier predictions, based on the labeled data, and the pairwise similarity between labeled and unlabeled examples. The main goal of the proposed loss function is to minimize the inconsistency between classifier predictions and the pairwise similarity. The proposed loss function consists of two terms. The first term is the multiclass margin cost of the labeled data and the second term is a regularization term on unlabeled data. The regularization term is used to minimize the cost of pseudo-margin on unlabeled data. We then derive a new multiclass boosting algorithm from the proposed risk function, called GMSB. The derived algorithm also uses a set optimal similarity functions for a given dataset. The results of our experiments on a number of UCI and real-world biological, text, and image datasets show that GMSB outperforms the state-of-the-art boosting methods to multiclass semi-supervised learning.
机译:在本文中,我们重点介绍半监督学习。半监督学习通常是来自标记和未标记数据的学习任务。我们特别考虑多类半监督分类问题。为了解决多类半监督分类问题,我们提出了使用新码字的新多类损失函数。在提出的损失函数中,我们基于标记的数据以及标记和未标记示例之间的成对相似性,组合了分类器预测。提出的损失函数的主要目的是使分类器预测与成对相似性之间的不一致性最小化。拟议的损失函数包括两个项。第一项是标记数据的多类保证金成本,第二项是未标记数据的正则化项。正则化术语用于最小化未标记数据上的伪保证金成本。然后,我们从提出的风险函数(称为GMSB)派生出一种新的多类提升算法。派生算法还为给定数据集使用了一组最佳相似性函数。我们在许多UCI和真实世界的生物学,文本和图像数据集上进行的实验结果表明,对于多类半监督学习而言,GMSB的性能优于最先进的增强方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号