首页> 外文期刊>The Journal of Artificial Intelligence Research >Rademacher Complexity Bounds for a Penalized Multi-class Semi-supervised Algorithm
【24h】

Rademacher Complexity Bounds for a Penalized Multi-class Semi-supervised Algorithm

机译:Rademacher复杂性归属于惩罚的多级半监督算法

获取原文
           

摘要

We propose Rademacher complexity bounds for multi-class classifiers trained with a two-step semi-supervised model. In the first step, the algorithm partitions the partially labeled data and then identifies dense clusters containing k predominant classes using the labeled training examples such that the proportion of their non-predominant classes is below a fixed threshold stands for clustering consistency. In the second step, a classifier is trained by minimizing a margin empirical loss over the labeled training set and a penalization term measuring the disability of the learner to predict the k predominant classes of the identified clusters. The resulting data-dependent generalization error bound involves the margin distribution of the classifier, the stability of the clustering technique used in the first step and Rademacher complexity terms corresponding to partially labeled training data. Our theoretical result exhibit convergence rates extending those proposed in the literature for the binary case, and experimental results on different multi-class classification problems show empirical evidence that supports the theory.
机译:我们提出了具有两步半监督模型的多级分类器的Radimacher复杂性界限。在第一步中,算法将部分标记的数据分区,然后使用标记的训练示例识别包含k主要类别的密集簇,使得其非主要类的比例低于固定阈值,用于聚类一致性。在第二步中,通过最小化标记的训练集和测量学习者的残疾来预测所识别的集群的k个主要类别,通过最小化分类器来训练分类器。由此产生的数据相关的泛化误差绑定涉及分类器的裕度分布,其在第一步和Rademacher复杂性术语中使用的聚类技术的稳定性对应于部分标记的训练数据。我们的理论结果表明,延长了文献中提出的与二进制案件中提出的会聚率,以及不同多级分类问题的实验结果表明了支持该理论的经验证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号