
Learning a Classifier when the Labeling Is Known



Abstract

We introduce a new model of learning, Known-Labeling-Classifier-Learning (KLCL). The goal of such learning is to find a low-error classifier from some given target class of predictors, when the correct labeling is known to the learner. This learning problem can be viewed as measuring the information conveyed by the identity of input examples, rather than by their labels. Given some class of predictors H, a labeling function, and an i.i.d. unlabeled sample generated by some unknown data distribution, the goal of our learner is to find a classifier in H that has as low as possible error with respect to the sample-generating distribution and the given labeling function. When the labeling function does not belong to the target class, the error of members of the class (and thus their relative quality as label predictors) varies with the marginal of the underlying data distribution. We prove a trichotomy with respect to the KLCL sample complexity. Namely, we show that for any learnable concept class H, its KLCL sample complexity is either 0 or Θ(1/ε) or Ω(1/ε^2). Furthermore, we give a simple combinatorial property of concept classes that characterizes this trichotomy. Our results imply new sample-size lower bounds for the common agnostic PAC model: a lower bound of Ω(1/ε^2) on the sample complexity of learning deterministic classifiers, as well as novel results about the utility of unlabeled examples in a semi-supervised learning setup.

Bibliographic Details

  • Source: Algorithmic Learning Theory | 2011 | p. 440-451 | 12 pages
  • Venue: Espoo (FI)
  • Author affiliations:

    Faculty of Mathematics, University of Waterloo, Waterloo, ON N2L 3G1, Canada;

    David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada;

  • Format: PDF
  • Language: English
  • Classification: Artificial intelligence theory
