【24h】

Efficient Algorithms for General Active Learning

机译:通用主动学习的高效算法

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Selective sampling, a realistic active learning model, has received recent attention in the learning theory literature. While the analysis of selective sampling is still in its infancy, we focus here on one of the (seemingly) simplest problems that remain open. Given a pool of unlabeled examples, drawn i.i.d. from an arbitrary input distribution known to the learner, and oracle access to their labels, the objective is to achieve a target error-rate with minimum label-complexity, via an efficient algorithm. No prior distribution is assumed over the concept class, however the problem remains open even under the realizability assumption: there exists a target hypothesis in the concept class that perfectly classifies all examples, and the labeling oracle is noiseless. As a precise variant of the problem, we consider the case of learning homogeneous half-spaces in the realizable setting: unlabeled examples, x_t, are drawn i.i.d. from a known distribution D over the surface of the unit ball in R~d and labels y_t are either - 1 or + 1. The target function is a half-space u • x ≥ 0 represented by a unit vector u ∈ R~d such that y_t(u • x_t) > 0 for all t. We denote a hypothesis v''s prediction as v(x) = SGN(v • x).
机译:选择性采样是一种现实的主动学习模型,最近在学习理论文献中受到关注。尽管对选择性抽样的分析仍处于起步阶段,但我们在这里关注的是(看似)最简单的问题之一。给定一组未标记的示例,请绘制i.i.d.从学习者已知的任意输入分布以及oracle对标签的访问中,目标是通过有效的算法以最小的标签复杂度实现目标错误率。没有假定在概念类上进行先验分布,但是即使在可实现性假设下,问题仍然存在:概念类中存在一个目标假设,可以完美地对所有示例进行分类,并且标记预言是无噪音的。作为问题的精确变体,我们考虑在可实现的环境中学习齐次半空间的情况:未标注示例x_t,即i.i.d.从R〜d中单位球表面上的已知分布D开始,标号y_t为-1或+1。目标函数为半空间u•x≥0,由单位矢量u∈R〜d表示这样,对于所有t,y_t(u•x_t)> 0。我们将假设v的预测表示为v(x)= SGN(v•x)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号