【24h】

Active Learning with Direct Query Construction

机译:主动学习与直接查询构造

获取原文

摘要

Active learning may hold the key for solving the data scarcity problem in supervised learning, i.e., the lack of labeled data. Indeed, labeling data is a costly process, yet an active learner may request labels of only selected instances, thus reducing labeling work dramatically. Most previous works of active learning are, however, pool-based;that is, a pool of unla-beled examples is given and the learner can only select examples from the pool to query for their labels. This type of active learning has several weaknesses. In this paper we propose novel active learning algorithms that construct examples directly to query for labels. We study both a specific active learner based on the decision tree algorithm, and a general active learner that can work with any base learning algorithm. As there is no restriction on what examples to be queried, our methods are shown to often query fewer examples to reduce the predictive error quickly. This casts doubt on the usefulness of the pool in pool-based active learning. Nevertheless, our methods can be easily adapted to work with a given pool of unlabeled examples.
机译:主动学习可能是解决有监督学习中数据短缺问题(即缺少标记数据)的关键。确实,标记数据是一个昂贵的过程,但是活跃的学习者可能只请求选定实例的标记,从而大大减少了标记工作。但是,以前的大多数主动学习作品都是基于池的;也就是说,给出了一系列毫无根据的示例,学习者只能从池中选择示例以查询其标签。这种主动学习有一些缺点。在本文中,我们提出了新颖的主动学习算法,该算法可直接构造示例以查询标签。我们既研究基于决策树算法的特定主动学习者,又研究可以与任何基础学习算法一起使用的一般主动学习者。由于对要查询的示例没有限制,因此我们的方法显示出经常查询较少的示例以快速减少预测误差。这使人们对池在基于池的主动学习中的有用性产生怀疑。然而,我们的方法可以很容易地调整以与给定数量的未标记示例一起使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号