首页> 外文期刊>Machine Learning >Annotation cost-sensitive active learning by tree sampling
【24h】

Annotation cost-sensitive active learning by tree sampling

机译:通过树采样对注释成本敏感的主动学习

获取原文
获取原文并翻译 | 示例
           

摘要

Active learning is an important machine learning setup for reducing the labelling effort of humans. Although most existing works are based on a simple assumption that each labelling query has the same annotation cost, the assumption may not be realistic. That is, the annotation costs may actually vary between data instances. In addition, the costs may be unknown before making the query. Traditional active learning algorithms cannot deal with such a realistic scenario. In this work, we study annotation cost-sensitive active learning algorithms, which need to estimate the utility and cost of each query simultaneously. We propose a novel algorithm, the cost-sensitive tree sampling algorithm, that conducts the two estimation tasks together and solve it with a tree-structured model motivated from hierarchical sampling, a famous algorithm for traditional active learning. Extensive experimental results using datasets with simulated and true annotation costs validate that the proposed method is generally superior to other annotation cost-sensitive algorithms.
机译:主动学习是一种重要的机器学习设置,可以减少人类的标签工作量。尽管大多数现有作品都是基于一个简单的假设,即每个标签查询具有相同的注释成本,但这种假设可能并不现实。也就是说,注释成本实际上可能在数据实例之间有所不同。另外,在进行查询之前,费用可能是未知的。传统的主动学习算法无法应对这种现实情况。在这项工作中,我们研究注释成本敏感型主动学习算法,该算法需要同时估算每个查询的效用和成本。我们提出了一种新颖的算法,即成本敏感型树采样算法,该算法可同时执行两个估计任务,并使用一种以分层采样为动力的树结构模型(传统的主动学习算法)对其进行求解。使用具有模拟和真实注释成本的数据集进行的大量实验结果证明,该方法通常优于其他注释成本敏感算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号