首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Stopping criterion for active learning based on deterministic generalization bounds
【24h】

Stopping criterion for active learning based on deterministic generalization bounds

机译:基于确定性泛化界限停止主动学习的标准

获取原文
       

摘要

Active learning is a framework in which the learning machine can select the samples to be used for training. This technique is promising, particularly when the cost of data acquisition and labeling is high. In active learning, determining the timing at which learning should be stopped is a critical issue. In this study, we propose a criterion for automatically stopping active learning. The proposed stopping criterion is based on the difference in the expected generalization errors and hypothesis testing. We derive a novel upper bound for the difference in expected generalization errors before and after obtaining a new training datum based on PAC-Bayesian theory. Unlike ordinary PAC-Bayesian bounds, though, the proposed bound is deterministic; hence, there is no uncontrollable trade-off between the confidence and tightness of the inequality. We combine the upper bound with a statistical test to derive a stopping criterion for active learning. We demonstrate the effectiveness of the proposed method via experiments with both artificial and real datasets.
机译:主动学习是一个框架,其中学习机可以选择用于培训的样本。这种技术很有希望,特别是当数据采集和标签的成本很高时。在主动学习中,确定应该停止学习的时间是一个关键问题。在这项研究中,我们提出了一种自动停止活动学习的标准。所提出的停止标准基于预期概括误差和假设检测的差异。我们在基于Pac-Bayesian理论获得新的培训基准之前和之后获得了预期泛化误差的差异的新型上限。然而,与普通的Pac-Bayesian界不同,建议的界限是确定性的;因此,在不平等的信心和紧张之间没有无法控制的权衡。我们将上限与统计测试结合起来,以导出积极学习的停止标准。我们通过用人工和真实数据集的实验展示了所提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号