首页> 外文会议>IEEE International Conference on Data Mining Workshops >Integer Programming for Multi-class Active Learning
【24h】

Integer Programming for Multi-class Active Learning

机译:用于多级主动学习的整数编程

获取原文

摘要

Active learning has been demonstrated to be a powerful tool for improving the effectiveness of binary classifiers. It iteratively identifies informative unlabeled examples which after labeling are used to augment the initial training set. Adapting the procedure to large-scale, multi-class classification problems, however, poses certain challenges. For instance, to guarantee improvement by the method we may need to select a large number of examples that require prohibitive labeling resources. Furthermore, the notion of informative examples also changes significantly when multiple classes are considered. In this paper we show that multi-class active learning can be cast into an integer programming framework, where a subset of examples that are informative across maximum number of classes is selected. We test our approach on several large-scale document categorization problems. We demonstrate that in the case of limited labeling resources and large number of classes the proposed method is more effective compared to other known approaches.
机译:积极学习已被证明是提高二元分类器的有效性的强大工具。它迭代地识别信息,在标签后使用它来增加初始训练集之后。然而,将过程调整为大规模,多级分类问题,构成了某些挑战。例如,为了通过该方法保证改进,我们可能需要选择需要禁止标记资源的大量示例。此外,当考虑多个类时,信息示例的概念也会显着变化。在本文中,我们表明,可以将多级主动学习投入到整数编程框架中,其中选择了跨越最大类数的信息的示例子集。我们在几个大规模文档分类问题上测试了我们的方法。我们证明,在有限的标签资源和大量类别的情况下,与其他已知方法相比,所提出的方法更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号