首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Efficient label collection for unlabeled image datasets
【24h】

Efficient label collection for unlabeled image datasets

机译:有效的标签集合,用于未标记的图像数据集

获取原文

摘要

Visual classifiers are part of many applications including surveillance, autonomous navigation and scene understanding. The raw data used to train these classifiers is abundant and easy to collect but lacks labels. Labels are necessary for training supervised classifiers, but the labeling process requires significant human effort. Techniques like active learning and group-based labeling have emerged to help reduce the labeling workload. However, the possibility of collecting label noise affects either the efficiency of these systems or the performance of the trained classifiers. Further, many introduce latency by iteratively retraining classifiers or re-clustering data. We introduce a technique that searches for structural change in hierarchically clustered data to identify a set of clusters that span a spectrum of visual concept granularities. This allows us to efficiently label clusters with less label noise and produce high performing classifiers. The data is hierarchically clustered only once, eliminating latency during the labeling process. Using benchmark data we show that collecting labels with our approach is more efficient than existing labeling techniques, and achieves higher classification accuracy. Finally, we demonstrate the speed and efficiency of our system using real-world data collected for an autonomous navigation task.
机译:Visual Classifiry是许多应用程序的一部分,包括监视,自主导航和场景理解。用于训练这些分类器的原始数据丰富,易于收集但缺乏标签。标签是培训监督分类机所必需的,但标签过程需要大量的人力努力。已经出现了像主动学习和基于组的标签的技术,以帮助减少标签工作量。然而,收集标签噪声的可能性会影响这些系统的效率或训练有素的分类器的性能。此外,许多通过迭代再培训分类器或重新聚类数据引入延迟。我们介绍了一种搜索分层集群数据的结构变化的技术,以识别跨越视觉概念粒度频谱的一组集群。这允许我们有效地标记具有较少标签噪声的群集,并产生高性能的分类器。数据仅在分层群集一次,在标记过程中消除延迟。使用基准数据,我们显示使用我们的方法收集标签比现有的标签技术更效率,并且达到更高的分类精度。最后,我们展示了我们使用为自主导航任务收集的现实数据的系统的速度和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号