首页> 外文会议>European conference on computer vision >Two-Stage Training for Improved Classification of Poorly Localized Object Images
【24h】

Two-Stage Training for Improved Classification of Poorly Localized Object Images

机译:两阶段培训,改进了本地化对象图像的分类

获取原文

摘要

State-of-the-art object classifiers finetuned from a pretrained (e.g. from ImageNet) model on a domain-specific dataset can accurately classify well-localized object images. However, such classifiers often fail on poorly localized images (images with lots of context, heavily occluded/partially visible, and off-centered objects). In this paper, we propose a two-stage training scheme to improve the classification of such noisy detections, often produced by low-compute algorithms such as motion based background removal techniques that run on the edge. The proposed two-stage training pipeline first trains a classifier from scratch with extreme image augmentation, followed by finetuning in the second stage. The first stage incorporates a lot of contextual information around the objects, given access to the corresponding full images. This stage works very well for classification of poorly localized input images, but generates a lot of false positives by classifying non-object images as objects. To reduce the false positives, a second training is done on the tight ground-truth bounding boxes (as done traditionally) by using the trained model in the first stage as the initial model and very slowly adjusting its weights during the training. To demonstrate the efficacy of our approach, we curated a new classification dataset for poorly localized images - noisy PASCAL VOC 2007 test dataset. Using this dataset, we show that the proposed two-stage training scheme can significantly improve the accuracy of the trained classifier on both well-localized and poorly-localized object images.
机译:从域特定数据集上的预读数(例如来自Imagenet)模型的最先进的对象分类器可以准确地对本地化的对象图像进行准确地分类。然而,这种分类器通常在局部的局部图像上失败(具有大量上下文,严重封闭/部分可见的和偏心对象的图像)。在本文中,我们提出了一种两级训练方案,以改善这种嘈杂检测的分类,通常由低计算算法产生,例如在边缘上运行的基于运动的背景删除技术。提出的两阶段训练管道首先用极端图像增强从划痕中训练分类器,然后在第二阶段进行芬降。第一阶段包含大量对象的上下文信息,给定对相应的完整图像的访问。此阶段对于本地化输入图像的分类非常好,但通过将非对象图像分类为对象来生成大量误报。为了减少误报,通过在第一阶段中的训练模型作为初始模型,在训练期间使用训练模型并在训练期间非常缓慢地调整其权重来完成第二次训练。为了展示我们的方法的功效,我们策划了一个新的分类数据集,适用于本地化的图像差,噪音Pascal VOC 2007测试数据集。使用此数据集,我们表明所提出的两级训练方案可以显着提高训练分类器对局部局部和局部良好的对象图像的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号