...
首页> 外文期刊>Multimedia Tools and Applications >Classify multi-label images via improved CNN model with adversarial network
【24h】

Classify multi-label images via improved CNN model with adversarial network

机译:通过具有对冲网络的改进的CNN模型来分类多标签图像

获取原文
获取原文并翻译 | 示例
           

摘要

Convolution neural network (CNN) achieves outstanding results in single-label image classification task. However, due to the complex underlying object layout and insufficient multi-label training images, how to achieve better performance for multi-label images via CNN is still an open problem. In this work, we propose an improved deep CNN model which can extract features of objects at different scales in multi-label images by spatial pyramid pooling as well as feature fusion. In model training, we first transfer the parameters pre-trained on ImageNet to our model, then an Adversarial Network is trained to generate examples with occlusions, which makes our model invariant to occlusions. Experimental results on Pascal VOC 2012 and Corel 5K image datasets demonstrate the superiority of the proposed approach over many approaches. The mAP of our model reaches 84.0% on the VOC 2012 dataset, which significantly outperforms most approaches and closes to HCP, the representative multi-label classification approach.
机译:卷积神经网络(CNN)实现了单标图像分类任务中的出色结果。但是,由于复杂的底层对象布局和多标签训练图像不足,如何通过CNN实现多标签图像的更好性能仍然是一个打开的问题。在这项工作中,我们提出了一种改进的深层CNN模型,可以通过空间金字塔池以及特征融合来提取多标签图像中不同尺度的物体的特征。在模型训练中,我们首先将在想象中预先培训的参数转移到我们的模型,然后培训对抗网络以生成带有遮挡的示例,这使得我们的模型不变于遮挡。 Pascal VOC 2012和Corel 5K图像数据集的实验结果证明了在许多方法中提出的方法的优越性。我们模型的地图在VOC 2012数据集中达到84.0%,这显着优于大多数方法和关闭HCP,代表性的多标签分类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号