...
首页> 外文期刊>Multimedia Tools and Applications >CNN-RNN: a large-scale hierarchical image classification framework
【24h】

CNN-RNN: a large-scale hierarchical image classification framework

机译:CNN-RNN:大规模的分层图像分类框架

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Objects are often organized in a semantic hierarchy of categories, where fine-level categories are grouped into coarse-level categories according to their semantic relations. While previous works usually only classify objects into the leaf categories, we argue that generating hierarchical labels can actually describe how the leaf categories evolved from higher level coarse-grained categories, thus can provide a better understanding of the objects. In this paper, we propose to utilize the CNN-RNN framework to address the hierarchical image classification task. CNN allows us to obtain discriminative features for the input images, and RNN enables us to jointly optimize the classification of coarse and fine labels. This framework can not only generate hierarchical labels for images, but also improve the traditional leaf-level classification performance due to incorporating the hierarchical information. Moreover, this framework can be built on top of any CNN architecture which is primarily designed for leaf-level classification. Accordingly, we build a high performance network based on the CNN-RNN paradigm which outperforms the original CNN (wider-ResNet) and also the current state-of-the-art. In addition, we investigate how to utilize the CNN-RNN framework to improve the fine category classification when a fraction of the training data is only annotated with coarse labels. Experimental results demonstrate that CNN-RNN can use the coarse-labeled training data to improve the classification of fine categories, and in some cases it even surpasses the performance achieved by fully annotated training data. This reveals that, CNN-RNN can alleviate the challenge of specialized and expensive annotation of fine labels.
机译:对象通常以类别的语义层次结构进行组织,其中,细级类别根据其语义关系被分组为粗略类别。尽管以前的作品通常仅将对象分类为叶子类别,但我们认为生成分层标签实际上可以描述叶子类别如何从更高级别的粗粒度类别演变而来,从而可以更好地理解对象。在本文中,我们建议利用CNN-RNN框架来解决分层图像分类任务。 CNN使我们能够获得输入图像的判别特征,而RNN使我们能够共同优化粗略标签和精细标签的分类。该框架不仅可以生成图像的分层标签,而且由于合并了分层信息,还可以提高传统的叶级分类性能。此外,可以在主要用于叶级分类的任何CNN体系结构的基础上构建此框架。因此,我们基于CNN-RNN范式构建了一个高性能网络,该网络优于原始的CNN(wider-ResNet)以及当前的最新技术。另外,我们研究了当训练数据的一部分仅使用粗略标签进行标注时,如何利用CNN-RNN框架来改进精细类别分类。实验结果表明,CNN-RNN可以使用粗标签的训练数据来改善精细类别的分类,在某些情况下,甚至可以超过完全注释训练数据所实现的性能。这表明,CNN-RNN可以减轻对精细标签进行专门且昂贵的注释的挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号