首页> 外文期刊>Computer Animation and Virtual Worlds >Multilayer feature descriptors fusion CNN models for fine-grained visual recognition
【24h】

Multilayer feature descriptors fusion CNN models for fine-grained visual recognition

机译:多层特征描述符融合CNN模型以实现细粒度的视觉识别

获取原文
获取原文并翻译 | 示例
           

摘要

Fine-grained image classification is a challenging topic in the field of computer vision. General models based on first-order local features cannot achieve acceptable performance because the features are not so efficient in capturing fine-grained difference. A bilinear convolutional neural network (CNN) model exhibits that a second-order statistical feature is more efficient in capturing fine-grained difference than a first-order local feature. However, this framework only considers the extraction of a second-order feature descriptor, using a single convolutional layer. The potential effective classification features of other convolutional layers are ignored, resulting in loss of recognition accuracy. In this paper, a multilayer feature descriptors fusion CNN model is proposed. It fully considers the second-order feature descriptors and the first-order local feature descriptor generated by different layers. Experimental verification was carried out on fine-grained classification benchmark data sets, CUB-200-2011, Stanford Cars, and FGVC-aircraft. Compared with the bilinear CNN model, the proposed method has improved accuracy by 0.8%, 1.1%, and 5.5%. Compared with the compact bilinear pooling model, there is an accuracy increase of 0.64%, 1.63%, and 1.45%, respectively. In addition, the proposed model effectively uses multiple 1x1 convolution kernels to reduce dimension. The experimental results show that the multilayer low-dimensional second-order feature descriptors fusion model has comparable recognition accuracy of the original model.
机译:细粒度的图像分类是计算机视觉领域中一个具有挑战性的主题。基于一阶局部特征的通用模型无法实现可接受的性能,因为这些特征在捕获细粒度差异方面效率不高。双线性卷积神经网络(CNN)模型显示,与一阶局部特征相比,二阶统计特征在捕获细粒度差异方面更有效。但是,此框架仅考虑使用单个卷积层提取二阶特征描述符。其他卷积层的潜在有效分类特征将被忽略,从而导致识别精度下降。本文提出了一种多层特征描述符融合的CNN模型。它充分考虑了由不同层生成的二阶特征描述符和一阶局部特征描述符。在细粒度的分类基准数据集,CUB-200-2011,斯坦福汽车和FGVC飞机上进行了实验验证。与双线性CNN模型相比,该方法的准确性提高了0.8%,1.1%和5.5%。与紧凑型双线性池模型相比,精度分别提高了0.64%,1.63%和1.45%。此外,所提出的模型有效地使用了多个1x1卷积核来减小维数。实验结果表明,多层低维二阶特征描述符融合模型具有与原始模型相当的识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号