首页> 外文会议>International Conference on Multimedia Modeling >Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization
【24h】

Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization

机译:用于细粒度视觉分类的多分支和多尺度关注学习

获取原文

摘要

ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is one of the most authoritative academic competitions in the field of Computer Vision (CV) in recent years. But applying ILSVRC's annual champion directly to fine-grained visual categorization (FGVC) tasks does not achieve good performance. To FGVC tasks, the small inter-class variations and the large intra-class variations make it a challenging problem. Our attention object location module (AOLM) can predict the position of the object and attention part proposal module (APPM) can propose informative part regions without the need of bounding-box or part annotations. The obtained object images not only contain almost the entire structure of the object, but also contains more details, part images have many different scales and more fine-grained features, and the raw images contain the complete object. The three kinds of training images are supervised by our multi-branch network. Therefore, our multi-branch and multi-scale learning network(MMAL-Net) has good classification ability and robustness for images of different scales. Our approach can be trained end-to-end, while provides short inference time. Through the comprehensive experiments demonstrate that our approach can achieves state-of-the-art results on CUB-200-2011, FGVC-Aircraft and Stanford Cars datasets.
机译:Imagenet大规模视觉识别挑战(ILSVRC)是近年来计算机视觉(CV)领域最权威的学术竞赛之一。但将ILSVRC的年度冠军直接应用于细粒度的视觉分类(FGVC)任务不会达到良好的性能。对于FGVC任务,小阶级的阶级变化和大型级别的变化使其成为一个具有挑战性的问题。我们的注意对象位置模块(AOLM)可以预测物体和注意力部分提案模块(APPM)的位置可以提出信息零件区域,而无需边界盒或部分注释。所获得的对象图像不仅包含对象的整个结构,还包含更多细节,部分图像具有许多不同的尺度和更细粒度的功能,并且原始图像包含完整的对象。三种培训图像由我们的多分支网络监督。因此,我们的多分支和多尺度学习网络(MMAL网)对不同尺度的图像具有良好的分类能力和鲁棒性。我们的方法可以训练结束到底,同时提供短的推理时间。通过综合实验表明,我们的方法可以在Cub-200-2011,FGVC-Aircraft和Stanford Cars数据集上实现最先进的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号