首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >An Adversarial Domain Adaptation Network For Cross-Domain Fine-Grained Recognition
【24h】

An Adversarial Domain Adaptation Network For Cross-Domain Fine-Grained Recognition

机译:跨域细粒度识别的对抗域自适应网络

获取原文

摘要

In this paper, we tackle a valuable yet very challenging visual recognition task, where the instances are within a subordinate category, and the target domain undergoes a shift with the source domain. This task, termed as cross-domain fine-grained recognition, relates closely to many real-life scenarios, e.g., recognizing retail products in storage racks by models trained with images collected in controlled environments. To deal with this problem, we design a new algorithm and propose a corresponding fine-grained domain adaptation dataset. Firstly, we propose a novel end-to-end CNN architecture that integrates two specialized modules: an adversarial module for domain alignment and a self-attention module for fine-grained recognition. The adversarial module is used to handle domain shift by gradually aligning the different domains with domain-level and class-level alignments, and strive to help the classifier learn with domain-invariant features generated by nets. The self-attention module is designed to capture discriminative image regions which are crucial for fine-grained visual recognition. Secondly, we collect a large-scale fine-grained domain adaptation dataset of retail products, which contains 52,011 images of 263 classes from 3 domains. Thirdly, we validate the effectiveness of our method on three datasets, showing that the proposed method can yield significant improvements over baseline methods on fine-grained datasets. Besides, we also evaluate the effectiveness of the self-attention module by performing visualization, which can capture the discriminative image regions in both source and target domains.
机译:在本文中,我们解决了一项有价值但又极具挑战性的视觉识别任务,其中实例位于下属类别内,目标域与源域发生转移。这项称为跨域细粒度识别的任务与许多现实情况密切相关,例如,通过在受控环境中收集图像训练的模型来识别存储架中的零售产品。为了解决这个问题,我们设计了一种新算法,并提出了相应的细粒度域自适应数据集。首先,我们提出了一种新颖的端到端CNN架构,该架构集成了两个专用模块:用于域对齐的对抗模块和用于细粒度识别的自我关注模块。对抗模块用于通过逐步将不同的领域与领域级别和类别级别的对齐方式对齐来处理领域转移,并努力帮助分类器学习网络生成的领域不变特征。自我注意模块旨在捕获对细粒度视觉识别至关重要的辨别性图像区域。其次,我们收集了零售产品的大规模细粒度域适应数据集,其中包含来自3个域的263个类别的52,011张图像。第三,我们验证了我们的方法在三个数据集上的有效性,表明该方法相对于细粒度数据集的基线方法具有明显的改进。此外,我们还通过执行可视化来评估自我注意模块的有效性,该模块可以捕获源域和目标域中的区别性图像区域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号