Solving Mixed-Modal Jigsaw Puzzle for Fine-Grained Sketch-Based Image Retrieval



Abstract

ImageNet pre-training has long been considered crucial by the fine-grained sketch-based image retrieval (FG-SBIR) community due to the lack of large sketch-photo paired datasets for FG-SBIR training. In this paper, we propose a self-supervised alternative for representation pre-training. Specifically, we consider the jigsaw puzzle game of recomposing images from shuffled parts. We identify two key facets of jigsaw task design that are required for effective FG-SBIR pre-training. The first is formulating the puzzle in a mixed-modality fashion. Second, we show that framing the optimisation as permutation matrix inference via Sinkhorn iterations is more effective than the common classifier formulation of Jigsaw self-supervision. Experiments show that this self-supervised pre-training strategy significantly outperforms the standard ImageNet-based pipeline across all four product-level FG-SBIR benchmarks. Interestingly, it also leads to improved cross-category generalisation across both pre-train/fine-tune and fine-tune/testing stages.
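
The two design facets named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`sinkhorn`, `mixed_modal_puzzle`, `permutation_matrix`), the log-space normalisation, the `temperature` parameter, and the rule of picking each patch from the photo or the sketch at random are all assumptions made for illustration. Patches are stand-in strings rather than image tensors.

```python
import math
import random


def sinkhorn(scores, n_iters=20, temperature=1.0):
    """Turn a square score matrix into an approximately doubly stochastic one.

    Assumed formulation: work in log space, alternately subtracting the
    log-sum-exp of every row and every column, then exponentiate.
    """
    n = len(scores)
    log_p = [[s / temperature for s in row] for row in scores]

    def logsumexp(values):
        m = max(values)
        return m + math.log(sum(math.exp(v - m) for v in values))

    for _ in range(n_iters):
        # Row step: each row sums to 1 after exponentiation.
        for i in range(n):
            z = logsumexp(log_p[i])
            log_p[i] = [v - z for v in log_p[i]]
        # Column step: each column sums to 1 after exponentiation.
        for j in range(n):
            z = logsumexp([log_p[i][j] for i in range(n)])
            for i in range(n):
                log_p[i][j] -= z
    return [[math.exp(v) for v in row] for row in log_p]


def mixed_modal_puzzle(photo_patches, sketch_patches, rng):
    """Build one shuffled puzzle whose patches come from both modalities.

    Returns (shuffled, perm), where shuffled[k] is the patch that belongs
    at grid position perm[k]. The per-position coin flip is an assumption.
    """
    n = len(photo_patches)
    mixed = [rng.choice((photo_patches[i], sketch_patches[i])) for i in range(n)]
    perm = list(range(n))
    rng.shuffle(perm)
    shuffled = [mixed[p] for p in perm]
    return shuffled, perm


def permutation_matrix(perm):
    """Target matrix T with T[k][perm[k]] = 1 and zeros elsewhere."""
    n = len(perm)
    return [[1.0 if perm[k] == j else 0.0 for j in range(n)] for k in range(n)]
```

Under these assumptions, a network would score every (shuffled patch, grid position) pair, `sinkhorn` would relax those scores into a soft permutation, and a loss would compare the result with `permutation_matrix(perm)`. A classifier formulation would instead treat each candidate permutation as one class label.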


Publication details

Source
IEEE/CVF Conference on Computer Vision and Pattern Recognition | 2020 | pp. 10344-10352 | 9 pages
Authors
Kaiyue Pang; Yongxin Yang; Timothy M. Hospedales; Tao Xiang; Yi-Zhe Song
Original format: PDF
Keywords
Task analysis; Training; Feature extraction; Image edge detection; Footwear; Image retrieval; Computer vision
Date indexed: 2022-08-26 14:36:33

