User-Click-Data-Based Fine-Grained Image Recognition via Weakly Supervised Metric Learning

Tan Min; Yu Jun; Yu Zhou; Gao Fei; Rui Yong; Tao Dacheng

首页> 外文期刊>ACM transactions on multimedia computing communications and applications >User-Click-Data-Based Fine-Grained Image Recognition via Weakly Supervised Metric Learning

【24h】

User-Click-Data-Based Fine-Grained Image Recognition via Weakly Supervised Metric Learning

机译：通过弱监督度量学习的基于用户点击数据的细粒度图像识别

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a novel fine-grained image recognition framework using user click data, which can bridge the semantic gap in distinguishing categories that are similar in visual. As query set in click data is usually large-scale and redundant, we first propose a click-feature-based query-merging approach to merge queries with similar semantics and construct a compact click feature. Afterward, we utilize this compact click feature and convolutional neural network (CNN)-based deep visual feature to jointly represent an image. Finally, with the combined feature, we employ the metriclearning-based template-matching scheme for efficient recognition. Considering the heavy noise in the training data, we introduce a reliability variable to characterize the image reliability, and propose a weakly-supervised metric and template leaning with smooth assumption and click prior (WMTLSC) method to jointly learn the distance metric, object templates, and image reliability. Extensive experiments are conducted on a public Clickture-Dog dataset and our newly established Clickture-Bird dataset. It is shown that the click-data-based query merging helps generating a highly compact (the dimension is reduced to 0.9%) and dense click feature for images, which greatly improves the computational efficiency. Also, introducing this click feature into CNN feature further boosts the recognition accuracy. The proposed framework performs much better than previous state-of-the-arts in fine-grained recognition tasks.

机译：我们提出一种使用用户点击数据的新颖的细粒度图像识别框架，该框架可以在区分视觉相似的类别中弥合语义鸿沟。由于点击数据中的查询集通常是大规模且多余的，因此我们首先提出一种基于点击功能的查询合并方法，以合并具有相似语义的查询并构建紧凑的点击功能。之后，我们利用这种紧凑的点击功能和基于卷积神经网络（CNN）的深度视觉功能来共同表示图像。最后，结合组合特征，我们采用基于metriclearning的模板匹配方案进行有效识别。考虑到训练数据中的大量噪声，我们引入了一个可靠性变量来表征图像的可靠性，并提出了一种具有弱假设的弱监督指标和模板，并采用先验单击（WMTLSC）方法来共同学习距离指标，对象模板，和图像可靠性。在公共Clickture-Dog数据集和我们新建立的Clickture-Bird数据集上进行了广泛的实验。结果表明，基于点击数据的查询合并有助于为图像生成高度紧凑（尺寸减小至0.9％）和密集点击的功能，从而大大提高了计算效率。此外，将此点击功能引入CNN功能可进一步提高识别准确性。提出的框架在细粒度识别任务方面的性能比以前的最新技术要好得多。

著录项

来源
《ACM transactions on multimedia computing communications and applications》 |2018年第3期|70.1-70.23|共23页
作者
Tan Min; Yu Jun; Yu Zhou; Gao Fei; Rui Yong; Tao Dacheng;
展开▼
作者单位

Hangzhou Dianzi Univ, Sch Comp Sci & Technol, 1158,2nd Ave, Hangzhou 310018, Zhejiang, Peoples R China;

Hangzhou Dianzi Univ, Sch Comp Sci & Technol, 1158,2nd Ave, Hangzhou 310018, Zhejiang, Peoples R China;

Hangzhou Dianzi Univ, Sch Comp Sci & Technol, 1158,2nd Ave, Hangzhou 310018, Zhejiang, Peoples R China;

Hangzhou Dianzi Univ, Sch Comp Sci & Technol, 1158,2nd Ave, Hangzhou 310018, Zhejiang, Peoples R China;

Lenovo, 6 Shang Di West Rd, Beijing 100085, Peoples R China;

Univ Sydney, Fac Engn & Informat Technol, Sydney, NSW, Australia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Metric learning; fine-grained image recognition; user click data; convolutional neural network; weakly supervised learning;

机译：度量学习;细粒度图像识别;用户点击数据;卷积神经网络;弱监督学习;

相似文献

外文文献
中文文献
专利

1. Weakly Supervised Metric Learning for Traffic Sign Recognition in a LIDAR-Equipped Vehicle [J] . Min Tan, Baoyuan Wang, Zhaohui Wu, IEEE Transactions on Intelligent Transportation Systems . 2016,第5期

机译：配备激光雷达的车辆中的弱监督度量学习，用于交通标志识别
2. Robust object recognition via weakly supervised metric and template learning [J] . Tan Min, Hu Zhenfang, Wang Baoyuan, Neurocomputing . 2016,第Mara12期

机译：通过弱监督指标和模板学习进行可靠的对象识别
3. Pixel-to-Pixel Learning With Weak Supervision for Single-Stage Nucleus Recognition in Ki67 Images [J] . Xing Fuyong, Cornish Toby C., Bennett Tell, IEEE Transactions on Biomedical Engineering . 2019,第11期

机译：对Ki67图像中的单阶段核识别进行弱监督的像素间学习。
4. Fine-grained image recognition via weakly supervised click data guided bilinear CNN model [C] . Guangjian Zheng, Min Tan, Jun Yu, IEEE International Conference on Multimedia and Expo . 2017

机译：通过弱监督点击数据引导的双线性CNN模型进行细粒度图像识别
5. Weakly supervised learning on image manifolds. [D] . Wu, Hui. 2015

机译：图像流形上的弱监督学习。
6. Robust Semi-Supervised Traffic Sign Recognition via Self-Training and Weakly-Supervised Learning [O] . Obed Tettey Nartey, Guowu Yang, Sarpong Kwadwo Asare, 2020

机译：通过自我训练和弱监督学习实现可靠的半监督交通标志识别
7. Weakly supervised clustering: Learning fine-grained signals from coarse labels [O] . Wager, Stefan, Blocker, Alexander, Cardin, Niall 2015

机译：弱监督聚类：学习粗粒度的细粒度信号标签

User-Click-Data-Based Fine-Grained Image Recognition via Weakly Supervised Metric Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅