Artificial Intelligence and Signal Processing Conference

Leveraging deep learning representation for search-based image annotation



Abstract

Image annotation aims to assign a set of tags to an image such that these tags provide a textual description of its content. Search-based methods extract relevant tags for an image from the tags of its nearest neighbor images in the training set. In these methods, the similarity of two images is determined by the distance between their feature vectors, so it is essential to extract informative feature vectors from images. In this paper, we propose a framework that utilizes deep learning to obtain visual representations of images. We apply different architectures of convolutional neural networks (CNN) to the input image and obtain a single feature vector that is a rich representation of the image's visual content. In this way, we eliminate the use of the multiple feature vectors employed by state-of-the-art annotation methods. We also integrate our feature extractors with a nearest-neighbor approach to obtain relevant tags for an image. Our experiments on standard image annotation datasets (Corel5k, ESP Game, and IAPR) demonstrate that our approach achieves higher precision, recall, and F1 than state-of-the-art methods such as 2PKNN, TagProp, and NMF-KNN.
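The abstract describes a two-stage pipeline: a pretrained CNN maps each image to a single feature vector, and tags are then transferred from the nearest training images. The following is a minimal sketch of that idea, assuming a ResNet-50 backbone, Euclidean distance, k=5 neighbors, and simple tag voting; these choices are illustrative and are not stated in the abstract.

```python
# Sketch of search-based annotation with CNN features (assumed setup, not the paper's exact method).
import numpy as np
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

# Pretrained CNN used as a feature extractor: replace the classifier head with identity.
backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
backbone.fc = torch.nn.Identity()
backbone.eval()

preprocess = T.Compose([
    T.Resize(256), T.CenterCrop(224), T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def extract_feature(path: str) -> np.ndarray:
    """Map one image file to a single L2-normalized CNN feature vector."""
    img = preprocess(Image.open(path).convert("RGB")).unsqueeze(0)
    with torch.no_grad():
        feat = backbone(img).squeeze(0).numpy()
    return feat / (np.linalg.norm(feat) + 1e-12)

def annotate(query_feat: np.ndarray,
             train_feats: np.ndarray,        # (N, D) features of training images
             train_tags: list,               # list of tag sets, one per training image
             k: int = 5, n_tags: int = 5) -> list:
    """Transfer tags from the k nearest training images by simple vote counting."""
    dists = np.linalg.norm(train_feats - query_feat, axis=1)
    votes = {}
    for idx in np.argsort(dists)[:k]:
        for tag in train_tags[idx]:
            votes[tag] = votes.get(tag, 0) + 1
    return sorted(votes, key=votes.get, reverse=True)[:n_tags]
```

A usage example would build `train_feats` by stacking `extract_feature` outputs for the training set and then call `annotate(extract_feature("query.jpg"), train_feats, train_tags)`; the paper's evaluated systems (2PKNN, TagProp, NMF-KNN) use more elaborate neighbor weighting than this plain vote.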
