Annual Conference on Neural Information Processing Systems

Deep Representations and Codes for Image Auto-Annotation

Abstract

The task of image auto-annotation, namely assigning a set of relevant tags to an image, is challenging due to the size and variability of tag vocabularies. Consequently, most existing algorithms focus on tag assignment and fix an often large number of hand-crafted features to describe image characteristics. In this paper we introduce a hierarchical model for learning representations of standard-sized color images from the pixel level, removing the need for engineered feature representations and subsequent feature selection for annotation. We benchmark our model on the STL-10 recognition dataset, achieving state-of-the-art performance. When our features are combined with TagProp (Guillaumin et al.), we compete with or outperform existing annotation approaches that use over a dozen distinct hand-crafted image descriptors. Furthermore, using 256-bit codes and Hamming distance for training TagProp, we exchange only a small reduction in performance for efficient storage and fast comparisons. Self-taught learning is used in all of our experiments, and deeper architectures always outperform shallow ones.
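To illustrate the compact-code idea mentioned in the abstract, the sketch below shows how 256-bit codes can be packed into bytes and compared with Hamming distance. The median-threshold binarization, the array names, and the sizes are assumptions for illustration only, not the paper's exact procedure.

```python
import numpy as np

def pack_codes(bits):
    """Pack an (n, 256) array of 0/1 bits into (n, 32) uint8 words."""
    return np.packbits(bits.astype(np.uint8), axis=1)

def hamming_distances(packed_query, packed_db):
    """Hamming distances between one packed query code and a database of packed codes."""
    xor = np.bitwise_xor(packed_query[None, :], packed_db)  # (n, 32) differing-bit bytes
    return np.unpackbits(xor, axis=1).sum(axis=1)           # popcount per database item

# Hypothetical real-valued features, binarized by thresholding each dimension
# at its median (an assumed binarization rule, used here only as a stand-in
# for the learned representations described in the paper).
rng = np.random.default_rng(0)
feats = rng.standard_normal((1000, 256))
bits = feats > np.median(feats, axis=0)

db = pack_codes(bits)
dists = hamming_distances(db[0], db)
print(dists[:5])  # distance of the first item to itself is 0
```

Because each code occupies only 32 bytes and comparison is a XOR plus a popcount, nearest-neighbor lookups of this kind stay cheap in both storage and time, which is the trade-off the abstract refers to.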
