Journal of visual communication & image representation

A probabilistic topic model using deep visual word representation for simultaneous image classification and annotation

Abstract

Research has shown that holistic examination of an image leads to better understanding than separate processes each devoted to a single task such as annotation, classification, or segmentation. Over the past decades, there have been several efforts toward simultaneous image classification and annotation using probabilistic or neural-network-based topic models. Despite their relative success, most of these models suffer from poor visual word representation and from the imbalance between the number of visual and annotation words in the training data. This paper proposes a novel model for simultaneous image classification and annotation based on SupDocNADE, a neural-network-based topic model for image classification and annotation. The proposed model, named wSupDocNADE, addresses these shortcomings by introducing a new coding scheme and a weighting mechanism for the SupDocNADE model. In the coding step, several patches extracted from the input image are first fed to a deep convolutional neural network, and the feature vectors obtained from this network are coded using LLC coding. These vectors are then aggregated into a final descriptor through sum pooling. To overcome the imbalance between visual and annotation words, a weighting factor is assigned to each visual or annotation word: the weights of the visual words are set according to their frequencies obtained from the pooling step, while the weights of the annotation words are learned from the training data. Experimental results on three benchmark datasets show the superiority of the proposed model over state-of-the-art models in both image classification and annotation tasks. (C) 2019 Elsevier Inc. All rights reserved.
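The abstract only outlines the coding step (CNN patch features, LLC coding, sum pooling); the paper's actual implementation details are not given here. The following Python sketch is a minimal illustration of what that step could look like. The codebook, the function names llc_code and image_descriptor, and the parameters k and beta are hypothetical choices for illustration, not the authors' code.

```python
import numpy as np

def llc_code(x, codebook, k=5, beta=1e-4):
    """Approximate Locality-constrained Linear Coding (LLC) for one patch feature.

    x        : (d,) CNN feature vector of a single image patch
    codebook : (K, d) visual-word codebook (assumed here to come from k-means)
    Returns a sparse (K,) code with non-zeros only on the k nearest codewords.
    """
    # keep the k codewords closest to the feature
    dists = np.linalg.norm(codebook - x, axis=1)
    idx = np.argsort(dists)[:k]
    B = codebook[idx]                      # (k, d) local basis
    z = B - x                              # shift basis to the feature
    C = z @ z.T                            # local covariance
    C += beta * np.trace(C) * np.eye(k)    # regularization for stability
    w = np.linalg.solve(C, np.ones(k))
    w /= w.sum()                           # sum-to-one constraint of LLC
    code = np.zeros(codebook.shape[0])
    code[idx] = w
    return code

def image_descriptor(patch_features, codebook, k=5):
    """Sum-pool the LLC codes of all patches into one visual-word descriptor."""
    codes = np.stack([llc_code(f, codebook, k) for f in patch_features])
    return codes.sum(axis=0)               # (K,) pooled visual-word frequencies
```

Under this reading, the pooled frequencies returned by image_descriptor would serve as the visual-word weights described in the abstract, whereas the annotation-word weights are learned from the training data.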
