Saliency-based selection of visual content for deep convolutional neural networks

Montoya Obeso A.; Benois-Pineau J.; Garcia Vazquez M. S.; Acosta A. A. Ramirez

首页> 外文期刊>Multimedia Tools and Applications >Saliency-based selection of visual content for deep convolutional neural networks

【24h】

Saliency-based selection of visual content for deep convolutional neural networks

机译：基于显着性的深度卷积神经网络视觉内容选择

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The automatic description of digital multimedia content was mainly developed for classification tasks, retrieval systems and massive ordering of data. Preservation of cultural heritage is a field of high importance of application of these methods. We address classification problem in cultural heritage such as classification of architectural styles in digital photographs of Mexican cultural heritage. In general, the selection of relevant content in the scene for training classification models makes the models more efficient in terms of accuracy and training time. Here we use a saliency-driven approach to predict visual attention in images and use it to train a Deep Convolutional Neural Network. Also, we present an analysis of the behavior of the models trained under the state-of-the-art image cropping and the saliency maps. To train invariant models to rotations, data augmentation of training set is required, which posses problems of filling normalization of crops, we study were different padding techniques and we find an optimal solution. The results are compared with the state-of-the-art in terms of accuracy and training time. Furthermore, we are studying saliency cropping in training and generalization for another classical task such as weak labeling of massive collections of images containing objects of interest. Here the experiments are conducted on a large subset of ImageNet database. This work is an extension of preliminary research in terms of image padding methods and generalization on large scale generic database.

机译：数字多媒体内容的自动描述主要用于分类任务，检索系统和大量数据排序。保护文化遗产是应用这些方法的高度重要的领域。我们处理文化遗产中的分类问题，例如墨西哥文化遗产的数码照片中的建筑风格分类。通常，在场景中为训练分类模型选择相关内容会使模型在准确性和训练时间方面更加有效。在这里，我们使用显着性驱动的方法来预测图像中的视觉注意力，并将其用于训练深度卷积神经网络。此外，我们还介绍了在最新的图像裁剪和显着图下训练的模型的行为。为了将不变模型训练为轮换，需要增加训练集的数据，这会带来农作物灌浆归一化的问题，我们研究了不同的填充技术，并找到了最佳解决方案。在准确性和训练时间方面，将结果与最新技术进行比较。此外，我们正在研究针对另一项经典任务的训练和概括中的显着性裁剪，例如对包含感兴趣对象的大量图像进行弱标记。在这里，实验是在ImageNet数据库的很大一部分上进行的。这项工作是对图像填充方法和大规模通用数据库泛化方面的初步研究的扩展。

著录项

来源
《Multimedia Tools and Applications》 |2019年第8期|9553-9576|共24页
作者
Montoya Obeso A.; Benois-Pineau J.; Garcia Vazquez M. S.; Acosta A. A. Ramirez;
展开▼
作者单位

Inst Politecn Nacl, Comp Sci, Mexico City, DF, Mexico|Univ Bordeaux, Comp Sci, Bordeaux, France;

Univ Bordeaux, Comp Sci, Bordeaux, France;

Inst Politecn Nacl, CITEDI, Digital Technol Res & Dev Ctr, Mexico City, DF, Mexico;

MIRAL R&D&I, Res Dev Integrat Innovat, San Diego, CA USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Data selection; Visual attention prediction; Cultural heritage; Deep learning;

机译：数据选择;视觉注意力预测;文化传承;深度学习;

相似文献

外文文献
中文文献
专利

1. Saliency-based selection of visual content for deep convolutional neural networks [J] . Montoya Obeso A., Benois-Pineau J., Garcia Vazquez M. S., Multimedia Tools and Applications . 2019,第8期

机译：基于显着的深度卷积神经网络的视觉内容选择
2. Saliency-based deep convolutional neural network for no-reference image quality assessment [J] . Sen Jia, Yang Zhang Multimedia Tools and Applications . 2018,第12期

机译：基于显着性的深度卷积神经网络用于无参考图像质量评估
3. Convolutional neural networks for relevance feedback in content based image retrieval A Content based image retrieval system that exploits convolutional neural networks both for feature extraction and for relevance feedback [J] . Lorenzo Putzu, Luca Piras, Giorgio Giacinto Multimedia Tools and Applications . 2020,第37a38期

机译：基于内容的图像检索的相关反馈的卷积神经网络基于内容的图像检索系统，用于利用特征提取和相关性反馈的卷积神经网络
4. A Novel Ranking Algorithm of Enhanced Images using a Convolutional Neural Network and a Saliency-based Patch Selection Scheme [C] . Aladine Chetouani, Muhammad Ali Qureshi, Mohamed Deriche, International Conference on Quality of Multimedia Experience . 2019

机译：卷积神经网络和基于显着性的斑块选择方案的增强图像排序新算法
5. Content-Based Music Recommendation with the LFM-1b Dataset and Sample-Level Deep Convolutional Neural Networks [D] . Platt, Devin. 2017

机译：具有LFM-1b数据集和样本级深度卷积神经网络的基于内容的音乐推荐
6. Toward Content Based Image Retrieval with Deep Convolutional Neural Networks [O] . Judah E.S. Sklan, Andrew J. Plassard, Daniel Fabbri, -1

机译：利用深度卷积神经网络实现基于内容的图像检索
7. Saliency-based deep convolutional neural network for no-reference image quality assessment [O] . Jia, Sen, Zhang, Yang 2017

机译：基于显着性的深度卷积神经网络用于无参考图像质量评估

Saliency-based selection of visual content for deep convolutional neural networks

摘要

著录项

相似文献

相关主题

期刊订阅