...
首页> 外文期刊>Multimedia Tools and Applications >Saliency-based selection of visual content for deep convolutional neural networks
【24h】

Saliency-based selection of visual content for deep convolutional neural networks

机译:基于显着性的深度卷积神经网络视觉内容选择

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The automatic description of digital multimedia content was mainly developed for classification tasks, retrieval systems and massive ordering of data. Preservation of cultural heritage is a field of high importance of application of these methods. We address classification problem in cultural heritage such as classification of architectural styles in digital photographs of Mexican cultural heritage. In general, the selection of relevant content in the scene for training classification models makes the models more efficient in terms of accuracy and training time. Here we use a saliency-driven approach to predict visual attention in images and use it to train a Deep Convolutional Neural Network. Also, we present an analysis of the behavior of the models trained under the state-of-the-art image cropping and the saliency maps. To train invariant models to rotations, data augmentation of training set is required, which posses problems of filling normalization of crops, we study were different padding techniques and we find an optimal solution. The results are compared with the state-of-the-art in terms of accuracy and training time. Furthermore, we are studying saliency cropping in training and generalization for another classical task such as weak labeling of massive collections of images containing objects of interest. Here the experiments are conducted on a large subset of ImageNet database. This work is an extension of preliminary research in terms of image padding methods and generalization on large scale generic database.
机译:数字多媒体内容的自动描述主要用于分类任务,检索系统和大量数据排序。保护文化遗产是应用这些方法的高度重要的领域。我们处理文化遗产中的分类问题,例如墨西哥文化遗产的数码照片中的建筑风格分类。通常,在场景中为训练分类模型选择相关内容会使模型在准确性和训练时间方面更加有效。在这里,我们使用显着性驱动的方法来预测图像中的视觉注意力,并将其用于训练深度卷积神经网络。此外,我们还介绍了在最新的图像裁剪和显着图下训练的模型的行为。为了将不变模型训练为轮换,需要增加训练集的数据,这会带来农作物灌浆归一化的问题,我们研究了不同的填充技术,并找到了最佳解决方案。在准确性和训练时间方面,将结果与最新技术进行比较。此外,我们正在研究针对另一项经典任务的训练和概括中的显着性裁剪,例如对包含感兴趣对象的大量图像进行弱标记。在这里,实验是在ImageNet数据库的很大一部分上进行的。这项工作是对图像填充方法和大规模通用数据库泛化方面的初步研究的扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号