首页> 外文期刊>Pattern recognition letters >Deep neural network based image annotation
【24h】

Deep neural network based image annotation

机译:基于深度神经网络的图像标注

获取原文
获取原文并翻译 | 示例
           

摘要

Multilabel image annotation is one of the most important open problems in computer vision field. Unlike existing works that usually use conventional visual features to annotate images, features based on deep learning have shown potential to achieve outstanding performance. In this work, we propose a multimodal deep learning framework, which aims to optimally integrate multiple deep neural networks pretrained with convolutional neural networks. In particular, the proposed framework explores a unified two stage learning scheme that consists of (i) learning to fine-tune the parameters of deep neural network with respect to each individual modality, and (ii) learning to find the optimal combination of diverse modalities simultaneously in a coherent process. Experiments conducted on a variety of public datasets evaluate the performance of the proposed framework for multilabel image annotation, in which the encouraging results validate the effectiveness of the proposed algorithms. (C) 2015 Elsevier B.V. All rights reserved.
机译:多标签图像标注是计算机视觉领域最重要的开放性问题之一。与通常使用常规视觉功能为图像添加注释的现有作品不同,基于深度学习的功能已显示出实现出色性能的潜力。在这项工作中,我们提出了一种多模式深度学习框架,该框架旨在最佳地集成经过卷积神经网络预训练的多个深度神经网络。特别是,提出的框架探索了一个统一的两阶段学习方案,该方案包括(i)学习以针对每个单独的模式微调深度神经网络的参数,以及(ii)学习以找到各种模式的最佳组合在一个连贯的过程中同时进行。在各种公共数据集上进行的实验评估了所提出的多标签图像标注框架的性能,其中令人鼓舞的结果证实了所提出算法的有效性。 (C)2015 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Pattern recognition letters》 |2015年第1期|103-108|共6页
  • 作者单位

    Nanjing Univ Posts & Telecommun, Sch Automat, Nanjing 210046, Jiangsu, Peoples R China.;

    Nanjing Univ Posts & Telecommun, Sch Automat, Nanjing 210046, Jiangsu, Peoples R China.;

    Nanjing Univ Posts & Telecommun, Sch Automat, Nanjing 210046, Jiangsu, Peoples R China.;

    Chinese Acad Sci, Inst Automat, Beijing 110093, Peoples R China.;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Deep learning; Multi-label; Multi-modal; Image annotation;

    机译:深度学习;多标签;多模式;图像标注;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号