Journal: Neural Computing & Applications

A general description generator for human activity images based on deep understanding framework


Abstract

Image description generation has substantial application value in online image search. Inspired by recent advances in neocortex research, we design a deep image understanding framework that implements a description generator for general images involving human activities. Unlike existing work on image description, which treats the task as a retrieval problem rather than attempting to understand the image, our framework recognizes the human-object interaction (HOI) activity in the image through co-occurrence analysis of the 3-D spatial layout and generates a natural language description of what is actually happening in the image. We propose a deep hierarchical model for image recognition and a syntactic tree-based model for natural language generation. To support online image search, both models are designed to extract features uniformly from humans and from different object classes, and to produce well-formed sentences that describe exactly what is happening in the image. In experiments on a dataset containing images from the phrasal recognition dataset, the six-class sports dataset and the UIUC Pascal sentence dataset, we demonstrate that our framework outperforms state-of-the-art methods at recognizing HOI activities and generating image descriptions.
