首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >BabyTalk: Understanding and Generating Simple Image Descriptions
【24h】

BabyTalk: Understanding and Generating Simple Image Descriptions

机译:BabyTalk:了解和生成简单的图像描述

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

We present a system to automatically generate natural language descriptions from images. This system consists of two parts. The first part, content planning, smooths the output of computer vision-based detection and recognition algorithms with statistics mined from large pools of visually descriptive text to determine the best content words to use to describe an image. The second step, surface realization, chooses words to construct natural language sentences based on the predicted content and general statistics from natural language. We present multiple approaches for the surface realization step and evaluate each using automatic measures of similarity to human generated reference descriptions. We also collect forced choice human evaluations between descriptions from the proposed generation system and descriptions from competing approaches. The proposed system is very effective at producing relevant sentences for images. It also generates descriptions that are notably more true to the specific image content than previous work.
机译:我们提出了一种系统,可以自动从图像生成自然语言描述。该系统由两部分组成。第一部分是内容计划,它使用从大量视觉描述性文本池中提取的统计数据来平滑基于计算机视觉的检测和识别算法的输出,以确定用于描述图像的最佳内容词。第二步,表面实现,根据自然语言的预测内容和一般统计信息,选择单词来构建自然语言句子。我们为表面实现步骤提供了多种方法,并使用与人类生成的参考描述相似的自动度量来评估每种方法。我们还从提议的生成系统的描述与竞争方法的描述之间收集了强制选择的人工评估。所提出的系统在产生图像的相关句子方面非常有效。它还生成比以前的作品对特定图像内容更真实的描述。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号