Conference: ARPA Image Understanding Workshop
From Pictures to Words: Generating Locative Descriptions of Objects in an Image

Abstract

In this paper we describe a system that integrates image processing and natural language processing for tasks that involve communicating visual information. The system determines information about the spatial relationship of objects in images and conveys it in the form of an English sentence. We are exploring the applicability of this system to two tasks: landmark navigation and the generation of descriptions of abnormal densities in radiographs. Our previous work described a computational model of preposition semantics and a method for handling some of the ambiguities associated with natural language. Here we concentrate on generating optimal locative expressions for object pairs. In describing the system we will explain the methodologies it employs to achieve its goals. We will illustrate the system's use of these methodologies through several examples for each task.