首页> 外文会议>Conference on Integration of Speech and Image Understanding >From images to sentences via spatial relations
【24h】

From images to sentences via spatial relations

机译:通过空间关系从图像到句子

获取原文

摘要

This work presents a conceptual framework for representing, manipulating, measuring, and communicating in natural language several ideas about topological (non-metric) spatial locations, object spatial contexts, and user expectations of spatial relationships. It articulates a theory of spatial relations, how they can be represented as fuzzy predicates internally, and how they can be appropriately derived from, imagery; then, how they can be augmented or filtered using prior knowledge, and lastly, how they can produce natural language statements about location and space. This framework quantifies the notions of context and vagueness, so that all spatial relations are measurably accurate, provably efficient, and matched to users' expectations. The work makes explicit two critical heuristics for reducing the complexity of the relationships implicit in imagery, one a general rule for single object descriptions, and the other a general rule for rank ordering object relationships. A derived working system combines variable aspects of computer science and linguistics in such a way so as to be extensible to many environments. The system has been demonstrated both in, a landmark navigation task and in a medical task, two very separate domains, and has been evaluated in both.
机译:这项工作礼物代表,操纵,测量和自然语言的几个想法有关拓扑(非公制)的空间位置通信的概念框架,对象的空间环境和空间关系的用户的期望。它阐明了一种空间关系理论,它们如何在内部代表为模糊谓词,以及如何适当地衍生自象;然后,如何使用先前的知识来增强或过滤它们,最后,它们如何生成关于地点和空间的自然语言陈述。该框架量化了上下文和模糊的概念,使所有空间关系都是可测量的准确性,可释放的,并且与用户的期望相匹配。这项工作使得显式的两个关键启发式可以降低图像中隐含的关系的复杂性,一个对象描述的一般规则,以及排序对象关系的另一个规则。派生的工作系统将计算机科学和语言学的可变方面组合在这样的方式中,以便对许多环境进行扩展。该系统已经在一个地标导航任务和医疗任务中展示了两个非常单独的域,并且已经在两者中进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号