Situated reference resolution using visual saliency and crowdsourcing-based priors for a spoken dialog system within vehicles
Journal: Computer Speech and Language

Abstract

In this paper, we address situated language understanding in a moving car. Specifically, we propose a reference resolution method that identifies which object in the surroundings a user's query refers to. We investigate how to predict which target object is likely to be queried given a visual scene, and what linguistic cues users naturally provide when describing a target object in a situated environment. The proposed methods incorporate the visual saliency of the scene as a prior, together with a second prior built from crowdsourced statistics of how people describe an object. We collected situated utterances from drivers using our research system, which was embedded in a real vehicle. The proposed algorithms improve the target identification rate by 15.1% absolute over a baseline that does not use the visual saliency-based prior and depends on a public database with limited category information.
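The abstract does not give the scoring model, so the following is only an illustrative sketch of how a saliency prior and crowdsourced description statistics might be combined to rank candidate objects. It assumes a naive Bayes style product, P(target | cues) ∝ P(target) · Π P(cue | target), which may differ from the paper's actual formulation. The function name `rank_candidates`, the smoothing constant, and the example objects and probabilities are all hypothetical.

```python
from math import prod


def rank_candidates(saliency, cue_likelihood, cues, floor=1e-6):
    """Rank candidate objects by an assumed posterior, highest first.

    saliency:       dict of candidate -> non-negative saliency score,
                    used as an (unnormalized) prior over targets.
    cue_likelihood: dict of candidate -> {cue: P(cue | candidate)},
                    standing in for crowdsourced description statistics.
    cues:           linguistic cues extracted from the user's utterance.
    floor:          hypothetical smoothing value for cues never observed
                    with a candidate.
    """
    if not saliency:
        return []

    scores = {}
    for candidate, prior in saliency.items():
        stats = cue_likelihood.get(candidate, {})
        # Assumption: cues are conditionally independent given the target.
        likelihood = prod(stats.get(cue, floor) for cue in cues)
        scores[candidate] = prior * likelihood

    total = sum(scores.values())
    if total == 0:
        # No usable evidence: fall back to a uniform distribution.
        posterior = {c: 1.0 / len(scores) for c in scores}
    else:
        posterior = {c: s / total for c, s in scores.items()}

    return sorted(posterior.items(), key=lambda kv: kv[1], reverse=True)


# Made-up scene: the cafe is more salient, but the spoken cues fit the bank.
saliency = {"cafe": 0.6, "bank": 0.4}
cue_likelihood = {
    "cafe": {"red": 0.1, "left": 0.5},
    "bank": {"red": 0.7, "left": 0.5},
}
ranking = rank_candidates(saliency, cue_likelihood, ["red", "left"])
```

With no cues, the ranking reduces to the saliency prior alone. Once cues are supplied, the description statistics can override a more salient but poorly matching object, as in the made-up scene above.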