Gaze for understanding spoken language in conversational dialogue in multiple modes

Abstract

Techniques are described for improving the accuracy with which a computerized conversational system understands and resolves references to visual elements in its associated visual context. The techniques combine gaze input with gestures and/or speech input to improve spoken language understanding in conversational systems. The improvement comes from resolving references to visual elements in the visual context, or interpreting the user's intent with respect to them, more accurately. In at least one example, the techniques track gaze to generate gaze input, recognize speech input, and extract gaze features and lexical features from the user input. A user utterance directed at a visual element in the visual context can then be resolved based at least in part on the gaze features and lexical features.
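
The abstract does not say how gaze and lexical features are combined, so the following is only a minimal sketch of the general idea, not the patented method. It assumes that each visual element has a text label and a bounding box, that gaze input arrives as a list of (x, y) fixation points, and that the two features are mixed by a simple weighted sum. The names `Element`, `gaze_feature`, `lexical_feature`, `resolve`, and `gaze_weight` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Element:
    """A visual element on screen (hypothetical structure for illustration)."""
    label: str
    x: float  # left edge of the bounding box
    y: float  # top edge of the bounding box
    w: float  # width
    h: float  # height

    def contains(self, px: float, py: float) -> bool:
        return self.x <= px <= self.x + self.w and self.y <= py <= self.y + self.h


def gaze_feature(element, fixations):
    """Fraction of gaze fixations that fall inside the element's bounding box."""
    if not fixations:
        return 0.0
    hits = sum(1 for px, py in fixations if element.contains(px, py))
    return hits / len(fixations)


def lexical_feature(element, utterance):
    """Fraction of the label's words that also appear in the utterance."""
    label_words = set(element.label.lower().split())
    spoken_words = set(utterance.lower().split())
    if not label_words:
        return 0.0
    return len(label_words & spoken_words) / len(label_words)


def resolve(utterance, fixations, elements, gaze_weight=0.5):
    """Return the element with the highest combined score.

    The weighted sum is an assumption made for this sketch; a real system
    would more likely learn the combination from data.
    """
    def score(element):
        return (gaze_weight * gaze_feature(element, fixations)
                + (1 - gaze_weight) * lexical_feature(element, utterance))
    return max(elements, key=score)


elements = [
    Element("The Matrix", 0, 0, 100, 50),
    Element("The Matrix Reloaded", 0, 60, 100, 50),
]
# All three fixations land on the second element.
fixations = [(50, 80), (40, 90), (55, 100)]
print(resolve("play the matrix", fixations, elements).label)
```

In this toy setup the words alone favor "The Matrix", since every word of its label is spoken, but the fixations all fall on the second item, so the combined score selects "The Matrix Reloaded". With an empty fixation list the same call falls back to the lexical match.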