首页> 外国专利> direction of look at understanding spoken language in multimodal conversational interactions

direction of look at understanding spoken language in multimodal conversational interactions

机译:在多模式对话互动中理解口语的观察方向

摘要

The present invention relates to improving the accuracy of understanding and / or resolving references to visual elements in a visual context associated with a computerized conversation system. The techniques described here leverage gesture input and / or voice input to improve understanding of spoken language in computerized conversation systems. Leveraging eye input and speech input enhances understanding of spoken language in conversational systems by improving the accuracy with which the system can resolve references or interpret a user's intention for visual elements in a visual context. In at least one example, the techniques describe eye tracking to generate eye input, speech input recognition, and extraction of eye features and lexical features from user input. based, at least in part, on look characteristics and lexical characteristics, user utterances directed at visual elements in a visual context can be resolved.
机译:本发明涉及提高理解和/或解决与计算机化对话系统相关联的视觉环境中对视觉元素的引用的准确性。这里描述的技术利用手势输入和/或语音输入来提高对计算机化对话系统中口语的理解。利用眼睛输入和语音输入,通过提高系统可以解析参考或解释用户对视觉环境中视觉元素的意图的准确性,可以增强会话系统对口语的理解。在至少一个示例中,该技术描述了眼睛跟踪以生成眼睛输入,语音输入识别以及从用户输入中提取眼睛特征和词汇特征。至少部分地基于外观特征和词汇特征,可以解决针对视觉上下文中的视觉元素的用户话语。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号