首页> 外国专利> direction of look at understanding spoken language in multimodal conversational interactions

direction of look at understanding spoken language in multimodal conversational interactions

机译：在多模式对话互动中理解口语的观察方向

页面导航

摘要
著录项
相似文献

摘要

The present invention relates to improving the accuracy of understanding and / or resolving references to visual elements in a visual context associated with a computerized conversation system. The techniques described here leverage gesture input and / or voice input to improve understanding of spoken language in computerized conversation systems. Leveraging eye input and speech input enhances understanding of spoken language in conversational systems by improving the accuracy with which the system can resolve references or interpret a user's intention for visual elements in a visual context. In at least one example, the techniques describe eye tracking to generate eye input, speech input recognition, and extraction of eye features and lexical features from user input. based, at least in part, on look characteristics and lexical characteristics, user utterances directed at visual elements in a visual context can be resolved.

机译：本发明涉及提高理解和/或解决与计算机化对话系统相关联的视觉环境中对视觉元素的引用的准确性。这里描述的技术利用手势输入和/或语音输入来提高对计算机化对话系统中口语的理解。利用眼睛输入和语音输入，通过提高系统可以解析参考或解释用户对视觉环境中视觉元素的意图的准确性，可以增强会话系统对口语的理解。在至少一个示例中，该技术描述了眼睛跟踪以生成眼睛输入，语音输入识别以及从用户输入中提取眼睛特征和词汇特征。至少部分地基于外观特征和词汇特征，可以解决针对视觉上下文中的视觉元素的用户话语。

著录项

公开/公告号BR112017003636A2

专利类型
公开/公告日2017-11-28

原文格式PDF
申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;
展开▼

申请/专利号BR20171103636
发明设计人 ANNA PROKOFIEVA;DILEK Z. HAKKANI-TUR;FETHIYE ASLI CELIKYLMAZ;LARRY HECK;MALCOLM SLANEY;
展开▼

申请日2015-09-25
分类号G06F3/16;G02B27;G06F3/01;G06K9;G10L15;
国家 BR
入库时间 2022-08-21 12:53:49

相似文献

专利
外文文献
中文文献