
USING VISUAL CUES TO DISAMBIGUATE SPEECH INPUTS



Abstract

Embodiments related to recognizing speech inputs are disclosed. One disclosed embodiment provides a method for recognizing a speech input including receiving depth information of a physical space from a depth camera, determining an identity of a user in the physical space based on the depth information, receiving audio information from one or more microphones, and determining a speech input from the audio input. If the speech input comprises an ambiguous term, the ambiguous term in the speech input is compared to one or more of depth image data received from the depth image sensor and digital content consumption information for the user to identify an unambiguous term corresponding to the ambiguous term. After identifying the unambiguous term, an action is taken on the computing device based on the speech input and the unambiguous term.
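The disambiguation step in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `UserContext` fields, the `AMBIGUOUS_TERMS` set, the `resolve_term` function, and the rule of preferring a gesture target over consumption history are all hypothetical choices made for illustration. The sketch assumes that upstream components have already identified the user from depth information and produced a text transcript of the speech input.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Hypothetical set of terms treated as ambiguous (an assumption, not from the patent).
AMBIGUOUS_TERMS = {"it", "this", "that", "him", "her", "them"}


@dataclass
class UserContext:
    """Per-user context; all fields are illustrative assumptions."""
    user_id: str
    # Object the user appears to point at, as derived from depth image data.
    pointed_at: Optional[str] = None
    # Digital content consumption history for this user, oldest first.
    recently_consumed: List[str] = field(default_factory=list)


def resolve_term(speech_input: str, ctx: UserContext) -> Optional[Tuple[str, str]]:
    """Return (ambiguous_term, unambiguous_term), or None if nothing is resolved."""
    words = [w.strip(".,!?").lower() for w in speech_input.split()]
    term = next((w for w in words if w in AMBIGUOUS_TERMS), None)
    if term is None:
        return None  # no ambiguous term in the speech input
    # Assumed priority: a visual cue from depth data wins over consumption history.
    if ctx.pointed_at is not None:
        return term, ctx.pointed_at
    if ctx.recently_consumed:
        return term, ctx.recently_consumed[-1]
    return None  # ambiguous term found, but no context available to resolve it


ctx = UserContext("user-1", recently_consumed=["Album A", "Movie B"])
print(resolve_term("Play that again", ctx))
```

A caller would then act on the original speech input together with the returned unambiguous term, for example by starting playback of the resolved item.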


Bibliographic data

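Publication number: US2014214415A1

Patent type:
Publication date: 2014-07-31

Original format: PDF
Applicant/assignee: MICROSOFT CORPORATION;

Application number: US201313750674
Inventor: CHRISTIAN KLEIN;

Filing date: 2013-01-25
Classification: G10L15/22;
Country: US
Date added to database: 2022-08-21 16:06:55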
