SYSTEM AND METHOD FOR CONTINUOUS MULTIMODAL SPEECH AND GESTURE INTERACTION

Abstract

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing multimodal input. A system configured to practice the method continuously monitors an audio stream associated with a gesture input stream, and detects a speech event in the audio stream. Then the system identifies a temporal window associated with a time of the speech event, and analyzes data from the gesture input stream within the temporal window to identify a gesture event. The system processes the speech event and the gesture event to produce a multimodal command. The gesture in the gesture input stream can be directed to a display, but is remote from the display. The system can analyze the data from the gesture input stream by calculating an average of gesture coordinates within the temporal window.
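The windowing and averaging steps described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the names `GestureSample`, `gesture_in_window`, and `multimodal_command`, the window bounds (`before`, `after`), and the dictionary shape of the command are all assumptions made for illustration. The sketch assumes the speech event has already been detected and timestamped, and that gesture samples arrive as timestamped (x, y) display coordinates.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class GestureSample:
    """One hypothetical sample from the gesture input stream."""
    t: float  # timestamp in seconds
    x: float  # horizontal coordinate the gesture points at
    y: float  # vertical coordinate the gesture points at


def gesture_in_window(
    samples: Sequence[GestureSample],
    speech_time: float,
    before: float = 1.0,  # assumed: seconds of gesture data kept before the speech event
    after: float = 0.5,   # assumed: seconds of gesture data kept after the speech event
) -> Optional[Tuple[float, float]]:
    """Average the gesture coordinates inside the temporal window.

    Returns None when no gesture sample falls inside the window.
    """
    start, end = speech_time - before, speech_time + after
    inside = [s for s in samples if start <= s.t <= end]
    if not inside:
        return None
    n = len(inside)
    return (sum(s.x for s in inside) / n, sum(s.y for s in inside) / n)


def multimodal_command(
    transcript: str,
    speech_time: float,
    samples: Sequence[GestureSample],
) -> dict:
    """Pair a recognized utterance with the averaged gesture location."""
    return {
        "speech": transcript,
        "target": gesture_in_window(samples, speech_time),
    }


if __name__ == "__main__":
    stream = [
        GestureSample(t=0.0, x=0.0, y=0.0),
        GestureSample(t=9.5, x=100.0, y=200.0),
        GestureSample(t=10.2, x=110.0, y=220.0),
        GestureSample(t=12.0, x=500.0, y=500.0),
    ]
    print(multimodal_command("zoom in here", speech_time=10.0, samples=stream))
```

Averaging over the window, rather than taking the single sample nearest the speech event, smooths out jitter in a gesture made at a distance from the display, which is one plausible reason the abstract mentions it.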

Bibliographic details

  • Publication number: US2020150921A1

  • Publication date: 2020-05-14

  • Original format: PDF

  • Applicant/assignee: NUANCE COMMUNICATIONS INC.

  • Application number: US202016743117

  • Inventors: MICHAEL JOHNSTON; DERYA OZKAN

  • Filing date: 2020-01-15

  • Classification: G06F3/16; G10L15/22; G06F3/01

  • Country: US

  • Database entry time: 2022-08-21 11:25:02
