SYSTEM AND METHOD FOR SPEECH UNDERSTANDING VIA INTEGRATED AUDIO AND VISUAL BASED SPEECH RECOGNITION

Abstract

The present teaching relates to a method, system, medium, and implementations for speech recognition. An audio signal representing the speech of a user engaged in a dialogue is received, along with a visual signal that captures the user uttering the speech. A first speech recognition result is obtained by performing audio-based speech recognition on the audio signal. Lip movement of the user is detected from the visual signal, and a second speech recognition result is obtained by performing lip-reading-based speech recognition. The first and second speech recognition results are then integrated to generate an integrated speech recognition result.
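The abstract does not say how the two recognition results are integrated. As an illustration only, the sketch below assumes each recognizer returns candidate transcripts with confidence scores and combines them by a weighted sum (late fusion). The names `Hypothesis`, `integrate`, and `audio_weight` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hypothesis:
    """One candidate transcript with a confidence in [0, 1] (assumed format)."""
    text: str
    confidence: float


def integrate(audio, visual, audio_weight=0.7):
    """Combine audio-based and lip-reading-based hypotheses by weighted sum.

    This weighting scheme is an assumption for illustration; the patent
    abstract only states that the two results are integrated.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must be in [0, 1]")
    visual_weight = 1.0 - audio_weight

    # Accumulate a combined score per distinct transcript.
    scores = {}
    for hyp in audio:
        scores[hyp.text] = scores.get(hyp.text, 0.0) + audio_weight * hyp.confidence
    for hyp in visual:
        scores[hyp.text] = scores.get(hyp.text, 0.0) + visual_weight * hyp.confidence

    if not scores:
        raise ValueError("no hypotheses to integrate")

    # The integrated result is the transcript with the highest combined score.
    best_text = max(scores, key=scores.get)
    return Hypothesis(best_text, scores[best_text])


if __name__ == "__main__":
    # Audio alone slightly prefers "nice"; lip movement strongly suggests "mice".
    audio_results = [Hypothesis("nice", 0.55), Hypothesis("mice", 0.45)]
    visual_results = [Hypothesis("mice", 0.9), Hypothesis("nice", 0.1)]
    print(integrate(audio_results, visual_results, audio_weight=0.6))
```

In this example the audio recognizer is nearly undecided between two similar-sounding words, and the lip-reading result tips the combined score toward "mice". A real system might instead fuse at the feature level or adapt the weight to the noise conditions; the fixed weight here is simply the easiest variant to read.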

Bibliographic details

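  • Publication number: US2019279642A1

  • Patent type:

  • Publication date: 2019-09-12

  • Original format: PDF

  • Applicant/assignee: DMAI INC.

  • Application number: US201916277136

  • Inventors: NISHANT SHUKLA; ASHWIN DHARNE

  • Filing date: 2019-02-15

  • Classification codes: G10L15/32; G10L15/02; G10L15/25; G10L15/22; G10L15/30

  • Country: US

  • Date added to database: 2022-08-21 12:13:29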
