首页> 外国专利> Adaptive multichannel dereverberation for automatic speech recognition

Adaptive multichannel dereverberation for automatic speech recognition

机译:自适应多通道去混响用于自动语音识别

摘要

Utilizing an adaptive multichannel technique to mitigate reverberation present in received audio signals, prior to providing corresponding audio data to one or more additional component(s), such as automatic speech recognition (ASR) components. Implementations disclosed herein are “adaptive”, in that they utilize a filter, in the reverberation mitigation, that is online, causal and varies depending on characteristics of the input. Implementations disclosed herein are “multichannel”, in that a corresponding audio signal is received from each of multiple audio transducers (also referred to herein as “microphones”) of a client device, and the multiple audio signals (e.g., frequency domain representations thereof) are utilized in updating of the filter—and dereverberation occurs for audio data corresponding to each of the audio signals (e.g., frequency domain representations thereof) prior to the audio data being provided to ASR component(s) and/or other component(s).
机译:在将对应的音频数据提供给一个或多个其他组件(例如自动语音识别(ASR)组件)之前,利用自适应多通道技术来缓解接收到的音频信号中存在的混响。本文公开的实施方式是“自适应的”,因为它们在混响缓解中利用滤波器​​,该滤波器是在线的,因果关系的,并且取决于输入的特性而变化。本文公开的实施方式是“多通道”,因为从客户端设备的多个音频换能器(在本文中也称为“麦克风”)中的每个接收对应的音频信号,并且多个音频信号(例如,其频域表示)在对滤波器的更新中使用了“混响”,并且在将音频数据提供给一个或多个ASR组件和/或其他组件之前,与每个音频信号相对应的音频数据(例如其频域表示)会发生混响。

著录项

  • 公开/公告号US10762914B2

    专利类型

  • 公开/公告日2020-09-01

    原文格式PDF

  • 申请/专利权人 GOOGLE LLC;

    申请/专利号US201816032996

  • 申请日2018-07-11

  • 分类号G10L21;G10L21/0208;G10L15/20;G10L15/22;G10L15/065;G06F3/16;G06N3/02;G06F17/14;G10L15/06;G10L21/0216;

  • 国家 US

  • 入库时间 2022-08-21 11:29:17

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号