Multi-Channel Speech Signal Enhancement for Robust Voice Trigger Detection and Automatic Speech Recognition

Abstract

A digital speech enhancement system performs a specific chain of digital signal processing operations on multi-channel sound pickup to produce a single, enhanced speech signal. The operations are designed to have low computational complexity, yet as a whole they yield an enhanced speech signal that supports accurate voice trigger detection and low word error rates in an automatic speech recognizer (ASR). The constituent operations or components of the system have been chosen so that the overall system is robust to changing acoustic conditions and can deliver the enhanced speech signal with latency low enough for online use (enabling real-time voice trigger detection and streaming ASR). Other embodiments are also described and claimed.

Bibliographic data

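  • Publication number: US2018350379A1

    Patent type

  • Publication date: 2018-12-06

    Original format: PDF

  • Applicant/Assignee: APPLE INC.

    Application number: US201715613127

  • Filing date: 2017-06-02

  • Classification: G10L21/02; G10L21/0232; G10L21/0272; G10L21/038

  • Country: US

  • Record added: 2022-08-21 12:04:01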
