首页> 外国专利> System and method for improving robustness of speech recognition using vocal tract length normalization codebooks

System and method for improving robustness of speech recognition using vocal tract length normalization codebooks

机译:使用声道长度归一化码本提高语音识别鲁棒性的系统和方法

摘要

Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for a each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook.
机译:公开了用于执行语音识别的系统,方法和计算机可读介质。该方法实施例包括从具有到接收的语音样本的最小声学距离的多个码本中选择一个码本,该多个码本是通过以下过程生成的:(a)计算多个扬声器中的每个扬声器的声道长度,(a) b)对于多个说话者中的每一个,对语音向量进行聚类,以及(c)为每个说话者创建一个码本,该码本包含各个说话者的声道长度,语音向量以及每个语音向量的可选向量权重的条目, (2)应用与所选码本相关联的各个声道长度以归一化所接收的语音样本以用于语音识别,以及(3)基于与所选码本相关联的各个声道长度来识别所接收的语音样本。

著录项

  • 公开/公告号US8160875B2

    专利类型

  • 公开/公告日2012-04-17

    原文格式PDF

  • 申请/专利权人 MAZIN GILBERT;

    申请/专利号US20100869039

  • 发明设计人 MAZIN GILBERT;

    申请日2010-08-26

  • 分类号G10L15/06;

  • 国家 US

  • 入库时间 2022-08-21 17:28:23

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号