首页> 外国专利> Singular value decomposition for improved voice recognition in presence of multi-talker background noise

Singular value decomposition for improved voice recognition in presence of multi-talker background noise

机译:奇异值分解可在存在多讲话者背景噪声的情况下改善语音识别

摘要

A system and method for providing speech recognition functionality offers improved accuracy and robustness in noisy environments having multiple speakers. The described technique includes receiving speech energy and converting the received speech energy to a digitized form. The digitized speech energy is decomposed into features that are then projected into a feature space having multiple speaker subspaces. The projected features fall either into one of the multiple speaker subspaces or outside of all speaker subspaces. A speech recognition operation is performed on a selected one of the multiple speaker subspaces to resolve the utterance to a command or data.
机译:用于提供语音识别功能的系统和方法在具有多个扬声器的嘈杂环境中提供了改进的准确性和鲁棒性。所描述的技术包括接收语音能量并将接收到的语音能量转换成数字形式。将数字化的语音能量分解为特征,然后将其投影到具有多个扬声器子空间的特征空间中。投影特征落入多个扬声器子空间之一或所有扬声器子空间之外。在多个说话者子空间中的选定一个子空间上执行语音识别操作,以将话语解析为命令或数据。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号