首页> 外国专利> Singular value decomposition for improved voice recognition in presence of multi-talker background noise

Singular value decomposition for improved voice recognition in presence of multi-talker background noise

机译：奇异值分解可在存在多讲话者背景噪声的情况下改善语音识别

页面导航

摘要
著录项
相似文献

摘要

A system and method for providing speech recognition functionality offers improved accuracy and robustness in noisy environments having multiple speakers. The described technique includes receiving speech energy and converting the received speech energy to a digitized form. The digitized speech energy is decomposed into features that are then projected into a feature space having multiple speaker subspaces. The projected features fall either into one of the multiple speaker subspaces or outside of all speaker subspaces. A speech recognition operation is performed on a selected one of the multiple speaker subspaces to resolve the utterance to a command or data.

机译：用于提供语音识别功能的系统和方法在具有多个扬声器的嘈杂环境中提供了改进的准确性和鲁棒性。所描述的技术包括接收语音能量并将接收到的语音能量转换成数字形式。将数字化的语音能量分解为特征，然后将其投影到具有多个扬声器子空间的特征空间中。投影特征落入多个扬声器子空间之一或所有扬声器子空间之外。在多个说话者子空间中的选定一个子空间上执行语音识别操作，以将话语解析为命令或数据。

著录项

公开/公告号US9177557B2

专利类型
公开/公告日2015-11-03

原文格式PDF
申请/专利权人 GAURAV TALWAR;RATHINAVELU CHENGALVARAYAN;
展开▼

申请/专利号US20090498811
发明设计人 RATHINAVELU CHENGALVARAYAN;GAURAV TALWAR;
展开▼

申请日2009-07-07
分类号G10L17/02;G10L15/20;G10L21/0208;
国家 US
入库时间 2022-08-21 15:20:00

相似文献

专利
外文文献
中文文献