Implementing a high accuracy continuous speech recognizer on a fixed-point processor

Abstract

A small-vocabulary speech recognizer suitable for implementation on a 16-bit fixed-point DSP is described. The input speech x_t is sampled by an analog-to-digital (A/D) converter 11, and the digital samples are applied to MFCC (Mel-scaled cepstrum coefficients) front-end processing 13. For robustness to background noise, PMC (parallel model combination) 15 is integrated. The MFCC and the Gaussian mean vectors are applied to PMC 15. Together, MFCC and PMC provide speech features extracted in noise, which are used to modify the HMMs. The noise-adapted HMMs, excluding mean vectors, are applied to the search procedure to recognize the grammar.

The method of computing MFCC comprises performing dynamic Q-point computation for the pre-emphasis, Hamming window, FFT, complex-FFT-to-power-spectrum, and Mel-scale-power-spectrum-to-filter-bank steps, followed by a log filter bank step; after the log filter bank step, fixed Q-point computation is performed. A polynomial fit is used to compute log2 in the log filter bank step.

The method of computing PMC comprises: computing a noise MFCC profile; cosine-transforming the MFCC into the Mel-scale filter bank domain; converting the log filter bank into a linear filter bank with an exponential, where a polynomial fit is used to compute exp2; performing the model combination in the linear filter bank domain; and converting the noise-compensated linear filter bank back into MFCC by log and inverse cosine transform.
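
The polynomial-fit log2 mentioned in the abstract can be illustrated with a minimal integer-only sketch. The abstract does not give the polynomial order, coefficients, or Q formats, so everything below is an assumption made for illustration: a quadratic fitted through f = 0, 0.5 and 1, Q14 coefficients, a Q15 mantissa, and a Q10 result. The names `log2_fixed`, `A_Q14`, `B_Q14` and `Q_OUT` are hypothetical.

```python
Q_OUT = 10      # assumed fixed Q-point of the result (not from the patent)
A_Q14 = 21952   # ~ 1.33985 in Q14 (illustrative quadratic fit)
B_Q14 = -5568   # ~ -0.33985 in Q14, chosen so that A + B = 1.0


def log2_fixed(x: int, q_in: int) -> int:
    """Approximate log2(x / 2**q_in) and return it as an integer in Q_OUT.

    x is a positive integer whose Q-point is q_in, i.e. it represents
    the real value x / 2**q_in.
    """
    if x <= 0:
        raise ValueError("log2 is only defined for positive inputs")

    # Integer part: position of the leading one, corrected by the Q-point.
    msb = x.bit_length() - 1
    exponent = msb - q_in

    # Normalize the mantissa to [1, 2) and keep its fraction f as Q15.
    if msb >= 15:
        f_q15 = (x >> (msb - 15)) - (1 << 15)
    else:
        f_q15 = (x << (15 - msb)) - (1 << 15)

    # log2(1 + f) ~ f * (A + B * f), evaluated in Horner form.
    inner_q14 = A_Q14 + ((B_Q14 * f_q15) >> 15)   # Q14
    frac_q14 = (f_q15 * inner_q14) >> 15          # Q15 * Q14 >> 15 = Q14

    return (exponent << Q_OUT) + (frac_q14 >> (14 - Q_OUT))
```

Dividing the result by `2**Q_OUT` gives the approximate real-valued logarithm. With this particular quadratic the error stays on the order of 0.01; a higher-order fit would reduce it at the cost of more multiplies. Because the input Q-point is passed in and the output Q-point is fixed, the sketch also mirrors the hand-off from dynamic to fixed Q-point computation described for the log filter bank step.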
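
The PMC steps listed in the abstract (cepstrum to log filter bank, exponential to the linear domain, combination, log, and transform back to cepstrum) can be sketched for a single mean vector as follows. This is a floating-point illustration under stated assumptions, not the fixed-point procedure of the patent: it uses `math.log2` and `2.0 ** x` where the abstract uses polynomial fits, it assumes as many cepstral coefficients as filter bank channels so that an orthonormal DCT is exactly invertible, and it combines speech and noise by simple addition with unit gain. All function names are hypothetical.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II matrix; its transpose is its inverse."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * k * (j + 0.5) / n)
                     for j in range(n)])
    return rows


def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def transpose(m):
    return [list(col) for col in zip(*m)]


def pmc_combine_mean(speech_mfcc, noise_mfcc):
    """Combine a clean-speech mean and a noise mean in the linear domain."""
    n = len(speech_mfcc)
    to_cepstrum = dct_matrix(n)
    to_log_fbank = transpose(to_cepstrum)

    # Cepstrum -> log2 filter bank energies.
    speech_log = matvec(to_log_fbank, speech_mfcc)
    noise_log = matvec(to_log_fbank, noise_mfcc)

    # Log -> linear, add speech and noise, then back to log (assumed unit gain).
    combined_log = [math.log2(2.0 ** s + 2.0 ** d)
                    for s, d in zip(speech_log, noise_log)]

    # Log filter bank -> noise-compensated cepstrum.
    return matvec(to_cepstrum, combined_log)
```

When the noise energies are negligible the function returns the speech mean unchanged, and when speech and noise are equal every log filter bank energy rises by exactly one bit. In a fixed-point port, the `2.0 ** x` and `math.log2` calls are the places where polynomial approximations would be substituted.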

Bibliographic data

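  • Publication number: US2002198706A1
  • Publication date: 2002-12-26
  • Original format: PDF
  • Applicant/assignee: KAO YU-HUNG; GONG YIFAN
  • Application number: US20020136967
  • Inventors: YU-HUNG KAO; YIFAN GONG
  • Filing date: 2002-05-02
  • Classification: G10L21/00
  • Country: US
  • Database entry time: 2022-08-22 00:08:48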
