首页> 外国专利> Optimized local feature extraction for automatic speech recognition

Optimized local feature extraction for automatic speech recognition

机译:优化的局部特征提取可实现自动语音识别

摘要

The acoustic speech signal is decomposed into wavelets arranged in an asymmetrical tree data structure from which individual nodes may be selected to best extract local features, as needed to model specific classes of sound units. The wavelet packet transformation is smoothed through integration and compressed to apply a non-linearity prior to discrete cosine transformation. The resulting subband features such as cepstral coefficients may then be used to construct the speech recognizer's speech models. Using the local feature information extracted in this manner allows a single recognizer to be optimized for several different classes of sound units, thereby eliminating the need for parallel path recognizers.
机译:语音信号被分解成以非对称树数据结构排列的小波,从中可以选择单个节点以最佳地提取局部特征,以对特定类别的声音单元进行建模。小波包变换可通过积分进行平滑处理,并在离散余弦变换之前进行压缩以应用非线性。然后可以将所得的子带特征(例如倒频谱系数)用于构建语音识别器的语音模型。使用以此方式提取的局部特征信息,可以针对多个不同类别的声音单元优化单个识别器,从而消除了对并行路径识别器的需求。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号