首页> 外国专利> Optimized local feature extraction for automatic speech recognition

Optimized local feature extraction for automatic speech recognition

机译：优化的局部特征提取可实现自动语音识别

页面导航

摘要
著录项
相似文献

摘要

The acoustic speech signal is decomposed into wavelets arranged in an asymmetrical tree data structure from which individual nodes may be selected to best extract local features, as needed to model specific classes of sound units. The wavelet packet transformation is smoothed through integration and compressed to apply a non-linearity prior to discrete cosine transformation. The resulting subband features such as cepstral coefficients may then be used to construct the speech recognizer's speech models. Using the local feature information extracted in this manner allows a single recognizer to be optimized for several different classes of sound units, thereby eliminating the need for parallel path recognizers.

机译：语音信号被分解成以非对称树数据结构排列的小波，从中可以选择单个节点以最佳地提取局部特征，以对特定类别的声音单元进行建模。小波包变换可通过积分进行平滑处理，并在离散余弦变换之前进行压缩以应用非线性。然后可以将所得的子带特征（例如倒频谱系数）用于构建语音识别器的语音模型。使用以此方式提取的局部特征信息，可以针对多个不同类别的声音单元优化单个识别器，从而消除了对并行路径识别器的需求。

著录项

公开/公告号US6513004B1

专利类型
公开/公告日2003-01-28

原文格式PDF
申请/专利权人 MATSUSHITA ELECTRIC INDUSTRIAL CO. LTD.;
展开▼

申请/专利号US19990449053
发明设计人 DAVID KRYZE;LUCA RIGAZIO;JEAN-CLAUDE JUNQUA;TED APPLEBAUM;
展开▼

申请日1999-11-24
分类号G10L150/40;G10L170/00;G10L150/80;
国家 US
入库时间 2022-08-22 00:04:46

相似文献

专利
外文文献
中文文献