首页> 外国专利> WAVELET-BASED ENERGY BINNING CEPSTRAL FEATURES FOR AUTOMATIC SPEECH RECOGNITION

WAVELET-BASED ENERGY BINNING CEPSTRAL FEATURES FOR AUTOMATIC SPEECH RECOGNITION

机译：基于小波的能量归纳倒谱特征用于自动语音识别

页面导航

摘要
著录项
相似文献

摘要

Systems and methods for processing acoustic speech signals which utilize thewavelettransform (and alternatively, the Fourier transform) as a fundamental tool.The method essentiallyinvolves "synchrosqueezing" spectral component data obtained by performing awavelet transform(or Fourier transform) on digitized speech signals. In one aspect, spectralcomponents of thesynchrosqueezed plane are dynamically tracked via a K-means clusteringalgorithm. The amplitude,frequency and bandwidth of each of the; components are, thus, extracted. Thecepstrum generatedfrom this information is referred to as "K-mean Wastrum." In another aspect,the result of theK-mean clustering process is further processed to limit the set of primarycomponents to formants.The resulting features are referred to as "formant-based wastrum." Formantsare interpolated inunvoiced regions and the contribution of unvoiced turbulent part of thespectrum are added. Thismethod requires adequate formant tracking. The resulting robust formantextraction has a numberof applications in speech processing and analysis including vocal tractnormalization.

机译：用于处理声学语音信号的系统和方法小波变换（或傅立叶变换）作为基本工具。该方法本质上涉及通过执行“同步压缩”频谱分量数据小波变换（或傅里叶变换）处理数字化语音信号。一方面，光谱的组成部分通过K均值聚类动态跟踪同步压缩的平面算法。振幅每个的频率和带宽;因此，提取成分。的倒谱产生来自该信息的信息被称为“ K-mean Wastrum”。在另一方面，结果进一步处理K均值聚类过程以限制主集合共振峰的组成部分。所产生的特征称为“基于共振峰的波峰”。共振峰插值清音区域和清音湍流部分的贡献频谱已添加。这个该方法需要足够的共振峰跟踪。产生的坚固的共振峰提取有个数在语音处理和分析（包括声道）中的应用正常化。

著录项

公开/公告号CA2290185C

专利类型
公开/公告日2005-09-20

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号CA19992290185
发明设计人 MAES STEPHANE H.;BASU SANKAR;
展开▼

申请日1999-11-22
分类号G10L15/02;G10L19/02;
国家 CA
入库时间 2022-08-21 22:14:26

相似文献

专利
外文文献
中文文献