首页> 外国专利> WAVELET-BASED ENERGY BINNING CEPSTRAL FEATURES FOR AUTOMATIC SPEECH RECOGNITION

WAVELET-BASED ENERGY BINNING CEPSTRAL FEATURES FOR AUTOMATIC SPEECH RECOGNITION

机译:基于小波的能量归纳倒谱特征用于自动语音识别

摘要

Systems and methods for processing acoustic speech signals which utilize thewavelettransform (and alternatively, the Fourier transform) as a fundamental tool.The method essentiallyinvolves "synchrosqueezing" spectral component data obtained by performing awavelet transform(or Fourier transform) on digitized speech signals. In one aspect, spectralcomponents of thesynchrosqueezed plane are dynamically tracked via a K-means clusteringalgorithm. The amplitude,frequency and bandwidth of each of the; components are, thus, extracted. Thecepstrum generatedfrom this information is referred to as "K-mean Wastrum." In another aspect,the result of theK-mean clustering process is further processed to limit the set of primarycomponents to formants.The resulting features are referred to as "formant-based wastrum." Formantsare interpolated inunvoiced regions and the contribution of unvoiced turbulent part of thespectrum are added. Thismethod requires adequate formant tracking. The resulting robust formantextraction has a numberof applications in speech processing and analysis including vocal tractnormalization.
机译:用于处理声学语音信号的系统和方法小波变换(或傅立叶变换)作为基本工具。该方法本质上涉及通过执行“同步压缩”频谱分量数据小波变换(或傅里叶变换)处理数字化语音信号。一方面,光谱的组成部分通过K均值聚类动态跟踪同步压缩的平面算法。振幅每个的频率和带宽;因此,提取成分。的倒谱产生来自该信息的信息被称为“ K-mean Wastrum”。在另一方面,结果进一步处理K均值聚类过程以限制主集合共振峰的组成部分。所产生的特征称为“基于共振峰的波峰”。共振峰插值清音区域和清音湍流部分的贡献频谱已添加。这个该方法需要足够的共振峰跟踪。产生的坚固的共振峰提取有个数在语音处理和分析(包括声道)中的应用正常化。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号