
Speaker verification, speech recognition and channel normalization through dynamic time/frequency warping


Abstract

A Dynamic Time/Frequency Warping (DTFW) technique is disclosed for speaker verification, speech recognition and channel normalization, among other uses. The DTFW technique uses best-path dynamic programming methods on a three-dimensional time-frequency array representing the spectral differences between a test utterance (the utterance being analyzed) and a reference utterance (the template). The array is created by summing the squares of the differences between each feature in each frame of the template and each feature in each frame of the test utterance. Dynamic programming techniques are then used to find the minimal-distance path matching the test utterance to the template, so as to optimize the time and frequency warping paths.
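The abstract does not give the array layout or the path constraints, so the following is only a minimal sketch of one way joint time/frequency warping could be set up, not the patented method. It assumes the third array axis indexes an integer frequency shift of the template, that the squared differences are averaged over the overlapping features so that different shifts are comparable, and that the shift may change by at most one bin per step. The names `dtfw_distance` and `max_shift` are hypothetical.

```python
import numpy as np


def dtfw_distance(test, template, max_shift=2):
    """Illustrative joint time/frequency warping distance (assumed design).

    test:     array of shape (T, F), T frames of F spectral features
    template: array of shape (R, F), R frames of F spectral features
    max_shift: largest frequency shift (in feature bins) to consider;
               must be smaller than F.
    """
    test = np.asarray(test, dtype=float)
    template = np.asarray(template, dtype=float)
    T, F = test.shape
    R, F_ref = template.shape
    if F != F_ref:
        raise ValueError("test and template need the same feature count")
    if not 0 <= max_shift < F:
        raise ValueError("max_shift must satisfy 0 <= max_shift < F")

    shifts = list(range(-max_shift, max_shift + 1))
    K = len(shifts)

    # Local cost cube: cost[i, j, k] compares test frame i with template
    # frame j when test feature f is matched to template feature f - shift.
    cost = np.empty((T, R, K))
    for k, s in enumerate(shifts):
        lo, hi = max(0, s), min(F, F + s)        # overlapping test bins
        a = test[:, lo:hi]                        # (T, n)
        b = template[:, lo - s:hi - s]            # (R, n)
        diff = a[:, None, :] - b[None, :, :]      # (T, R, n) via broadcasting
        # Assumption: average instead of plain sum, to normalize for overlap.
        cost[:, :, k] = (diff ** 2).sum(axis=2) / (hi - lo)

    # Accumulated cost with standard DTW time steps and a shift change of
    # at most one bin between consecutive path points.
    acc = np.full((T, R, K), np.inf)
    acc[0, 0, :] = cost[0, 0, :]
    for i in range(T):
        for j in range(R):
            if i == 0 and j == 0:
                continue
            for k in range(K):
                k_lo, k_hi = max(0, k - 1), min(K, k + 2)
                best = np.inf
                for pi, pj in ((i - 1, j), (i, j - 1), (i - 1, j - 1)):
                    if pi >= 0 and pj >= 0:
                        best = min(best, acc[pi, pj, k_lo:k_hi].min())
                acc[i, j, k] = cost[i, j, k] + best

    # Minimal accumulated distance over all final frequency shifts.
    return float(acc[T - 1, R - 1, :].min())
```

In this sketch, a lower returned value means the test utterance matches the template more closely after warping. A verification system would presumably compare that value against a threshold, but the abstract does not describe that step.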
