【24h】

An Amplitude Warping Approach to Intra-speaker Normalization for Speech Recognition

机译:用于语音识别的扬声器内归一化的幅度变形方法

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we present an intra-speaker warping factor estimation based on pitch alteration utterance. The feature space distributions of untrans-formed speech from the pitch alteration utterance of intra-speaker would vary due to the acoustic differences of speech produced by glottis and vocal tract. Therefore, it may be possible to determine the amplitude warping factor by calculating the inverse ratio of input to reference pitch. As the recognition results, the error rate is reduced from 0.4% to 2.3% for digit and word decoding.
机译:在本文中,我们提出了基于音调变化话语的扬声器内翘曲因子估计。由于声门和声道产生的语音声学差异,来自扬声器内音调变化发声的未转换语音的特征空间分布将发生变化。因此,有可能通过计算输入与参考音高的反比来确定幅度扭曲因数。作为识别结果,数字和单词解码的错误率从0.4%降低到2.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号