首页> 外文期刊>IEICE Transactions on Information and Systems >The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances
【24h】

The Use of Overlapped Sub-Bands in Multi-Band, Multi-SNR, Multi-Path Recognition of Noisy Word Utterances

机译:重叠的子频带在多频带,多SNR,多路径的有声单词话语识别中的使用

获取原文
获取原文并翻译 | 示例
       

摘要

A solution to the problem of improving robustness to noise in automatic speech recognition is presented in the framework of multi-band, multi-SNR, and multi-path approaches. In our word recognizer, the whole frequency band is divided into seven-overlapped sub-bands, and then sub-band noisy phoneme HMMs are trained on speech data mixed with the filtered white Gaussian noise at multiple SNRs. The acoustic model of a word is built as a set of concatenations of clean and noisy sub-band phoneme HMMs arranged in parallel. A Viterbi decoder allows a search path to transit to another SNR condition at a phoneme boundary. The recognition scores of the sub-bands are then recombined to give the score for a word. Experiments show that the overlapped seven-band system yields the best performance under nonstationary ambient noises. It is also shown that the use of filtered white Gaussian noise is advantageous for training noisy phoneme HMMs.
机译:在多频带,多SNR和多路径方法的框架下,提出了一种解决方案,该方案在自动语音识别中提高了对噪声的鲁棒性。在我们的单词识别器中,将整个频带划分为七个重叠的子带,然后在混合有多个SNR的滤波后的高斯白噪声的语音数据上训练子带噪声音素HMM。单词的声学模型构建为一组并行排列的干净且嘈杂的子带音素HMM。维特比解码器允许搜索路径转换到音素边界处的另一个SNR条件。然后将子带的识别分数重新组合以给出单词的分数。实验表明,重叠的七波段系统在非平稳环境噪声下具有最佳性能。还表明,使用滤波后的高斯白噪声对训练有噪音素HMM有利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号