首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound
【24h】

Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound

机译:基于快速NMF的方法和基于VQ的方法,使用MFCC距离量度从混合声音中进行语音识别

获取原文
获取外文期刊封面目录资料

摘要

We have considered a speech recognition method for mixed sound, consisting of speech and music, that removes only the music based on vector quantization (VQ) and non-negative matrix factorization (NMF). Instead of conventional amplitude spectrum distance measure, MFCC distance measure which is not affected by the pitch is introduced. For isolated word recognition using the clean speech model, an improvement of 53% word error reduction rate was obtained compared with the case of not removing music. Furthermore, a high recognition rate, close to clean speech recognition was obtained at 10dB. For the case of the multi-conditions, our proposed method reduced the error rate of 67% compared with the multi-conditions model.
机译:我们已经考虑了一种由语音和音乐组成的混合声音的语音识别方法,该方法仅基于矢量量化(VQ)和非负矩阵分解(NMF)才能删除音乐。代替传统的幅度谱距离测量,引入了不受音高影响的MFCC距离测量。对于使用纯净语音模型的孤立单词识别,与不删除音乐的情况相比,获得了53%的单词错误减少率的提高。此外,在10dB处获得了接近清晰语音识别的高识别率。对于多条件情况,与多条件模型相比,我们提出的方法将错误率降低了67%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号