Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound

机译：基于快速NMF的方法和基于VQ的方法，使用MFCC距离量度从混合声音中进行语音识别

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We have considered a speech recognition method for mixed sound, consisting of speech and music, that removes only the music based on vector quantization (VQ) and non-negative matrix factorization (NMF). Instead of conventional amplitude spectrum distance measure, MFCC distance measure which is not affected by the pitch is introduced. For isolated word recognition using the clean speech model, an improvement of 53% word error reduction rate was obtained compared with the case of not removing music. Furthermore, a high recognition rate, close to clean speech recognition was obtained at 10dB. For the case of the multi-conditions, our proposed method reduced the error rate of 67% compared with the multi-conditions model.

机译：我们已经考虑了一种由语音和音乐组成的混合声音的语音识别方法，该方法仅基于矢量量化（VQ）和非负矩阵分解（NMF）才能删除音乐。代替传统的幅度谱距离测量，引入了不受音高影响的MFCC距离测量。对于使用纯净语音模型的孤立单词识别，与不删除音乐的情况相比，获得了53％的单词错误减少率的提高。此外，在10dB处获得了接近清晰语音识别的高识别率。对于多条件情况，与多条件模型相比，我们提出的方法将错误率降低了67％。

著录项

来源
《Asia-Pacific Signal and Information Processing Association Annual Summit and Conference》|2013年|1-4|共4页
会议地点
作者
Nakano Shoichi; Yamamoto Kazumasa; Nakagawa Seiichi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A MFCC-Based CELP Speech Coder for Server-Based Speech Recognition in Network Environments [J] . Jae Sam YOON, Gil Ho LEE, Hong Kook KIM IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2007,第3期

机译：基于MFCC的CELP语音编码器，用于网络环境中基于服务器的语音识别
2. Speech Recognition for Isolated Words using Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) [J] . Yogesh S. Angal, R. H. Chile, R. S. Holambe Journal of the Instrument Society of India: Proceedings of the national symposium on instrumentation . 2011,第3期

机译：使用Mel频率倒谱系数（MFCC）和矢量量化（VQ）对孤立单词进行语音识别
3. An improved maximum model distance approach for HMM-based speech recognition systems [J] . He QH., Man KF., Tang KS., Pattern Recognition: The Journal of the Pattern Recognition Society . 2000,第10期

机译：基于HMM的语音识别系统的改进的最大模型距离方法
4. Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound [C] . Nakano Shoichi, Yamamoto Kazumasa, Nakagawa Seiichi Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2013

机译：基于NMF的基于NMF的方法和基于VQ的方法使用MFCC距离测量混合声音语音识别
5. HMM-based non-intrusive speech quality and implementation of Viterbi score distribution and hiddenness based measures to improve the performance of speech recognition [D] . Talwar, Gaurav 2006

机译：基于HMM的非侵入式语音质量以及基于Viterbi分数分布和隐蔽性的措施的实施，以提高语音识别的性能
6. Recognition of Emotions in Mexican Spanish Speech: An Approach Based on Acoustic Modelling of Emotion-Specific Vowels [O] . Santiago-Omar Caballero-Morales 2013

机译：墨西哥西班牙语语音中的情绪识别：一种基于情绪特定元音声学模型的方法
7. Continuous kannada speech segmentation and speech recognition based on threshold using MFCC And VQ [O] . Vanajakshi Puttaswamy Gowda, Mathivanan Murugavelu, Senthil Kumaran Thangamuthu 2019

机译：使用MFCC和VQ的阈值连续kannada语音分割和语音识别

Fast NMF based approach and VQ based approach using MFCC distance measure for speech recognition from mixed sound

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅