Effect of MFCC normalization on vector quantization based speaker identification

机译：MFCC归一化对基于矢量量化的说话人识别的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is creating delta and delta-delta coefficients and append them to MFCC to create feature vector. Another pre-processing is coefficients mean normalization. In this paper, the effect of these two processes on the accuracy of a Vector Quantization (VQ) speaker identification system is compared. Additionally, it is shown that coefficient variance normalization, which is less common, can improve the accuracy.

机译：梅尔频率倒谱系数（MFCC）被广泛用于语音识别和说话人识别。 MFCC功能通常在用于识别之前先进行预处理。这些预处理之一是创建delta和delta-delta系数，并将它们附加到MFCC以创建特征向量。另一个预处理是系数均值归一化。在本文中，比较了这两个过程对矢量量化（VQ）说话人识别系统准确性的影响。另外，还表明，较少见的系数方差归一化可以提高精度。

著录项

来源
《10th IEEE International Symposium on Signal Processing and Information Technology》|2010年|p.250-253|共4页
会议地点
作者

展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Mel Frequency Cepstral Coefficients (MFCC); Normalization; Speaker Recognition; Vector Quantization (VQ);

机译：梅尔频率倒谱系数（MFCC）;归一化;扬声器识别;矢量量化（VQ）;

相似文献

外文文献
中文文献
专利

1. Speaker Recognition using MFCC and Improved Weighted Vector Quantization Algorithm [J] . C. Sunitha, E. Chandra International Journal of Engineering and Technology . 2015,第5期

机译：使用MFCC和改进的加权矢量量化算法的说话人识别
2. Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm [J] . Ch.Srinivasa Kumar, Dr. P. Mallikarjuna Rao International Journal on Computer Science and Engineering . 2011,第8期

机译：基于MFCC，矢量量化和LBG算法的说话人自动识别系统设计
3. Modelling a Voice Activated Speaker Identification System using MFCC-Pitch-Formant Vector [J] . Avik Sengupta, Rabindranath Ghosh Journal of The Institution of Engineers (India): Series B . 2012,第1期

机译：使用MFCC音高形成矢量对语音激活的说话人识别系统建模
4. Effect of MFCC normalization on vector quantization based speaker identification [C] . {missing} IEEE International Symposium on Signal Processing and Information Technology . 2010

机译：MFCC归一化对矢量量化的扬声器识别的影响
5. Acoustic-feature-based frequency warping for speaker normalization. [D] . Gouvea, Evandro Bacci. 1999

机译：基于声音特征的频率扭曲，用于扬声器归一化。
6. Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors [O] . Daniel Bone, Ming Li, Matthew P. Black, -1

机译：陶醉的语音检测：具有扬声器归一化分层功能和GMM运行的融合框架
7. Comparison of Vector Quantization and Gaussian Mixture Model using Effective MFCC Features for Text-independent Speaker Identification [O] . S. B., S. M. 2016

机译：矢量量化与高斯混合模型使用有效MFCC特征对文本无关的扬声器识别的比较
8. Text-Dependent Speaker Verification Using Vector Quantization Source Coding [R] . Burton, D. K. 1985

机译：使用矢量量化源编码的文本相关说话人验证

Effect of MFCC normalization on vector quantization based speaker identification

摘要

著录项

相似文献

相关主题

期刊订阅