首页> 外文会议>IEEE international conference on signal processing systems >Integration of Speaker and Speech Recognition Systems
【24h】

Integration of Speaker and Speech Recognition Systems

机译:说话人和语音识别系统的集成

获取原文

摘要

Speech is the most important and primary mode of communication and also the most natural and efficient form of exchanging information among humans. This paper presents a detailed study of textdependent Speaker and Speech Recognition system. Speaker recognition system uses vector quantization (VQ) as the modeling technique while features of the speech signal are extracted using Mel Frequency Cepstum Coefficients (MFCC). K-means clustering algorithm has been used to obtain the vector quantized codebook. For Speech recognition system, formant frequencies of the word sample are used to determine the unknown word. Speaker recognition system yields highest accuracy with Hanning window and Mel perceptual feature extraction realized with 35 filter bank. Accuracy also improves as the number of vectors in the VQ codebook is increased from 64 to 100 whereas for Speech recognition, highest accuracy obtained is 95%.
机译:言语是最重要和最主要的沟通方式,也是人类之间交换信息的最自然和有效的形式。本文提出了对TextDependendent扬声器和语音识别系统的详细研究。扬声器识别系统使用矢量量化(VQ)作为建模技术,而使用MEL频率Cepstum系数(MFCC)提取语音信号的特征。 K-means群集算法已被用于获得矢量量化码本。对于语音识别系统,Word样本的格式频率用于确定未知字。扬声器识别系统通过35个滤波器组成的汉宁窗口和梅尔感知特征提取产生最高精度。由于VQ码本的矢量数量从64增加到100,而对于语音识别,所获得的最高精度是95%的载体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号