...
首页> 外文期刊>Indian Journal of Science and Technology >Speaker Adaptation on Hidden Markov Model using MFCC and Rasta-PLP and Comparative Study
【24h】

Speaker Adaptation on Hidden Markov Model using MFCC and Rasta-PLP and Comparative Study

机译:MFCC和Rasta-PLP对隐马尔可夫模型的说话人适应和比较研究

获取原文
   

获取外文期刊封面封底 >>

       

摘要

This work compares the performance of the Mel-Frequency Cepstral Coefficient (MFCC) and Perceptual Linear Prediction (PLP) features for developing a text-dependent speaker identification system. Continuously spoken Hindi speech sentences have been used to train the HMM models using HTK toolkit for each speaker separately. The experiments have been performed using a set of 200 continuously spoken sentences with vocabulary of 20000 isolated words using a database of 100 speakers. The results show an accuracy of 92.26% recognition when PLP features have been used and accuracy of 91.18% for MFCC features. A confusion matrix has been created for all the 20 test speakers based on the recognition scores obtained for each of these speakers and their confusion with other speakers. Performance has been compared in the closed set and open set conditions of testing and as it is expected, the performance in the closed set condition is far better than the open set. We propose that if PLP features are used in place of MFCC, they may provide improvement in speaker identification accuracy by reducing the cases of false acceptance.
机译:这项工作比较了梅尔频率倒谱系数(MFCC)和感知线性预测(PLP)功能的性能,以开发文本相关的说话人识别系统。连续口语印地语语音句子已被用于使用HTK工具包分别为每个说话者训练HMM模型。实验是使用100个说话者的数据库使用一组200个连续口语句子和20000个独立单词的词汇进行的。结果表明,使用PLP功能时,识别精度为92.26%,而MFCC功能为91.18%。基于为每个20位测试演讲者获得的识别分数以及他们与其他演讲者的混淆,已经创建了一个混淆矩阵。已对封闭设置和开放设置测试条件下的性能进行了比较,并且可以预期,封闭设置条件下的性能远远优于开放设置。我们建议,如果使用PLP功能代替MFCC,则可以通过减少错误接受的情况来提高说话人识别的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号