首页> 外文会议>International Conference on Audio, Language and Image Processing >Speaker identification in shouted talking environments based on novel Third-Order Hidden Markov Models
【24h】

Speaker identification in shouted talking environments based on novel Third-Order Hidden Markov Models

机译:基于新颖的三阶隐马尔可夫模型的高声说话环境中的说话人识别

获取原文

摘要

In this work we propose, implement, and evaluate novel models called Third-Order Hidden Markov Models (HMM3s) to enhance low performance of text-independent speaker identification in shouted talking environments. The proposed models have been tested on our collected speech database using Mel-Frequency Cepstral Coefficients (MFCCs). Our results demonstrate that HMM3s significantly improve speaker identification performance in such talking environments by 11.3% and 166.7% compared to second-order hidden Markov models (HMM2s) and first-order hidden Markov models (HMM1s), respectively. The achieved results based on the proposed models are close to those obtained in subjective assessment by human listeners.
机译:在这项工作中,我们提出,实施和评估称为三阶隐马尔可夫模型(HMM3)的新颖模型,以增强在大声交谈环境中与文本无关的说话人识别的低性能。拟议的模型已在我们收集的语音数据库中使用Mel频率倒谱系数(MFCC)进行了测试。我们的结果表明,与二阶隐马尔可夫模型(HMM2s)和一阶隐马尔可夫模型(HMM1s)相比,HMM3在这种说话环境中的说话人识别性能分别提高了11.3%和166.7%。基于提出的模型获得的结果接近于人类听众在主观评估中获得的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号