首页> 外文期刊>Progress in Natural Science >A new frequency scale of Chinese whispered speech in the application of speaker identification
【24h】

A new frequency scale of Chinese whispered speech in the application of speaker identification

机译:汉语低语语音频率等级在说话人识别中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, the frequency characteristics of Chinese whispered speech were investigated by a filter bank analysis. It was shown that the first and the third formants were more important than the other formants in the speaker identification of Chinese whispered speech. The experiment showed that the 800—1200 Hz and 2800—3200 Hz ranges were the most significant frequency ranges in discriminating the speaker. Based on this result, a new feature scale named whisper sensitive scale (WSS) was proposedto replace the common scale, Mel scale, and to extract the cepstral coefficient from whispered speech signal. Furthermore, a speaker identification system in whispered speech was presented based on the modified Hidden Markov Models integrating advantages of WSCC (the whisper sensitive cepstral coefficient) and LPCC. And the new system performed better in solving the problem of speaker identification of Chinese whispered speech than the traditional method.
机译:本文通过滤波器组分析研究了中国低语语音的频率特性。结果表明,第一和第三共振峰比其他共振峰更重要。实验表明,在区分扬声器时,800-1200 Hz和2800-3200 Hz范围是最重要的频率范围。基于此结果,提出了一种新的特征量表,称为耳语敏感量表(WSS),以代替通用量表Mel量表,并从耳语语音信号中提取倒谱系数。此外,基于改进的隐马尔可夫模型,提出了一种结合了WSCC(耳语敏感倒频谱系数)和LPCC的优点的低声语音说话人识别系统。与传统方法相比,该新系统在解决汉语低声语音的说话人识别问题上表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号