首页> 外文会议>2011 IEEE International Conference on Acoustics, Speech and Signal Processing >Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model
【24h】

Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model

机译:基于谐波加噪声模型的频谱子带能量比进行说话人表征

获取原文

摘要

This paper proposes a feature extraction for speaker characterization by exploring the relationship between the two distinct components of the speech signal, one is harmonics accounting for the periodicity of the signal and the other is modulated noise accounting for the turbulences of the glottal airflow. The harmonic and noise parts of the speech signal are decomposed based on the Harmonic plus Noise Model approach. We estimate the spectral subband energy ratios (SSERs) as the speaker characteristic features, which are expected to reflect the interaction property of the vocal tract and glottal airflow of individual speakers for speaker verification. The speaker verification experiments based on a GMM-UBM system have shown the efficiency of the SSER features, reducing the error equal rate by 27.2% by combining with the conventional MFCC features.
机译:本文通过探讨语音信号的两个不同成分之间的关​​系,提出了一种用于说话人表征的特征提取方法,一种是谐波,用于说明信号的周期性,另一种是调制噪声,用于说明声门气流的湍流。语音信号的谐波和噪声部分基于“谐波加噪声模型”方法进行分解。我们估计频谱子带能量比(SSERs)作为扬声器的特征,有望反映出声道的相互作用特性和各个扬声器的声门气流,以进行扬声器验证。基于GMM-UBM系统的说话人验证实验显示了SSER功能的效率,通过与常规MFCC功能相结合,将错误均等率降低了27.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号