首页> 外文会议>International Conference on speech and computer >Fusion of a Novel Volterra-Wiener Filter Based Nonlinear Residual Phase and MFCC for Speaker Verification
【24h】

Fusion of a Novel Volterra-Wiener Filter Based Nonlinear Residual Phase and MFCC for Speaker Verification

机译:基于非线性残留相位的新型Volterra-Wiener滤波器与MFCC的融合,用于说话人验证

获取原文

摘要

This paper investigates the complementary nature of the speaker-specific information present in the Volterra-Wiener filter residual (VWFR) phase of speech signal in comparison with the information present in conventional Mel Frequency Cepstral Coefficients (MFCC) and Teager Energy Operator (TEO) phase. The feature set is derived from residual phase extracted from the output of nonlinear filter designed using Volterra-Weiner series exploiting higher order linear as well as nonlinear relationships hidden in the sequence of samples of speech signal. The proposed feature set is being used to conduct Speaker Verification (SV) experiments on NIST SRE 2002 database using state-of-the-art GMM-UBM system. The score-level fusion of proposed feature set with MFCC gives an EER of 6.05% as compared to EER of 8.9% with MFCC alone. EER of 8.83% is obtained for TEO phase in fusion with MFCC, indicating that residual phase from proposed nonlinear filtering approach contain complementary speaker-specific information.
机译:与传统的梅尔频率倒谱系数(MFCC)和Teager能量算子(TEO)阶段中存在的信息相比,本文研究了语音信号的Volterra-Wiener滤波器残差(VWFR)阶段中存在的特定于扬声器的信息的互补性质。 。该特征集来自残余相位,该残余相位是从使用Volterra-Weiner序列设计的非线性滤波器的输出中提取的残留相位中提取的,该非线性滤波器利用语音信号样本序列中隐藏的高阶线性以及非线性关系。拟议的功能集用于使用最新的GMM-UBM系统在NIST SRE 2002数据库上进行说话者验证(SV)实验。与MFCC单独的EER为8.9%相比,拟议的特征集与MFCC的得分级融合给出了6.05%的EER。与MFCC融合时,TEO相的EER为8.83%,表明所提出的非线性滤波方法的残余相包含特定于说话人的补充信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号