首页> 外文会议>2017 IEEE International Conference on Imaging, Vision amp; Pattern Recognition >Feature extraction using HHT-based locally optimized short-time fractional Fourier transform for speaker recognition
【24h】

Feature extraction using HHT-based locally optimized short-time fractional Fourier transform for speaker recognition

机译:使用基于HHT的局部优化的短时分数阶傅里叶变换进行特征提取以进行说话人识别

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an improved locally optimized short-time fractional Fourier transform (STFRFT), HHT-based locally optimized STFRFT, by finding the optimal order using phase information ignoring the premise of the known chirp rate of signal and pre-estimated pitch of speech. The feature derived from the optimal order FRFT's magnitude spectrum, HHT-based locally optimized STFRFT Mel-frequency cepstral coefficients (HLO-STFRFT-MFCC), reveals the definite advantage in speaker recognition experiments on the TIMIT database. Furthermore, HLO-STFRFT-MFCC yields a gain of 13.0% relative to the baseline feature of Mel-frequency cepstral coefficients (MFCC) in the recognition accuracy on 2004 NIST SRE corpora.
机译:本文提出了一种改进的局部优化短时分数阶傅里叶变换(STFRFT),基于HHT的局部优化STFRFT,它通过使用相位信息找到了最佳阶,而忽略了已知线性调频信号的速率和预先估计的语音音调。从最佳阶FRFT幅度谱得出的功能,基于HHT的局部优化STFRFT梅尔频率倒谱系数(HLO-STFRFT-MFCC),在TIMIT数据库上的说话人识别实验中显示出一定的优势。此外,相对于2004年NIST SRE语料库的识别准确性,HLO-STFRFT-MFCC相对于梅尔频率倒谱系数(MFCC)的基线特征,获得了13.0%的增益。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号