首页> 外文会议>International Conference on Security with Intelligent Computing and Big-data Services >Robust Speaker Recognition Using Improved GFCC and Adaptive Feature Selection
【24h】

Robust Speaker Recognition Using Improved GFCC and Adaptive Feature Selection

机译:使用改进的GFCC和Adaptive Feature选择的强大的扬声器识别

获取原文

摘要

Speaker recognition systems have shown good performance in noise-free environments, but the performance will severely deteriorate in the presence of noises. At the front end of the systems, Mel-Frequency Cepstral Coefficient (MFCC), or a relatively noise-robust feature Gammatone Frequency Cepstral Coefficients (GFCC), is commonly used as time-frequency feature. To further improve the noise-robustness of GFCC, signal processing techniques, such as DC removal, pre-emphasis and Cepstral Mean Variance Normalization (CMVN), are investigated in the extraction of GFCC. Being aware the advantages and disadvantages of MFCC and GFCC, an adaptive strategy was proposed to make feature selection based on the quality of speech. Experiments were conducted on TIMIT dataset to evaluate our approach. Compared with ordinary GFCC and MFCC features, our method significantly reduced the EER in speech data with miscellaneous SNRs.
机译:扬声器识别系统在无噪声环境中表现出良好的性能,但在噪声存在下性能会严重恶化。在系统的前端,熔融频率谱系码(MFCC)或相对响应稳健的特征γ频率谱系数(GFCC)通常用作时频特征。为了进一步提高GFCC的噪声稳健性,在GFCC的提取中研究了信号处理技术,例如DC去除,预加重和抗康斯兰语平均方差标准化(CMVN)。有人知道MFCC和GFCC的优点和缺点,提出了一种自适应策略,以基于语音质量进行特征选择。在Timit DataSet上进行实验以评估我们的方法。与普通的GFCC和MFCC功能相比,我们的方法用杂项SNR显着降低了语音数据中的eer。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号