首页> 外文会议>2011 International Conference on Recent Advancements in Electrical, Electronics and Control Engineering >Frequency-time analysis approach to feature extraction for text independent speaker identification
【24h】

Frequency-time analysis approach to feature extraction for text independent speaker identification

机译:频率-时间分析的特征提取方法用于文本无关的说话人识别

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an alternative approach to Mel Frequency Cepstral Coefficient (MFCC) based method of feature extraction for robust text independent speaker identification. This work is focused to increase the identification accuracy without increasing the size and complexity of filter bank. The drive for this new feature extraction technique comes from a transformation which is based on the Nyquist filter bank consuructed using Gaussian filters. This new feature extraction technique has been compared with MFCC feature for different lengths of utterances. Experimental evaluation is carried out on MEPCO telephone speech database with 50 speakers using Gaussian Mixture Model (GMM). The proposed feature set performs significantly better than the MFCC feature set achieves 6% higher average accuracy compared to the MFCC feature set for utterance lengths of 20 seconds.
机译:本文提出了一种基于梅尔频率倒谱系数(MFCC)的特征提取方法,用于健壮的与文本无关的说话人识别。这项工作的重点是在不增加滤波器组的大小和复杂性的情况下提高识别精度。这项新特征提取技术的驱动力来自一种变换,该变换基于使用高斯滤波器的奈奎斯特滤波器组。这项新的特征提取技术已与MFCC特征进行了比较,以用于不同长度的语音。使用高斯混合模型(GMM)在MEPCO电话语音数据库上对50名发言人进行了实验评估。对于20秒的发声长度,与MFCC功能集相比,与MFCC功能集相比,所提出的功能集的性能明显好于MFCC功能集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号