首页> 外文会议>International Conference on Electrical, Electronics, and Optimization Techniques >Extraction of speech signal based on Power Normalized Cepstral Coefficient and Mel Frequency Cepstral Coefficient: A comparison
【24h】

Extraction of speech signal based on Power Normalized Cepstral Coefficient and Mel Frequency Cepstral Coefficient: A comparison

机译:基于功率归一化倒谱系数和梅尔频率倒谱系数的语音信号提取:比较

获取原文

摘要

Speech processing is emerged as one of the important application area of digital signal processing. Power Normalized Cepstral Coefficients (PNCC) and Mel Frequency Cepstral Coefficient (MFCC) are mainly used in feature extraction of speech signals. The problem of real time speaker segmentation in speech processing is enormous in which no prior knowledge about the number of speakers and the identities of speakers are available. In this paper the performances of the PNCC and MFCC are compared and the experimental results demonstrate that PNCC processing provides improvements in recognition accuracy in the presence of various types of noises and in environmental changes. The performance of PNCC method is robust to natural and unpredictable situations.
机译:语音处理已成为数字信号处理的重要应用领域之一。功率归一化倒谱系数(PNCC)和梅尔频率倒谱系数(MFCC)主要用于语音信号的特征提取。语音处理中的实时说话人分割问题是巨大的,其中没有关于说话人数量和说话人身份的先验知识。本文对PNCC和MFCC的性能进行了比较,实验结果表明PNCC处理可以在存在各种类型的噪声和环境变化的情况下提高识别精度。 PNCC方法的性能对于自然和不可预测的情况具有鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号