首页> 外文会议>IEEE International Conference on Signal and Image Processing Applications >Speech Feature Extraction Based on Linear Prediction Residual
【24h】

Speech Feature Extraction Based on Linear Prediction Residual

机译:基于线性预测残差的语音特征提取

获取原文

摘要

Linear prediction coding (LPC) is the core technology in speech processing, which has been successfully applied in speech recognition, synthesis and coding. LPC coefficient can well represent the speaker's vocal tract information, and is widely used in the field of speaker recognition. However, the epiphytic LPC residual is often ignored. This paper shows that the LPC residual contains information, which can reflect the characteristics of the speaker himself. We extracted new feature parameters (Second moment, third moment) from the LPC residual, combined with LPC coefficients, and input them into a speaker recognition system based on the GRU network. Cross entropy loss is used as a loss function to train the classifier. The experimental results show that the combined parameters effectively improve the system recognition rate by 5% relative to the LPC coefficients.
机译:线性预测编码(LPC)是语音处理中的核心技术,其已成功应用于语音识别,合成和编码。 LPC系数可以很好地代表扬声器的声道信息,并且广泛用于扬声器识别领域。但是,果皮LPC残差通常被忽略。本文表明,LPC残差包含信息,可以反映扬声器本人的特征。我们从LPC剩余的新功能参数(第二时刻,第三时刻)提取,与LPC系数组合,并将其输入基于GRU网络的扬声器识别系统。交叉熵损耗用作培训分类器的损耗函数。实验结果表明,组合参数相对于LPC系数有效地提高了系统识别率5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号