Speech Feature Extraction Based on Linear Prediction Residual

机译：基于线性预测残差的语音特征提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Linear prediction coding (LPC) is the core technology in speech processing, which has been successfully applied in speech recognition, synthesis and coding. LPC coefficient can well represent the speaker's vocal tract information, and is widely used in the field of speaker recognition. However, the epiphytic LPC residual is often ignored. This paper shows that the LPC residual contains information, which can reflect the characteristics of the speaker himself. We extracted new feature parameters (Second moment, third moment) from the LPC residual, combined with LPC coefficients, and input them into a speaker recognition system based on the GRU network. Cross entropy loss is used as a loss function to train the classifier. The experimental results show that the combined parameters effectively improve the system recognition rate by 5% relative to the LPC coefficients.

机译：线性预测编码（LPC）是语音处理中的核心技术，其已成功应用于语音识别，合成和编码。 LPC系数可以很好地代表扬声器的声道信息，并且广泛用于扬声器识别领域。但是，果皮LPC残差通常被忽略。本文表明，LPC残差包含信息，可以反映扬声器本人的特征。我们从LPC剩余的新功能参数（第二时刻，第三时刻）提取，与LPC系数组合，并将其输入基于GRU网络的扬声器识别系统。交叉熵损耗用作培训分类器的损耗函数。实验结果表明，组合参数相对于LPC系数有效地提高了系统识别率5％。

著录项

来源
《IEEE International Conference on Signal and Image Processing Applications》|2020年|768-772|共5页
会议地点
作者
Lu Xu; Hongjin Liu; Shaolin Zhang; Mingjiang Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Image coding; Image recognition; Speech recognition; Feature extraction; Entropy; Speaker recognition; Speech processing;

机译：图像编码;图像识别;语音识别;特征提取;熵;扬声器识别;语音处理;

相似文献

外文文献
中文文献
专利

1. Extraction of speaker-specific excitation information from linear prediction residual of speech [J] . S.R. Mahadeva Prasanna, Cheedella S. Gupta, B. Yegnanarayana Speech Communication . 2006,第10期

机译：从语音的线性预测残差中提取说话人特定的激励信息
2. Improved pulsed residual excited linear prediction: a new excitation for CELP based speech coders [J] . Black A.W., Kondoz A.M. Electronics Letters . 1996,第6期

机译：改进的脉冲残留激励线性预测：基于CELP的语音编码器的新激励
3. Nonlinear Dynamic Feature Extraction Based on Phase Space Reconstruction for the Classification of Speech and Emotion [J] . Ying Sun, Xue-Ying Zhang, Jiang-He Ma, Mathematical Problems in Engineering: Theory, Methods and Applications . 2020,第1期

机译：基于相位空间重建的语音与情感分类非线性动态特征提取
4. Gaussian Mixture Model Learning for Desired Speech Discrimination Based on Kurtosis of Linear Prediction Residual Signals [C] . Kohei HAYASHIDA, Masato NAKAYAMA, Takanobu NISHIURA, KES International Conference on Intelligent Decision Technologies . 2014

机译：高斯混合模型学习所需语音鉴别的基于线性预测残余信号的峰值
5. Reverberant speech enhancement using linear prediction residual signal. [D] . Joshi, Bhavin Bharat. 2005

机译：使用线性预测残差信号的混响语音增强。
6. SVM-based prediction of linear B-cell epitopes using Bayes Feature Extraction [O] . Lawrence JK Wee, Diane Simarmata, Yiu-Wing Kam, 2010

机译：使用贝叶斯特征提取的基于SVM的线性B细胞表位预测
7. Spatial Domain-Based Nonlinear Residual Feature Extraction for Identification of Image Operations [O] . Xiaochen Yuan, Tian Huang 2020

机译：基于空间域的非线性残余特征提取，用于识别图像操作
8. Optimal Linear and Nonlinear Feature Extraction Based on the Minimization of the Increased Risk of Misclassification [R] . R. J. P. de Figueiredo 1974

机译：基于最小化误分类风险的最优线性和非线性特征提取

Speech Feature Extraction Based on Linear Prediction Residual

摘要

著录项

相似文献

相关主题

期刊订阅