Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch

Saeidi Rahim; Alku Paavo; Backstrom Tom

Abstract

Linear prediction is one of the most established techniques in signal estimation, and it is widely used in speech signal processing. It has long been understood that the nerve firing rate of the human auditory system can be approximated by a power-law nonlinearity, and this has motivated the use of perceptual linear prediction for extracting acoustic features in a variety of speech processing applications. In this paper, we revisit the application of the power-law nonlinearity to speech spectrum estimation by compressing or expanding the power spectrum in autocorrelation-based linear prediction. The development of the so-called LP- is motivated by a desire to obtain spectral features that show less mismatch than conventional spectrum estimation methods when speech of normal loudness is compared to speech produced under vocal effort. The effectiveness of the proposed approach is demonstrated in a speaker recognition task conducted under severe vocal effort mismatch, comparing shouted and normal speech.
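
The abstract does not give the formulation, so the following is only a minimal sketch of the general idea, assuming the adjustment amounts to raising the frame's power spectrum to an exponent before converting it back to an autocorrelation sequence (via the Wiener-Khinchin relation) and solving the usual linear prediction normal equations. The names `power_law_lp`, `levinson_durbin`, `alpha`, and `nfft` are hypothetical and are not taken from the paper.

```python
import numpy as np

def levinson_durbin(r, order):
    """Solve the LP normal equations from autocorrelation values r[0..order].

    Returns the prediction-error filter a (with a[0] == 1) and the
    final prediction error energy.
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Correlation of the current predictor with the next lag.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def power_law_lp(frame, order, alpha, nfft=512):
    """Autocorrelation-based LP on a power-law adjusted power spectrum.

    alpha < 1 compresses the spectrum, alpha > 1 expands it, and
    alpha == 1 reduces to conventional autocorrelation LP
    (assumed parameterization, for illustration only).
    """
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed, nfft)) ** 2
    adjusted = power ** alpha
    # Inverse transform of a power spectrum gives an autocorrelation sequence.
    r = np.fft.irfft(adjusted, nfft)
    return levinson_durbin(r[: order + 1], order)

# Example: 10th-order model of a synthetic 200-sample frame.
rng = np.random.default_rng(0)
frame = rng.standard_normal(200)
a_compressed, err_compressed = power_law_lp(frame, order=10, alpha=0.5)
```

With `alpha=1.0` and `nfft` at least twice the frame length, the sketch yields the same coefficients as time-domain autocorrelation LP, which makes it easy to check against a conventional implementation.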

Bibliographic details

Source
Audio, Speech, and Language Processing, IEEE/ACM Transactions on | 2016, Issue 1 | pp. 42-53 | 12 pages
Authors
Saeidi Rahim; Alku Paavo; Backstrom Tom
Affiliation
Department of Signal Processing and Acoustics, Aalto University, Finland
Original format: PDF
Language: English (eng)
Keywords
Speaker recognition; linear prediction; mismatch; power-law; shouting; vocal effort
