Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise

机译：临时加权线性预测功能，用于在加性噪声中验证说话人

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We consider text-independent speaker verification under additive noise corruption. In the popular mel-frequency cepstral coefficient (MFCC) front-end, we substitute the conventional Fourier-based spectrum estimation with weighted linear predictive methods, which have earlier shown success in noise-robust speech recognition. We introduce two temporally weighted variants of linear predictive (LP) modeling to speaker verification and compare them to FFT, which is normally used in computing MFCCs, and to conventional LP. We also investigate the effect of speech enhancement (spectral subtraction) on the system performance with each of the four feature representations. Our experiments on the NIST 2002 SRE corpus indicate that the accuracy of the conventional and proposed features are close to each other on clean data. On 0 dB SNR level, baseline FFT and the better of the proposed features give EERs of 17.4 % and 15.6 %, respectively. These accuracies improve to 11.6 % and 11.2 %, respectively, when spectral subtraction is included as a pre-processing method. The new features hold a promise for noise-robust speaker verification.

机译：我们考虑在加性噪声破坏下独立于文本的说话者验证。在流行的梅尔频率倒谱系数（MFCC）前端，我们用加权线性预测方法代替了传统的基于傅立叶的频谱估计，该方法早先在噪声鲁棒的语音识别中取得了成功。我们将线性预测（LP）建模的两个时间加权变量引入说话者验证，并将它们与通常用于计算MFCC的FFT和常规LP进行比较。我们还研究了语音增强（频谱减法）对系统性能的四个特征表示的影响。我们在NIST 2002 SRE语料库上进行的实验表明，在干净的数据上，常规功能和建议功能的准确性彼此接近。在SNR为0 dB的情况下，基线FFT和更好的拟议功能可使EER分别为17.4％和15.6％。当包括光谱减法作为预处理方法时，这些精度分别提高到11.6％和11.2％。这些新功能有望实现对噪声的扬声器验证。

著录项

来源
《Odyssey 2010: the speaker and language recognition workshop》|2010年|p.40-46|共7页
会议地点 Brno(CS)
作者
Rahim Saeidi; Jouni Pohjalainen; Tomi Kinnunen; Paavo Alku;
展开▼
作者单位

School of Computing, University of Eastern Finland, Finland;

Department of Signal Processing and Acoustics, Aalto University, Finland;

School of Computing, University of Eastern Finland, Finland;

Department of Signal Processing and Acoustics, Aalto University, Finland;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类语音信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. Mixture linear prediction Gammatone Cepstral features for robust speaker verification under transmission channel noise [J] . Ahmed Krobba, Mohamed Debyeche, Sid-Ahmed Selouani Multimedia Tools and Applications . 2020,第25a26期

机译：混合线性预测酚酮谱的临床特征在传输信道噪声下强大的扬声器验证
2. Linear prediction residual features for automatic speaker verification anti-spoofing [J] . Hanilci Cemal Multimedia Tools and Applications . 2018,第13期

机译：线性预测残差功能可实现自动说话人验证防欺骗
3. Speaker Verification using Weighted Local MFCC Features Extracted by Minimum Verification Error Learning [J] . Shunsuke Sakai, Toru Nozaki, Keisuke Kameyama Australian journal of intelligent information processing systems . 2010,第3期

机译：使用通过最小验证错误学习提取的加权本地MFCC功能进行说话人验证
4. ADDITIVE AND CONVOLUTIONAL NOISE CANCELING IN SPEAKER VERIFICATION USING A STOCHASTIC WEIGHTED VITERBI ALGORITHM [C] . Nestor Becerra Yoma, Miguel Villar Fernandez European conference on speech communication and technology . 2001

机译：使用随机加权维特比算法，添加剂和卷积噪声取消扬声器验证
5. Feature and model transformation techniques for robust speaker verification. [D] . Yiu, Kwok Kwong. 2005

机译：功能和模型转换技术可实现可靠的说话人验证。
6. Identifying Essential Features of Juvenile Psychopathy in the Prediction of Later Antisocial Behavior: Is There an Additive Synergistic or Curvilinear Role for Fearless Dominance? [O] . Colin E. Vize, Donald R. Lynam, Joanna Lamkin, -1

机译：在预测以后的反社会行为时确定青少年精神病的基本特征：无畏统治地位是否具有加性协同作用或曲线作用？
7. 1 Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification [O] . Rahim Saeidi, Student Member, Jouni Pohjalainen, 2011

机译：1用于解决说话人验证中加性噪声的时间加权线性预测特征
8. Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker Verification. [R] . Graciarena, M., Bocklet, T., Shriberg, E., 2013

机译：基于特征和基于通道的说话人验证中内在变异性分析。

Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise

摘要

著录项

相似文献

相关主题

期刊订阅