IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speaker identification from shouted speech: Analysis and compensation



Abstract

Text-independent speaker identification is studied using neutral and shouted speech in Finnish to analyze the effect of vocal mode mismatch between training and test utterances. Standard mel-frequency cepstral coefficient (MFCC) features with a Gaussian mixture model (GMM) recognizer are used for speaker identification. The results indicate that speaker identification accuracy drops from perfect (100 %) to 8.71 % under vocal mode mismatch. Because of this dramatic degradation in recognition accuracy, we propose a joint density GMM mapping technique to compensate the MFCC features. The mapping is trained on a disjoint emotional speech corpus to obtain a completely speaker- and speech-mode-independent emotion-neutralizing mapping. With this compensation, the 8.71 % identification accuracy increases to 32.00 %, without substantially degrading the matched train-test condition.
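The baseline recognizer described above (per-speaker GMMs over MFCC frames, identification by maximum log-likelihood) can be sketched as follows. This is an illustrative reconstruction, not the paper's code: the synthetic Gaussian "features" merely stand in for real MFCCs, and the speaker means, component counts, and frame counts are assumptions.

```python
# Sketch of GMM-based text-independent speaker identification.
# Synthetic features stand in for MFCCs extracted from real speech.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
n_mfcc = 13  # typical MFCC dimensionality

# Simulated training features per speaker: (frames x coefficients).
# The per-speaker means are arbitrary assumptions for illustration.
train = {
    "spk1": rng.normal(loc=0.0, scale=1.0, size=(500, n_mfcc)),
    "spk2": rng.normal(loc=2.0, scale=1.0, size=(500, n_mfcc)),
}

# One GMM per speaker, trained on that speaker's (neutral-mode) features.
models = {
    name: GaussianMixture(n_components=4, covariance_type="diag",
                          random_state=0).fit(feats)
    for name, feats in train.items()
}

def identify(test_feats):
    """Return the speaker whose GMM gives the highest mean log-likelihood."""
    scores = {name: gmm.score(test_feats) for name, gmm in models.items()}
    return max(scores, key=scores.get)

# A matched-mode test utterance drawn from speaker 2's distribution.
test = rng.normal(loc=2.0, scale=1.0, size=(200, n_mfcc))
print(identify(test))  # prints "spk2"
```

Under vocal mode mismatch, shouted test frames no longer fit the neutral-mode GMMs, which is what drives the accuracy collapse reported in the abstract.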
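The proposed compensation trains a GMM on joint vectors of paired shouted and neutral features, then maps a shouted frame to its conditional expectation under that model. A minimal sketch of this joint density mapping, assuming a synthetic paired corpus in which "shouted" features are a shifted, noisy copy of the neutral ones (the shift, dimensions, and component count are all illustrative assumptions):

```python
# Sketch of joint density GMM mapping: map shouted features x toward
# neutral features y via the conditional expectation E[y | x].
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(1)
d = 2  # feature dimension kept small for illustration (MFCCs would be ~13)

# Disjoint paired corpus: neutral frames y and "shouted" frames x.
# The constant shift stands in for the shout-induced spectral offset.
y = rng.normal(size=(1000, d))
x = y + 1.5 + 0.1 * rng.normal(size=(1000, d))

# Fit a GMM on the joint vectors z = [x; y].
joint = GaussianMixture(n_components=2, covariance_type="full",
                        random_state=0).fit(np.hstack([x, y]))

def compensate(x_frames):
    """Map shouted frames to E[y | x] under the joint GMM."""
    K = joint.n_components
    # Posterior responsibilities p(k | x) from the marginal GMM over x.
    log_resp = np.zeros((len(x_frames), K))
    for k in range(K):
        diff = x_frames - joint.means_[k, :d]
        cov_xx = joint.covariances_[k, :d, :d]
        inv_xx = np.linalg.inv(cov_xx)
        _, logdet = np.linalg.slogdet(cov_xx)
        # Constant terms cancel across components, so they are omitted.
        log_resp[:, k] = (np.log(joint.weights_[k]) - 0.5 * logdet
                          - 0.5 * np.einsum("ni,ij,nj->n", diff, inv_xx, diff))
    log_resp -= log_resp.max(axis=1, keepdims=True)
    resp = np.exp(log_resp)
    resp /= resp.sum(axis=1, keepdims=True)

    # Mixture of per-component conditional means:
    # E[y | x, k] = mu_y_k + Cov_yx_k Cov_xx_k^{-1} (x - mu_x_k)
    out = np.zeros_like(x_frames)
    for k in range(K):
        cov_yx = joint.covariances_[k, d:, :d]
        inv_xx = np.linalg.inv(joint.covariances_[k, :d, :d])
        cond = joint.means_[k, d:] + (x_frames - joint.means_[k, :d]) @ (cov_yx @ inv_xx).T
        out += resp[:, [k]] * cond
    return out

mapped = compensate(x)
print(np.abs(mapped - y).mean())  # far below the 1.5 shift left uncompensated
```

At recognition time, shouted test frames would be passed through `compensate` before scoring against the neutral-mode speaker GMMs, which is how the abstract's 8.71 % to 32.00 % improvement is obtained.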
