Mismatched feature detection with finer granularity for emotional speaker recognition

Li Chen; Ying-chun Yang; Zhao-hui Wu


Abstract

The shapes of speakers’ vocal organs change with their emotional state, so the emotional acoustic space of short-time features deviates from the neutral acoustic space and speaker recognition performance degrades. Features that deviate greatly from the neutral acoustic space are considered mismatched features, and they negatively affect speaker recognition systems. Emotion variation deforms the features of different phonemes differently, so it is reasonable to build a finer model that detects mismatched features under each phoneme. However, given the difficulty of phoneme recognition, three kinds of acoustic class recognition are proposed to replace phoneme recognition: phoneme classes, a Gaussian mixture model (GMM) tokenizer, and a probabilistic GMM tokenizer. We propose feature pruning and feature regulation methods that process the mismatched features to improve speaker recognition performance. For feature regulation, the transformation matrix that regulates the mismatched features is trained by maximizing the between-class distance and minimizing the within-class distance. Experiments on the Mandarin affective speech corpus (MASC) show that our feature pruning and feature regulation methods increase the identification rate (IR) by 3.64% and 6.77%, respectively, compared with the baseline GMM-UBM (universal background model) algorithm. Corresponding IR increases of 2.09% and 3.32% are obtained when our methods are applied to the state-of-the-art i-vector algorithm.
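The idea of class-wise detection followed by feature pruning can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the detection criterion, so the sketch assumes that a frame is assigned to the neutral GMM component with the highest weighted likelihood (a simple GMM tokenizer) and is pruned when its score falls below a per-class floor. The names `tokenize`, `prune_mismatched`, and `thresholds`, the toy model, and the threshold values are all hypothetical.

```python
import math


def diag_gauss_logpdf(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at frame x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def tokenize(frame, components):
    """Return (index, score) of the component with the highest weighted log-likelihood."""
    scores = [
        math.log(weight) + diag_gauss_logpdf(frame, mean, var)
        for weight, mean, var in components
    ]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


def prune_mismatched(frames, components, thresholds):
    """Keep only frames whose score reaches the floor of their own acoustic class."""
    kept = []
    for frame in frames:
        k, score = tokenize(frame, components)
        if score >= thresholds[k]:  # assumed criterion, not taken from the paper
            kept.append(frame)
    return kept


# Toy neutral model: two acoustic classes in a 2-D feature space,
# each given as (weight, mean, diagonal variance).
components = [
    (0.5, [0.0, 0.0], [1.0, 1.0]),
    (0.5, [5.0, 5.0], [1.0, 1.0]),
]
thresholds = [-6.0, -6.0]  # hypothetical per-class log-likelihood floors

frames = [[0.1, -0.2], [5.2, 4.9], [2.5, 2.5]]
kept = prune_mismatched(frames, components, thresholds)
print(kept)
```

In this toy setup the third frame lies far from both class means, so it scores below the floor of its class and is dropped, while the two frames near a class mean are kept. Using one floor per class rather than a single global floor is what makes the detection finer-grained in this sketch.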


Bibliographic details

Source: Journal of Zhejiang University Science | 2014, Issue 10 | 14 pages
Authors: Li Chen; Ying-chun Yang; Zhao-hui Wu
Original format: PDF
CLC classification: Computing technology, computer technology

