A Preliminary Study on GMM Weight Transformation for Emotional Speaker Recognition

机译：对情绪扬声器识别GMM重量转换的初步研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The performance of speaker recognition system degrades when the emotional states are inconsistent during the enrollment and evaluation stage. Emotional GMM model synthesis, such as NEGT (Neutral-Emotional GMM mean Transformation), is one way to reduce this degradation. This paper discovers that GMM weight transformation is also feasible and the number of parameters that need to be modified is much less than that of GMM mean ransformation. Thus, we propose two algorithms: RBFNN (Radial Basis Function Neural Network) and EBSR (Exemplar Based Sparse Representation) based GMM weight transformation to model the neutral-to-emotion weight transformation law for emotional GMM model synthesis. The experiments carried on MASC show that IR has been increased by 6.91% and 5.74% through these two algorithms respectively, compared with that of the GMM-UBM system. Meanwhile, these two algorithms require less development data and time compared with those of NEGT.

机译：当情绪状态在入学和评估阶段不一致时，扬声器识别系统的性能降低。情绪GMM模型合成，如Negt（中性 - 情绪GMM平均转换），是减少这种降级的一种方式。本文发现GMM重量转换也是可行的，需要修改的参数数量远低于GMM意义互换的参数。因此，我们提出了两种算法：RBFNN（径向基函数神经网络）和EBSR（基于示例性的稀疏表示）基于GMM权重转换，以模拟用于情绪GMM模型合成的中性到情绪重量转换法。与GMM-UBM系统的实验表明，IR分别通过这两种算法增加了6.91％和5.74％。同时，与Negt的那些，这两种算法需要较少的开发数据和时间。

著录项

来源
《Humaine Association Conference on Affective Computing and Intelligent Interaction》|2013年||共6页
会议地点
作者
Chen Li; Yang Yingchun;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算复杂性理论;
关键词
Emotional Speaker Recognition; Neural Network; Sparse Representation;

机译：情绪扬声器识别;神经网络;稀疏表示;

相似文献

外文文献
中文文献
专利

1. Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition [J] . MARC FERRAS, KOICHI SHINODA, SADAOKI FURUI 電子情報通信学会技術研究報告. 音声. Speech . 2010,第357期

机译：扬声器间加权MAP自适应用于GMM超向量扬声器识别
2. Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition [J] . MARC FERRAS, KOICHI SHINODA, SADAOKI FURUI 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2010,第356期

机译：扬声器间加权MAP自适应用于GMM超向量扬声器识别
3. Inter-speaker weighted MAP adaptation for GMM-supervector speaker recognition [J] . Marc Ferras, Koichi Shinoda, Sadaoki Furui 電子情報通信学会技術研究報告 . 2010,第356期

机译：扬声器间加权MAP自适应用于GMM超向量扬声器识别
4. A Preliminary Study on GMM Weight Transformation for Emotional Speaker Recognition [C] . Chen Li, Yang Yingchun 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction . 2013

机译：GMM权重变换对情感说话人识别的初步研究
5. Speaker Recognition: Evaluation for GMM-UBM and 3D Convolutional Neural Networks Systems [D] . Alghamdi, Mohammad S. 2019

机译：说话者识别：对GMM-UBM和3D卷积神经网络系统的评估
6. New transformed features generated by deep bottleneck extractor and a GMM–UBM classifier for speaker age and gender classification [O] . Arafat Abu Mallouh, Zakariya Qawaqneh, Buket D. Barkana -1

机译：由深瓶颈提取器和GMM–UBM分类器生成的新转换功能用于说话人年龄和性别分类
7. Learning Polynomial Function Based Neutral-Emotion GMM Transformation for Emotional Speaker Recognition [O] . Zhenyu Shan, Yingchun Yang 2012

机译：基于多项式函数的中性情绪Gmm变换在情绪说话人识别中的应用
8. Test Token Driven Acoustic Balancing for Sparse Enrollment Data in Cohort GMM Speaker Recognition [R] . Suh, J., Hansen, J. H. 2009

机译：在队列Gmm说话人识别中测试令牌驱动的声学平衡稀疏登记数据

A Preliminary Study on GMM Weight Transformation for Emotional Speaker Recognition

摘要

著录项

相似文献

相关主题

期刊订阅