首页> 外文会议>Humaine Association Conference on Affective Computing and Intelligent Interaction >A Preliminary Study on GMM Weight Transformation for Emotional Speaker Recognition
【24h】

A Preliminary Study on GMM Weight Transformation for Emotional Speaker Recognition

机译:对情绪扬声器识别GMM重量转换的初步研究

获取原文

摘要

The performance of speaker recognition system degrades when the emotional states are inconsistent during the enrollment and evaluation stage. Emotional GMM model synthesis, such as NEGT (Neutral-Emotional GMM mean Transformation), is one way to reduce this degradation. This paper discovers that GMM weight transformation is also feasible and the number of parameters that need to be modified is much less than that of GMM mean ransformation. Thus, we propose two algorithms: RBFNN (Radial Basis Function Neural Network) and EBSR (Exemplar Based Sparse Representation) based GMM weight transformation to model the neutral-to-emotion weight transformation law for emotional GMM model synthesis. The experiments carried on MASC show that IR has been increased by 6.91% and 5.74% through these two algorithms respectively, compared with that of the GMM-UBM system. Meanwhile, these two algorithms require less development data and time compared with those of NEGT.
机译:当情绪状态在入学和评估阶段不一致时,扬声器识别系统的性能降低。情绪GMM模型合成,如Negt(中性 - 情绪GMM平均转换),是减少这种降级的一种方式。本文发现GMM重量转换也是可行的,需要修改的参数数量远低于GMM意义互换的参数。因此,我们提出了两种算法:RBFNN(径向基函数神经网络)和EBSR(基于示例性的稀疏表示)基于GMM权重转换,以模拟用于情绪GMM模型合成的中性到情绪重量转换法。与GMM-UBM系统的实验表明,IR分别通过这两种算法增加了6.91%和5.74%。同时,与Negt的那些,这两种算法需要较少的开发数据和时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号