首页> 外文会议>Annual Conference of the International Speech Communication Association >DBN-ivector Framework for Acoustic Emotion Recognition
【24h】

DBN-ivector Framework for Acoustic Emotion Recognition

机译:DBN-IVERCORCORION识别框架

获取原文

摘要

Deep learning and i-vectors have been successfully used in speech and speaker recognition recently. In this work we propose a framework based on deep belief network (DBN) and i-vector space modeling for acoustic emotion recognition. We use two types of labels for frame level DBN training. The first one is the vector of posterior probabilities calculated from the GMM universal background model (UBM). The second one is the predicted label based on the GMMs. The DBN is trained to minimize errors for both types. After DBN training, we use the vector of posterior probabilities estimated by DBN to replace the UBM for i-vector extraction. Finally the extracted i-vectors are used in backend classifiers for emotion recognition. Our experiments on the USC IEMOCAP data show the effectiveness of our proposed DBN-ivector framework. In particular, with decision level combination, our proposed system yields significant improvement on both unweighted and weighted accuracy.
机译:深入学习和I-vectors最近已成功用于言语和扬声器识别。 在这项工作中,我们提出了一种基于深度信仰网络(DBN)和I - Vector Space建模的框架,用于声学情感识别。 我们使用两种类型的标签进行帧级DBN培训。 第一个是由GMM通用背景模型(UBM)计算的后验概率的向量。 第二个是基于GMMS的预测标签。 培训DBN以最小化两种类型的错误。 在DBN培训之后,我们使用DBN估计的后验概率的向量替换为I - 矢量提取的UBM。 最后,提取的I载体用于情感识别的后端分类器。 我们对USC IEMocap数据的实验表明了我们提出的DBN-Ivector框架的有效性。 特别是,对于决策水平组合,我们所提出的系统对两种未加权和加权准确性产生重大改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号