DBN-ivector Framework for Acoustic Emotion Recognition

Abstract

Deep learning and i-vectors have recently been used successfully in speech and speaker recognition. In this work we propose a framework based on a deep belief network (DBN) and i-vector space modeling for acoustic emotion recognition. We use two types of labels for frame-level DBN training. The first is the vector of posterior probabilities calculated from the GMM universal background model (UBM). The second is the predicted label based on the GMMs. The DBN is trained to minimize errors for both types. After DBN training, we use the vector of posterior probabilities estimated by the DBN in place of the UBM posteriors for i-vector extraction. Finally, the extracted i-vectors are used in backend classifiers for emotion recognition. Our experiments on the USC IEMOCAP data show the effectiveness of the proposed DBN-ivector framework. In particular, with decision-level combination, the proposed system yields significant improvements in both unweighted and weighted accuracy.
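
The substitution step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the diagonal-covariance GMM, the function names gmm_posteriors and baum_welch_stats, and the random toy data are all assumptions made for illustration. The sketch computes per-frame posteriors from a GMM, then accumulates the zeroth- and first-order Baum-Welch statistics that i-vector extraction commonly consumes. Because the statistics depend only on the posterior matrix, a matrix of the same shape produced by another model (such as a DBN) could be passed in instead.

```python
import numpy as np

def gmm_posteriors(frames, weights, means, variances):
    """Per-frame component posteriors for a diagonal-covariance GMM.

    frames: (T, D); weights: (C,); means and variances: (C, D).
    Returns a (T, C) matrix whose rows sum to 1.
    """
    diff = frames[:, None, :] - means[None, :, :]              # (T, C, D)
    log_gauss = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)                  # Mahalanobis term
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)      # normalizer, (C,)
    )                                                          # (T, C)
    log_joint = np.log(weights) + log_gauss
    # Subtract the row maximum before exponentiating for numerical stability.
    log_joint -= log_joint.max(axis=1, keepdims=True)
    post = np.exp(log_joint)
    return post / post.sum(axis=1, keepdims=True)

def baum_welch_stats(frames, posteriors):
    """Zeroth-order (C,) and first-order (C, D) sufficient statistics.

    `posteriors` can come from any frame-level model, as long as it is
    a (T, C) matrix of per-frame probabilities.
    """
    n = posteriors.sum(axis=0)        # soft frame count per component
    f = posteriors.T @ frames         # posterior-weighted sum of frames
    return n, f

# Toy data (hypothetical sizes): 50 frames, 4-dim features, 3 components.
rng = np.random.default_rng(0)
frames = rng.normal(size=(50, 4))
weights = np.full(3, 1.0 / 3.0)
means = rng.normal(size=(3, 4))
variances = np.ones((3, 4))

ubm_post = gmm_posteriors(frames, weights, means, variances)
n, f = baum_welch_stats(frames, ubm_post)
```

Keeping the posterior computation separate from the statistics accumulation is what makes the swap possible: only the first function is model-specific.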

Bibliographic details

Source: Annual Conference of the International Speech Communication Association | 2016 | 744p | 5 pages
Authors: Rui Xia; Yang Liu
Original format: PDF
CLC classification: TB95-53
Date indexed: 2022-08-21 11:41:06

Similar literature

1. Recognition of Emotions in Mexican Spanish Speech: An Approach Based on Acoustic Modelling of Emotion-Specific Vowels [J]. Santiago-Omar Caballero-Morales. ScientificWorldJournal. 2013, Issue 3
2. Ambient acoustic event assistive framework for identification, detection, and recognition of unknown acoustic events of a residence [J]. Sharnil Pandya, Hemant Ghayvat. Advanced Engineering Informatics. 2021
3. A Comparative Approach to Human Recognition of Emotions Using Acoustic Properties of Human and Non-Human Primate Vocalisations [J]. Gruber Thibaud, Debracque Coralie, Slocombe Katie. Folia Primatologica: International Journal of Primatology = Internationale Zeitschrift für Primatologie = Journal International de Primatologie. 2020, Issue 3
4. DBN-ivector Framework for Acoustic Emotion Recognition [C]. Rui Xia, Yang Liu. Annual Conference of the International Speech Communication Association. 2016
5. Analysis of Physiological Signals Using a Semi-Supervised Framework for Emotion Recognition [D]. Popoola, Gabriel A. 2020
6. Recognition of Emotions in Mexican Spanish Speech: An Approach Based on Acoustic Modelling of Emotion-Specific Vowels [O]. Santiago-Omar Caballero-Morales. 2013
7. On the Use of Self-Supervised Pre-Trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition [O]. Manon Macary, Marie Tahon, Yannick Esteve. 2021
