IEEE International Conference on Acoustics, Speech and Signal Processing

Improving Speech Emotion Recognition with Unsupervised Representation Learning on Unlabeled Speech

Abstract

In this paper we present our findings on how representation learning on large unlabeled speech corpora can be beneficially utilized for speech emotion recognition (SER). Prior work on representation learning for SER mostly focused on relatively small emotional speech datasets without making use of additional unlabeled speech data. We show that integrating representations learnt by an unsupervised autoencoder into a CNN-based emotion classifier improves recognition accuracy. To gain insights into what those models learn, we analyze visualizations of the different representations using t-distributed stochastic neighbor embedding (t-SNE). We evaluate our approach on IEMOCAP and MSP-IMPROV by means of within- and cross-corpus testing.
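
A minimal sketch of the approach the abstract describes: representations learned by an unsupervised autoencoder are fed into a CNN-based emotion classifier. The sketch below uses PyTorch; all layer sizes, feature dimensions, the class count, and the fusion-by-concatenation step are illustrative assumptions, not details taken from the paper.

```python
# Sketch only (not the authors' code): an unsupervised autoencoder supplies a
# learned representation that a CNN-based classifier consumes alongside its own
# convolutional features. All dimensions below are assumptions for illustration.
import torch
import torch.nn as nn

class Autoencoder(nn.Module):
    """Autoencoder pretrained on unlabeled speech features (reconstruction loss)."""
    def __init__(self, n_feats=40, latent_dim=64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_feats, 128), nn.ReLU(),
                                     nn.Linear(128, latent_dim))
        self.decoder = nn.Sequential(nn.Linear(latent_dim, 128), nn.ReLU(),
                                     nn.Linear(128, n_feats))

    def forward(self, x):
        z = self.encoder(x)           # learned representation
        return self.decoder(z), z     # reconstruction and latent code

class EmotionCNN(nn.Module):
    """CNN emotion classifier that also consumes the autoencoder latent code."""
    def __init__(self, n_feats=40, latent_dim=64, n_classes=4):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(n_feats, 64, kernel_size=5, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool1d(1))
        self.classifier = nn.Linear(64 + latent_dim, n_classes)

    def forward(self, feats, latent):
        # feats: (batch, n_feats, time); latent: (batch, latent_dim)
        h = self.conv(feats).squeeze(-1)              # (batch, 64)
        return self.classifier(torch.cat([h, latent], dim=1))

# Intended use, following the abstract: pretrain the autoencoder on large
# unlabeled speech corpora, then train the classifier on labeled emotion data
# (e.g. IEMOCAP, MSP-IMPROV) while reusing the encoder's representation.
ae, clf = Autoencoder(), EmotionCNN()
feats = torch.randn(8, 40, 300)                       # e.g. 40-dim frame features
_, z = ae(feats.mean(dim=2))                          # utterance-level latent code
logits = clf(feats, z.detach())                       # scores for 4 emotion classes
```

For the visualization step mentioned in the abstract, the learned representations could be projected to 2-D with scikit-learn's sklearn.manifold.TSNE, which implements t-SNE.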
