Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks

机译：梯形网络中半监督学习的语音情感识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

As a major branch of speech processing, speech emotion recognition has drawn much attention of researchers. Prior works have proposed a variety of models and feature sets for training a system. In this paper, we propose to use semi-supervised learning with ladder networks to generate robust feature representation for speech emotion recognition. In our method, the input of ladder network is the normalized static acoustic features and is mapped to high level hidden representations. The model is trained to simultaneously minimize the sum of supervised and unsupervised cost functions by back-propagation. The extracted hidden representations are used as emotional features in SVM model for speech emotion recognition. The experimental results, performed on IEMOCAP database, show 2.6% higher performance than denoising auto-encoder, and 5.3% than the static acoustic features.

机译：语音情感识别作为语音处理的主要分支，引起了研究者的广泛关注。先前的工作提出了用于训练系统的各种模型和特征集。在本文中，我们建议使用带有梯形网络的半监督学习来生成用于语音情感识别的鲁棒特征表示。在我们的方法中，梯形网络的输入是归一化的静态声学特征，并映射到高级隐藏表示。对模型进行训练，以通过反向传播同时最小化监督和非监督成本函数的总和。提取的隐藏表示用作SVM模型中的情感特征，以进行语音情感识别。在IEMOCAP数据库上执行的实验结果显示，比降噪自动编码器的性能高2.6％，比静态声学功能的性能高5.3％。

著录项

来源
《2018 First Asian Conference on Affective Computing and Intelligent Interaction》|2018年|1-5|共5页
会议地点 Beijing(CN)
作者
Jian Huang; Ya Li; Jianhua Tao; Zheng Lian; Mingyue Niu; Jiangyan Yi;
展开▼
作者单位

National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

National Laboratory of Pattern Recognition, (NLPR), School of Artificial Intelligence, Institute of Automation, CAS, University of Chinese Academy of Sciences, Beijing, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Feature extraction; Emotion recognition; Speech recognition; Acoustics; Decoding; Automation; Supervised learning;

机译：特征提取;情感识别;语音识别;声学;解码;自动化;监督学习;;
入库时间 2022-08-26 14:26:36

相似文献

外文文献
中文文献
专利

1. 基于随机权神经网络的在线自适应半监督学习算法及其在工业过程产品质量评价中的应用 [J] . 代伟, 胡金成, 程玉虎, 中南大学学报（英文版） . 2019,第012期
2. Learning Deep Binaural Representations With Deep Convolutional Neural Networks for Spontaneous Speech Emotion Recognition [J] . Zhang Shiqing, Chen Aihua, Guo Wenping, Quality Control, Transactions . 2020,第期

机译：学习深层卷积神经网络的深层双耳陈述，用于自发言论情绪识别
3. Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks [J] . Mao Q., Dong M., Huang Z., Multimedia, IEEE Transactions on . 2014,第8期

机译：使用卷积 /神经网络学习语音情感的显着特征
4. Recognition of speech emotion using custom 2D-convolution neural network deep learning algorithm [J] . Zvarevashe Kudakwashe, Olugbara Oludayo O. Intelligent data analysis . 2020,第5期

机译：使用自定义2D卷积神经网络深度学习算法识别语音情绪
5. Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks [C] . Jian Huang, Ya Li, Jianhua Tao, Asian Conference on Affective Computing and Intelligent Interaction . 2018

机译：用梯形网络使用半监督学习的语音情感认知
6. Graph-based Semi-Supervised Learning in Acoustic Modeling for Automatic Speech Recognition. [D] . Liu, Yuzong. 2016

机译：用于自动语音识别的声学建模中基于图的半监督学习。
7. Possibilistic Clustering-Promoting Semi-Supervised Learning for EEG-Based Emotion Recognition [O] . Yufang Dan, Jianwen Tao, Jianjing Fu, 2021

机译：可能的聚类 - 促进基于脑电乐的情感识别的半监督学习
8. Semi-supervised Ladder Networks for Speech Emotion Recognition [O] . Jian-Hua Tao, Jian Huang, Ya Li, 2019

机译：用于语音情感识别的半监督梯形网络

Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks

摘要

著录项

相似文献

相关主题

期刊订阅