Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network


Abstract

The automatic recognition of spontaneous emotions from speech is a challenging task. On the one hand, acoustic features need to be robust enough to capture the emotional content for various styles of speaking, while on the other, machine learning algorithms need to be insensitive to outliers while being able to model the context. Whereas the latter has been tackled by the use of Long Short-Term Memory (LSTM) networks, the former is still under very active investigation, even though more than a decade of research has provided a large set of acoustic descriptors. In this paper, we propose a solution to the problem of "context-aware" emotionally relevant feature extraction by combining Convolutional Neural Networks (CNNs) with LSTM networks, in order to automatically learn the best representation of the speech signal directly from the raw time representation. In this novel work on so-called end-to-end speech emotion recognition, we show that the proposed topology significantly outperforms traditional approaches based on signal processing techniques for the prediction of spontaneous and natural emotions on the RECOLA database.


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2016 | pp. 5200-5204 | 5 pages
Authors
George Trigeorgis; Fabien Ringeval; Raymond Brueckner; Erik Marchi; Mihalis A. Nicolaou; Björn Schuller; Stefanos Zafeiriou
Original format: PDF
Keywords
CNN; LSTM; deep learning; emotion recognition; end-to-end learning; raw waveform;


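Similar literature

1. End-to-end document semantic segmentation based on dilated convolutional networks [J]. Xu Canhui, Shi Cao, Chen Yinong. Journal of Central South University (English Edition). 2021, No. 006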
2. Deep and shallow features fusion based on deep convolutional neural network for speech emotion recognition [J]. Linhui Sun, Jia Chen, Keli Xie. International Journal of Speech Technology. 2018, No. 4
3. Speech Emotion Recognition Using Deep Convolutional Neural Network and Simple Recurrent Unit [J]. Pengxu Jiang, Hongliang Fu, Huawei Tao. Engineering Letters. 2019, No. 4
4. Emotion recognition from speech using deep recurrent neural networks with acoustic features [J]. Byun Sung-Woo, Shin Bo-Ra, Lee Seok-Pil. Basic & Clinical Pharmacology & Toxicology. 2019, No. S7
5. Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network [C]. George Trigeorgis, Fabien Ringeval, Raymond Brueckner. IEEE International Conference on Acoustics, Speech and Signal Processing. 2016
6. Emotion Recognition Using Deep Convolutional Neural Network with Large Scale Physiological Data [D]. Sharma, Astha. 2018
7. Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network [O]. Misbah Farooq, Fawad Hussain, Naveed Khan Baloch. 2020
8. Adieu Features? End-to-End Speech Emotion Recognition using a Deep Convolutional Recurrent Network [O]. Trigeorgis, George, Ringeval, Fabien, Brueckner, Raymond. 2016
