Optimized multi-channel deep neural network with 2D graphical representation of acoustic speech features for emotion recognition

机译：优化的多通道深度神经网络，具有用于语音识别的语音特征二维图形表示

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This study investigates the effectiveness of speech emotion recognition using a new approach called the Optimized Multi-Channel Deep Neural Network (OMC-DNN), The proposed method has been tested with input features given as simple 2D black and white images representing graphs of the MFCC coefficients or the TEO parameters calculated either from speech (MFCC-S, TEO-S) or glottal waveforms (MFCC-G, TEO-G). A comparison with 6 different single-channel benchmark classifiers has shown that the OMC-DNN provided the best performance in both pair-wise (emotion vs. neutral) and simultaneous multiclass recognition of 7 emotions (anger, boredom, disgust, happiness, fear, sadness and neutral). In the pair-wise case, the OMC-DNN outperformed the single-channel DNN by 5%-10% depending on the feature set. In the multiclass case, the OMC-DNN outperformed or matched the singlechannel equivalents for all features. The speech spectrum and the glottal energy characteristics were identified as two important factors in discriminating between different types of categorical emotions in speech.

机译：这项研究使用一种称为优化多通道深度神经网络（OMC-DNN）的新方法研究了语音情感识别的有效性。该方法已通过输入特征（如代表MFCC图的简单2D黑白图像）进行了测试。从语音（MFCC-S，TEO-S）或声门波形（MFCC-G，TEO-G）计算出的系数或TEO参数。与6种不同的单通道基准分类器进行的比较表明，OMC-DNN在7种情绪（愤怒，无聊，厌恶，幸福，恐惧，悲伤和中立）。在成对情况下，取决于功能集，OMC-DNN的性能优于单通道DNN的5％-10％。在多类情况下，OMC-DNN的所有功能均优于或匹配单通道等效功能。语音频谱和声门能量特征被认为是区分语音中不同类别情感的两个重要因素。

著录项

来源
《International Conference on Signal Processing and Communication Systems》|2014年|1-6|共6页
会议地点
作者
Stolar Melissa N.; Lech Margaret; Burnett Ian S.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
acoustic signal processing; emotion recognition; neural nets; speech processing; 2D black images; 2D graphical representation; MFCC-G; MFCC-S; OMC-DNN; TEO-G; TEO-S; acoustic speech features; categorical emotions; glottal energy characteristics; optimized multichannel deep neural network; single-channel DNN; single-channel benchmark classifiers; speech emotion recognition; speech spectrum; white images; Accuracy; Artificial neural networks; Benchmark testing; Emotion recognition; Speech; Speech recognition; 2D features; deep neural network; emotion recognition; multichannel speech classification;

机译：声信号处理;情感识别;神经网络;语音处理; 2D黑色图像; 2D图形表示; MFCC-G; MFCC-S; OMC-DNN; TEO-G; TEO-S;语音特征;类别情感;声门能量特征;优化的多通道深度神经网络;单通道DNN;单通道基准分类器;语音情感识别;语音频谱;白图像;精度;人工神经网络;基准测试;情感识别;语音;语音识别;二维特征;深度神经网络;情感识别;多通道语音分类;

相似文献

外文文献
中文文献
专利

1. Emotion recognition from speech using deep recurrent neural networks with acoustic features [J] . Byun Sung-Woo, Shin Bo-Ra, Lee Seok-Pil, Basic & clinical pharmacology & toxicology. . 2019,第S7期

机译：使用深度经常性神经网络具有声学特征的情感认识
2. Learning Deep Binaural Representations With Deep Convolutional Neural Networks for Spontaneous Speech Emotion Recognition [J] . Zhang Shiqing, Chen Aihua, Guo Wenping, Quality Control, Transactions . 2020,第期

机译：学习深层卷积神经网络的深层双耳陈述，用于自发言论情绪识别
3. Deep and shallow features fusion based on deep convolutional neural network for speech emotion recognition [J] . Linhui Sun, Jia Chen, Keli Xie, International journal of speech technology . 2018,第4期

机译：基于深度卷积神经网络的深浅特征融合在语音情感识别中的应用
4. Learning utterance-level representations for speech emotion and age/gender recognition using deep neural networks [C] . Zhong-Qiu Wang, Ivan Tashev IEEE International Conference on Acoustics, Speech and Signal Processing . 2017

机译：使用深度神经网络学习语音表达和年龄/性别识别的话语级表示
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network [O] . Misbah Farooq, Fawad Hussain, Naveed Khan Baloch, 2020

机译：利用深卷积神经网络对语音情感识别的特征选择算法的影响
7. A Study on a Speech Emotion Recognition System with Effective Acoustic Features Using Deep Learning Algorithms [O] . Sung-Woo Byun, Seok-Pil Lee 2021

机译：利用深层学习算法具有有效声学特征的语音情感识别系统研究

Optimized multi-channel deep neural network with 2D graphical representation of acoustic speech features for emotion recognition

摘要

著录项

相似文献

相关主题

期刊订阅