The Efficacy of Deep Learning-Based Mixed Model for Speech Emotion Recognition

Mohammad Amaz Uddin; Mohammad Salah Uddin Chowdury; Mayeen Uddin Khandaker; Nissren Tamam; Abdelmoneim Sulieman

首页> 中文期刊> 《计算机、材料和连续体（英文）》 >The Efficacy of Deep Learning-Based Mixed Model for Speech Emotion Recognition

The Efficacy of Deep Learning-Based Mixed Model for Speech Emotion Recognition

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Human speech indirectly represents the mental state or emotion of others.The use of Artificial Intelligence(AI)-based techniques may bring revolution in this modern era by recognizing emotion from speech.In this study,we introduced a robust method for emotion recognition from human speech using a well-performed preprocessing technique together with the deep learning-based mixed model consisting of Long Short-Term Memory(LSTM)and Convolutional Neural Network(CNN).About 2800 audio files were extracted from the Toronto emotional speech set(TESS)database for this study.A high pass and Savitzky Golay Filter have been used to obtain noise-free as well as smooth audio data.A total of seven types of emotions;Angry,Disgust,Fear,Happy,Neutral,Pleasant-surprise,and Sad were used in this study.Energy,Fundamental frequency,and Mel Frequency Cepstral Coefficient(MFCC)have been used to extract the emotion features,and these features resulted in 97.5%accuracy in the mixed LSTM+CNN model.This mixed model is found to be performed better than the usual state-of-the-art models in emotion recognition from speech.It also indicates that this mixed model could be effectively utilized in advanced research dealing with sound processing.

著录项

来源
《计算机、材料和连续体（英文）》 |2023年第1期|1709-1722|共14页
作者
Mohammad Amaz Uddin; Mohammad Salah Uddin Chowdury; Mayeen Uddin Khandaker; Nissren Tamam; Abdelmoneim Sulieman;
展开▼
作者单位

Department of Computer Science and Engineering Trust University Bangladesh;

Centre for Applied Physics and Radiation Technologies of Engineering and Technology University Sunway;

Department of Physics of Sciences Nourah bint Abdulrahman University.O Box 84428 Arabia;

Department of Radiology and Medical Imaging Sattam bin Abdulaziz University Arabia;

展开▼
原文格式 PDF
正文语种 chi
中图分类 TN9;
关键词
Emotion recognition; Savitzky Golay; fundamental frequency; MFCC; neural networks;

相似文献

中文文献
外文文献

1. Deep Learning-Based Approach for Arabic Visual Speech Recognition [J] . Nadia H.Alsulami ,Amani T.Jamal ,Lamiaa A.Elrefaei . 计算机、材料和连续体(英文) . 2022,第4期
2. Emotion Recognition from Occluded Facial Images Using Deep Ensemble Model [J] . Zia Ullah ,Muhammad Ismail Mohmand ,Sadaqat ur Rehman . 计算机、材料和连续体(英文) . 2022,第12期
3. Deep Learning-Based Emotion Detection [J] . Yuwei Chen ,Jianyu He . 电脑和通信(英文) . 2022,第2期
4. Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition [J] . Zhijie Wang ,Yue Zhao ,Licheng Wu . 计算机、材料和连续体(英文) . 2022,第10期
5. A Segmental VQ based Efficient Method for Mandarin Digits Speech Recognition [C] . . 第六届全国人机语音通讯学术会议 . 2001
6. Deep Learning based Speech Emotion Recognition by Fusing Acoustic Features and Transcriptions Clues [A] . 田天 . 2020

The Efficacy of Deep Learning-Based Mixed Model for Speech Emotion Recognition

摘要

著录项

相似文献

相关主题

期刊订阅