Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals

Lovejit Singh

首页> 外文期刊>International journal of swarm intelligence >Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals

【24h】

Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals

机译：Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

The human emotion detection in audio-video signals is a challenging task. This paper proposed deep bi-directional long short-term memory (Bi-LSTM) network with convolution neural network (CNN) features-based human emotion detection method. First, it utilises the transfer learning Inception-ResNet V2 model to extract the CNN features from audio and video modalities. Furthermore, the frame-wise CNN features sequential information is learned by two separate Bi-LSTM models for audio and video channels, respectively. The weighted product rule-based decision level fusion method computes the final confidence scores with the output probabilities of two independent Bi-LSTM models. The proposed approach is validated, tested, and compared with existing deep learning-based audio-video emotion detection methods on the challenging Ryerson audio-visual database of emotional speech and song (RAVDESS). The experimental results show that the proposed approach has outperformed the existing methods. It has attained 81.03% validation and 83.98% testing emotion detection accuracy on RAVDESS dataset.

著录项

来源
《International journal of swarm intelligence》 |2022年第1期|110-122|共13页
作者
Lovejit Singh;
展开▼
作者单位

Department of Computer Science and Engineering, University Institute of Engineering, Chandigarh University;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类
关键词
convolution neural network; CNN; bi-directional long short-term memory network; emotion recognition;

Deep bi-directional LSTM network with CNN features for human emotion recognition in audio-video signals

摘要

著录项

相关主题

期刊订阅