A Two-Stage Hierarchical Bilingual Emotion Recognition System Using a Hidden Markov Model and Neural Networks

Deriche Mohamed; Absa Ahmed H. Abo

首页> 外文期刊>Arabian Journal for Science and Engineering >A Two-Stage Hierarchical Bilingual Emotion Recognition System Using a Hidden Markov Model and Neural Networks

【24h】

A Two-Stage Hierarchical Bilingual Emotion Recognition System Using a Hidden Markov Model and Neural Networks

机译：基于隐马尔可夫模型和神经网络的两阶段分级双语情感识别系统

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech emotion recognition continues to attract a lot of research especially under mixed-language scenarios. Here, we show that emotion is language dependent and that enhanced emotion recognition systems can be built when the language is known. We propose a two-stage emotion recognition system that starts by identifying the language, followed by a dedicated language-dependent recognition system for identifying the type of emotion. The system is able to recognize accurately the four main types of emotion, namely neutral, happy, angry, and sad. These types of emotion states are widely used in practical setups. To keep the computation complexity low, we identify the language using a feature vector consisting of energies from a basic wavelet decomposition. A hidden Markov model (HMM) is then used to track the changes of this vector to identify the language, achieving recognition accuracy close to 100%. Once the language is identified, a set of speech processing features including pitch and MFCCs are used with a neural network (NN) architecture to identify the emotion type. The results show that that identifying the language first can substantially improve the overall accuracy in identifying emotions. The overall accuracy achieved with the proposed system reached more than 93%. To test the robustness of the proposed methodology, we also used a Gaussian mixture model (GMM) for both language identification and emotion recognition. Our proposed HMM-NN approach showed a better performance than the GMM-based approach. More importantly, we tested the proposed algorithm with 6 emotions which are showed that the overall accuracy continues to be excellent, while the performance of the GMM-based approach deteriorates substantially. It is worth noting that the performance we achieved is close to the one attained for single language emotion recognition systems and outperforms by far recognition systems without language identification (around 60%). The work shows the strong correlation between language and type of emotion, and can further be extended to other scenarios including gender-based, facial expression-based, and age-based emotion recognition.

机译：语音情感识别继续吸引大量研究，尤其是在混合语言场景下。在这里，我们证明了情感是依赖于语言的，并且当已知该语言时可以构建增强的情感识别系统。我们提出了一个两阶段的情感识别系统，该系统首先识别语言，然后是用于识别情感类型的依赖于语言的专用识别系统。该系统能够准确识别四种主要的情绪类型，即中性，快乐，愤怒和悲伤。这些类型的情绪状态在实际设置中被广泛使用。为了保持较低的计算复杂度，我们使用由基本小波分解的能量组成的特征向量来识别语言。然后，使用隐马尔可夫模型（HMM）跟踪此向量的变化以识别语言，从而实现接近100％的识别精度。识别语言后，一组语音处理功能（包括音调和MFCC）将与神经网络（NN）体系结构一起使用，以识别情感类型。结果表明，首先识别语言可以大大提高识别情感的整体准确性。所提出的系统实现的总体精度超过93％。为了测试所提出方法的鲁棒性，我们还将高斯混合模型（GMM）用于语言识别和情感识别。我们提出的HMM-NN方法显示出比基于GMM的方法更好的性能。更重要的是，我们用6种情感测试了所提出的算法，这些算法表明总体准确性仍然非常好，而基于GMM的方法的性能却大大降低了。值得注意的是，我们获得的性能接近于单语言情感识别系统所达到的性能，并且优于没有语言识别的远距离识别系统（约60％）。该作品显示了语言和情感类型之间的强烈关联，并且可以进一步扩展到其他场景，包括基于性别，基于面部表情和基于年龄的情感识别。

著录项

来源
《Arabian Journal for Science and Engineering》 |2017年第12期|5231-5249|共19页
作者
Deriche Mohamed; Absa Ahmed H. Abo;
展开▼
作者单位

King Fahd Univ Petr & Minerals, Elect Engn Dept, Dhahran, Saudi Arabia;

King Fahd Univ Petr & Minerals, Elect Engn Dept, Dhahran, Saudi Arabia;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech emotion recognition; Language recognition; Hidden Markov model; Neural networks; Pattern recognition;

机译：语音情感识别语言识别隐马尔可夫模型神经网络模式识别;

相似文献

外文文献
中文文献
专利

1. Determination of tDETERMINATION OF 3D STRUCTURE OF GAG POLY PROTEIN ISOLATE 90CF056 OF HIV TYPE 1 BY HIDDEN MARKOV MODEL AND NEURAL NETWhree dimensional structure of Gag Poly Protein isolate 90CF056 of HIV type 1 by Hidden Markov Model and neural networks [J] . Jason Benjamin Undety, Daniel Alex Anand International Journal of Pharmacy and Pharmaceutical Sciences . 2014,第8期

机译：用隐马尔可夫模型和神经网络确定1型HIV的GAG分离蛋白90CF056的3D结构的结构用隐马尔可夫模型和神经网络确定1型HIV的Gag多聚蛋白分离物90CF056的三维结构
2. Determination of tDETERMINATION OF 3D STRUCTURE OF GAG POLY PROTEIN ISOLATE 90CF056 OF HIV TYPE 1 BY HIDDEN MARKOV MODEL AND NEURAL NETWhree dimensional structure of Gag Poly Protein isolate 90CF056 of HIV type 1 by Hidden Markov Model and neural networks [J] . Jason Benjamin Undety, Daniel Alex Anand International Journal of Pharmacy and Pharmaceutical Sciences . 2014,第8期

机译：用隐马尔可夫模型和神经网络确定1型HIV的GAG分离蛋白90CF056的3D结构的结构用隐马尔可夫模型和神经网络确定1型HIV的Gag多聚蛋白分离物90CF056的三维结构
3. A HYBRID SPEECH RECOGNITION SYSTEM WITH HIDDEN MARKOV MODEL AND RADIAL BASIS FUNCTION NEURAL NETWORK [J] . Judith Justin, Ila Vennila American journal of applied sciences . 2013,第10期

机译：具有隐马尔可夫模型和径向基函数神经网络的混合语音识别系统。
4. A Two-Stage Hierarchical Multilingual Emotion Recognition System Using Hidden Markov Models and Neural Networks [C] . Ahmed H. Abo absa, M. Deriche 2017 9th IEEE-GCC Conference and Exhibition . 2017

机译：基于隐马尔可夫模型和神经网络的两阶段分层多语言情感识别系统
5. A real-time hidden Markov model based action recognition system using body sensor networks. [D] . Mannil, Jerry Jolly. 2011

机译：一个基于实时隐马尔可夫模型的动作识别系统，使用人体传感器网络。
6. Epigenetic change detection and pattern recognition via Bayesian hierarchical hidden Markov models [O] . Xinlei Wang, Miao Zang, Guanghua Xiao -1

机译：贝叶斯分层隐马尔可夫模型的表观遗传变革检测与模式识别
7. A Hybrid Large Vocabulary Handwritten Word Recognition System using Neural Networks with Hidden Markov Models [O] . Alessandro L. Koerich, Yann Leydier, Robert Sabourin, 2002

机译：基于隐马尔可夫模型的神经网络混合大词汇手写单词识别系统
8. Speaker Recognition by Hidden Markov Models and Neural Networks [R] . Zeek, E. J. 1996

机译：隐马尔可夫模型和神经网络的说话人识别

A Two-Stage Hierarchical Bilingual Emotion Recognition System Using a Hidden Markov Model and Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅