...
首页> 外文期刊>Arabian Journal for Science and Engineering >A Two-Stage Hierarchical Bilingual Emotion Recognition System Using a Hidden Markov Model and Neural Networks
【24h】

A Two-Stage Hierarchical Bilingual Emotion Recognition System Using a Hidden Markov Model and Neural Networks

机译:基于隐马尔可夫模型和神经网络的两阶段分级双语情感识别系统

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Speech emotion recognition continues to attract a lot of research especially under mixed-language scenarios. Here, we show that emotion is language dependent and that enhanced emotion recognition systems can be built when the language is known. We propose a two-stage emotion recognition system that starts by identifying the language, followed by a dedicated language-dependent recognition system for identifying the type of emotion. The system is able to recognize accurately the four main types of emotion, namely neutral, happy, angry, and sad. These types of emotion states are widely used in practical setups. To keep the computation complexity low, we identify the language using a feature vector consisting of energies from a basic wavelet decomposition. A hidden Markov model (HMM) is then used to track the changes of this vector to identify the language, achieving recognition accuracy close to 100%. Once the language is identified, a set of speech processing features including pitch and MFCCs are used with a neural network (NN) architecture to identify the emotion type. The results show that that identifying the language first can substantially improve the overall accuracy in identifying emotions. The overall accuracy achieved with the proposed system reached more than 93%. To test the robustness of the proposed methodology, we also used a Gaussian mixture model (GMM) for both language identification and emotion recognition. Our proposed HMM-NN approach showed a better performance than the GMM-based approach. More importantly, we tested the proposed algorithm with 6 emotions which are showed that the overall accuracy continues to be excellent, while the performance of the GMM-based approach deteriorates substantially. It is worth noting that the performance we achieved is close to the one attained for single language emotion recognition systems and outperforms by far recognition systems without language identification (around 60%). The work shows the strong correlation between language and type of emotion, and can further be extended to other scenarios including gender-based, facial expression-based, and age-based emotion recognition.
机译:语音情感识别继续吸引大量研究,尤其是在混合语言场景下。在这里,我们证明了情感是依赖于语言的,并且当已知该语言时可以构建增强的情感识别系统。我们提出了一个两阶段的情感识别系统,该系统首先识别语言,然后是用于识别情感类型的依赖于语言的专用识别系统。该系统能够准确识别四种主要的情绪类型,即中性,快乐,愤怒和悲伤。这些类型的情绪状态在实际设置中被广泛使用。为了保持较低的计算复杂度,我们使用由基本小波分解的能量组成的特征向量来识别语言。然后,使用隐马尔可夫模型(HMM)跟踪此向量的变化以识别语言,从而实现接近100%的识别精度。识别语言后,一组语音处理功能(包括音调和MFCC)将与神经网络(NN)体系结构一起使用,以识别情感类型。结果表明,首先识别语言可以大大提高识别情感的整体准确性。所提出的系统实现的总体精度超过93%。为了测试所提出方法的鲁棒性,我们还将高斯混合模型(GMM)用于语言识别和情感识别。我们提出的HMM-NN方法显示出比基于GMM的方法更好的性能。更重要的是,我们用6种情感测试了所提出的算法,这些算法表明总体准确性仍然非常好,而基于GMM的方法的性能却大大降低了。值得注意的是,我们获得的性能接近于单语言情感识别系统所达到的性能,并且优于没有语言识别的远距离识别系统(约60%)。该作品显示了语言和情感类型之间的强烈关联,并且可以进一步扩展到其他场景,包括基于性别,基于面部表情和基于年龄的情感识别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号