...
首页> 外文期刊>Iranian Journal of Science and Technology, Transactions of Electrical Engineering >A Multimodal Emotion Recognition System Using Facial Landmark Analysis
【24h】

A Multimodal Emotion Recognition System Using Facial Landmark Analysis

机译:使用面部地标分析的多模式情感识别系统

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper introduces a multimodal emotion recognition system based on two different modalities, i.e., affective speech and facial expression. For affective speech, the common low-level descriptors including prosodic and spectral audio features (i.e., energy, zero crossing rate, MFCC, LPC, PLP and temporal derivatives) are extracted, whereas a novel visual feature extraction method is proposed in the case of facial expression. This method exploits the displacement of specific landmarks across consecutive frames of an utterance for feature extraction. To this end, the time series of temporal variations for each landmark is analyzed individually for extracting primary visual features, and then, the extracted features of all landmarks are concatenated for constructing the final feature vector. The analysis of displacement signal of landmarks is performed by the discrete wavelet transform which is a widely used mathematical transform in signal processing applications. In order to reduce the complexity of derived models and improve the efficiency, a variety of dimensionality-reduction schemes are applied. Furthermore, to exploit the advantages of multimodal emotion recognition systems, the feature-level fusion of the audio and the proposed visual features is examined. Results of experiments conducted on three SAVEE, RML and eNTERFACE05 databases show the efficiency of proposed visual feature extraction method in terms of performance criteria.
机译:本文介绍了基于两种不同方式的多模式情绪识别系统,即情感言论和面部表情。对于情感语音,提取包括韵律和光谱音频特征(即能量,零交叉率,MFCC,LPC,PLP和时间衍生物)的常见低级描述符,而在此情况下提出了一种新颖的视觉特征提取方法表情。该方法利用特定地标在具有特征提取的话语的连续帧中的位移。为此,单独分析每个地标的时间变化的时间序列用于提取主视觉特征,然后,所有地标的提取特征被连接到构建最终特征向量。通过离散小波变换来执行地标的分析,这是信号处理应用中广泛使用的数学变换。为了降低衍生模型的复杂性并提高效率,应用了各种维度减少方案。此外,为了利用多模式情绪识别系统的优点,检查了音频和所提出的视觉特征的特征级融合。在三个Savee,RML和EnterFace05数据库上进行的实验结果显示了在绩效标准方面提出了所提出的视觉特征提取方法的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号