首页> 外文期刊>International journal of multimedia data engineering & management >Audiovisual Facial Action Unit Recognition using Feature Level Fusion
【24h】

Audiovisual Facial Action Unit Recognition using Feature Level Fusion

机译:使用特征级融合的视听面部动作单元识别

获取原文
获取原文并翻译 | 示例
       

摘要

Recognizing facial actions is challenging, especially when they are accompanied with speech. Instead of employing information solely from the visual channel, this work aims to exploit information from both visual and audio channels in recognizing speech-related facial action units (AUs). In this work, two feature-level fusion methods are proposed. The first method is based on a kind of human-crafted visual feature. The other method utilizes visual features learned by a deep convolutional neural network (CNN). For both methods, features are independently extracted from visual and audio channels and aligned to handle the difference in time scales and the time shift between the two signals. These temporally aligned features are integrated via feature-level fusion for AU recognition. Experimental results on a new audiovisual AU-coded dataset have demonstrated that both fusion methods outperform their visual counterparts in recognizing speech-related AUs. The improvement is more impressive with occlusions on the facial images, which would not affect the audio channel.
机译:识别面部动作具有挑战性,尤其是在伴随语音的情况下。这项工作不是在视觉渠道中仅使用信息,而是旨在利用视觉和音频渠道中的信息来识别与语音相关的面部动作单元(AU)。在这项工作中,提出了两种特征级融合方法。第一种方法是基于一种人工视觉特征。另一种方法利用了深度卷积神经网络(CNN)学习的视觉特征。对于这两种方法,特征都是从视觉和音频通道中独立提取的,并且经过对齐以处理两个信号之间的时标差异和时移。这些时间对齐的特征通过特征级别融合进行集成,以进行AU识别。在新的视听AU编码数据集上的实验结果表明,在识别与语音相关的AU时,两种融合方法均优于其视觉对应方法。面部图像上的遮挡不会对音频通道造成影响,因此改进效果更为明显。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号