Audiovisual Facial Action Unit Recognition using Feature Level Fusion

Zibo Meng; Shizhong Han; Min Chen; Yan Tong

首页> 外文期刊>International journal of multimedia data engineering & management >Audiovisual Facial Action Unit Recognition using Feature Level Fusion

【24h】

Audiovisual Facial Action Unit Recognition using Feature Level Fusion

机译：使用特征级融合的视听面部动作单元识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recognizing facial actions is challenging, especially when they are accompanied with speech. Instead of employing information solely from the visual channel, this work aims to exploit information from both visual and audio channels in recognizing speech-related facial action units (AUs). In this work, two feature-level fusion methods are proposed. The first method is based on a kind of human-crafted visual feature. The other method utilizes visual features learned by a deep convolutional neural network (CNN). For both methods, features are independently extracted from visual and audio channels and aligned to handle the difference in time scales and the time shift between the two signals. These temporally aligned features are integrated via feature-level fusion for AU recognition. Experimental results on a new audiovisual AU-coded dataset have demonstrated that both fusion methods outperform their visual counterparts in recognizing speech-related AUs. The improvement is more impressive with occlusions on the facial images, which would not affect the audio channel.

机译：识别面部动作具有挑战性，尤其是在伴随语音的情况下。这项工作不是在视觉渠道中仅使用信息，而是旨在利用视觉和音频渠道中的信息来识别与语音相关的面部动作单元（AU）。在这项工作中，提出了两种特征级融合方法。第一种方法是基于一种人工视觉特征。另一种方法利用了深度卷积神经网络（CNN）学习的视觉特征。对于这两种方法，特征都是从视觉和音频通道中独立提取的，并且经过对齐以处理两个信号之间的时标差异和时移。这些时间对齐的特征通过特征级别融合进行集成，以进行AU识别。在新的视听AU编码数据集上的实验结果表明，在识别与语音相关的AU时，两种融合方法均优于其视觉对应方法。面部图像上的遮挡不会对音频通道造成影响，因此改进效果更为明显。

著录项

来源
《International journal of multimedia data engineering & management》 |2016年第1期|60-76|共17页
作者
Zibo Meng; Shizhong Han; Min Chen; Yan Tong;
展开▼
作者单位

University of South Carolina, Columbia, SC, USA;

University of South Carolina, Columbia, SC, USA;

University of Washington Bothell, Bothell, WA, USA;

University of South Carolina, Columbia, SC, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Action Units; Convolutional Neural Network; Facial Action Unit Recognition; Facial Activity; Feature-Level Information Fusion;

机译：行动单位;卷积神经网络面部动作单元识别;面部活动;功能级信息融合;
入库时间 2022-08-17 13:52:12

相似文献

外文文献
中文文献
专利

1. Texture and shape information fusion for facial expression and facial action unit recognition [J] . Kotsia I, Zafeiriou S, Pitas L Pattern Recognition: The Journal of the Pattern Recognition Society . 2008,第3期

机译：纹理和形状信息融合，用于面部表情和面部动作单元识别
2. Facial Expression Recognition Algorithm Based on Fusion of Transformed Multilevel Features and Improved Weighted Voting SVM [J] . Hao Meng, Fei Yuan, Yue Wu, Mathematical Problems in Engineering: Theory, Methods and Applications . 2021,第a期

机译：基于转换多级特征融合的面部表情识别算法及改进加权投票SVM
3. Facial expression recognition using feature level fusion [J] . Vanita Jain, Puneet Singh Lamba, Bhanu Singh, Journal of Discrete Mathematical Sciences and Cryptography . 2019,第2期

机译：使用特征级融合的面部表情识别
4. Feature Level Fusion for Bimodal Facial Action Unit Recognition [C] . Zibo Meng, Shizhong Han, Min Chen, IEEE International Symposium on Multimedia . 2015

机译：特征级融合用于双峰面部动作单元识别
5. Improving Speech-Related Facial Action Unit Recognition by Audiovisual Information Fusion [D] . Meng, Zibo. 2018

机译：视听信息融合改善与语音相关的面部动作单元识别
6. Facial Expression Recognition with Fusion Features Extracted from Salient Facial Areas [O] . Yanpeng Liu, Yibin Li, Xin Ma, 2017

机译：从显着面部区域提取融合特征的面部表情识别
7. Improving Speech Related Facial Action Unit Recognition by Audiovisual Information Fusion [O] . Meng, Zibo, Han, Shizhong, Liu, Ping, 2017

机译：通过视听提高言语相关的面部行动单元识别能力信息融合

Audiovisual Facial Action Unit Recognition using Feature Level Fusion

摘要

著录项

相似文献

相关主题

期刊订阅