首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >Facial Expression Recognition Based on Weighted-Cluster Loss and Deep Transfer Learning Using a Highly Imbalanced Dataset
【2h】

Facial Expression Recognition Based on Weighted-Cluster Loss and Deep Transfer Learning Using a Highly Imbalanced Dataset

机译:基于高度不平衡数据集的加权聚类损失和深度转移学习的面部表情识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Facial expression recognition (FER) is a challenging problem in the fields of pattern recognition and computer vision. The recent success of convolutional neural networks (CNNs) in object detection and object segmentation tasks has shown promise in building an automatic deep CNN-based FER model. However, in real-world scenarios, performance degrades dramatically owing to the great diversity of factors unrelated to facial expressions, and due to a lack of training data and an intrinsic imbalance in the existing facial emotion datasets. To tackle these problems, this paper not only applies deep transfer learning techniques, but also proposes a novel loss function called weighted-cluster loss, which is used during the fine-tuning phase. Specifically, the weighted-cluster loss function simultaneously improves the intra-class compactness and the inter-class separability by learning a class center for each emotion class. It also takes the imbalance in a facial expression dataset into account by giving each emotion class a weight based on its proportion of the total number of images. In addition, a recent, successful deep CNN architecture, pre-trained in the task of face identification with the VGGFace2 database from the Visual Geometry Group at Oxford University, is employed and fine-tuned using the proposed loss function to recognize eight basic facial emotions from the AffectNet database of facial expression, valence, and arousal computing in the wild. Experiments on an AffectNet real-world facial dataset demonstrate that our method outperforms the baseline CNN models that use either weighted-softmax loss or center loss.
机译:面部表情识别(FER)在模式识别和计算机视觉领域是一个具有挑战性的问题。卷积神经网络(CNN)在对象检测和对象分割任务中的最新成功展示了在构建基于深度CNN的自动FER模型中的希望。然而,在现实世界中,由于与面部表情无关的因素的多样性,以及缺乏训练数据和现有面部表情数据集内在的不平衡性,性能急剧下降。为了解决这些问题,本文不仅应用了深度转移学习技术,而且提出了一种称为加权簇损失的新型损失函数,该函数在微调阶段使用。具体而言,加权聚类损失函数通过学习每个情感类别的类别中心来同时提高类别内的紧凑性和类别间的可分离性。它还通过基于每个情感类别在图像总数中所占的比例为每个情感类别赋予权重,来考虑面部表情数据集中的不平衡。此外,采用了牛津大学视觉几何小组的VGGFace2数据库对人脸识别任务进行了预训练的,最近成功的深度CNN架构,并使用建议的损失函数对其进行了微调,以识别八种基本面部表情来自AffectNet数据库,可以在野外进行面部表情,化合价和唤醒计算。在AffectNet真实世界面部数据集上进行的实验表明,我们的方法优于使用加权softmax损失或中心损失的基线CNN模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号