首页> 外文期刊>Signal Processing. Image Communication: A Publication of the the European Association for Signal Processing >Context prior-based with residual learning for face detection: A deep convolutional encoder-decoder network
【24h】

Context prior-based with residual learning for face detection: A deep convolutional encoder-decoder network

机译:基于上下文基于群体检测的残差学习:深度卷积编码器解码器网络

获取原文
获取原文并翻译 | 示例
       

摘要

In the field of security, faces are usually blurry, occluded, diverse pose and small in the image captured by an outdoor surveillance camera, which is affected by the external environment such as the camera pose and range, weather conditions, etc. It can be described as a problem of hard face detection in natural images. To solve this problem, we propose a deep convolutional neural network named feature hierarchy encoder-decoder network (FHEDN). It is motivated by two observations from contextual semantic information and the mechanism of multi-scale face detection. The proposed network is a scale-variant style architecture and single stage, which are composed of encoder and decoder subnetworks. Based on the assumption that contextual semantic information around face being auxiliary to detect faces, we introduce a residual mechanism to fuse context prior-based information into face feature and formulate the learning chain to train each encoder-decoder pair. In addition, we discuss some important factors in implement details such as the distribution of training dataset, the scale of feature hierarchy, and anchor box size, etc. They have some impact on the detection performance of the final network. Compared with some state-of-the-art algorithms, our method achieves promising performance on the popular benchmarks including AFW, PASCAL FACE, FDDB, and WIDER FACE. Consequently, the proposed approach can be efficiently implemented and routinely applied to detect faces with severe occlusion and arbitrary pose variations in unconstrained scenes. Our code and results are available on https://github.com/zzxcoder/EvaluationFHEDN.
机译:在安全领域,面孔通常是模糊的,遮挡,不同的姿势和由室外监视摄像机捕获的图像中的小,这些摄像机受到外部环境的影响,如相机姿势和范围,天气条件等。它可以是描述为自然图像中的硬面检测问题。为了解决这个问题,我们提出了一个名为特征层次编码器 - 解码器网络(FHEDN)的深度卷积神经网络。它具有来自上下文语义信息的两个观察和多尺度面部检测机制。所提出的网络是一种级别变体的样式架构和单级,由编码器和解码器子网组成。基于对辅助辅助辅助检测面的面部的上下文语义信息的假设,我们将基于背景信息的熔断机制引入面部特征并制定学习链以训练每个编码器 - 解码器对。此外,我们讨论了实施细节中的一些重要因素,例如培训数据集的分发,特征层次的规模和锚箱大小等。它们对最终网络的检测性能产生了一些影响。与某些最先进的算法相比,我们的方法在包括AFW,Pascal Face,FDDB和更广泛的脸上的流行基准上实现了有希望的性能。因此,可以有效地实现所提出的方法,并常规地应用于检测具有严重遮挡和无约束场景的任意姿态变化的面。我们的代码和结果可在https://github.com/zzxcoder/evaliondfhedn上获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号