首页> 外文OA文献 >Face Detection Using Single Cascade of Customized Features Discriminators
【2h】

Face Detection Using Single Cascade of Customized Features Discriminators

机译:使用级联的自定义特征鉴别器进行人脸检测

摘要

Face detection has become an important and helpful tool for camera and video processing. Useful human-computer interaction (HCI) applications such as drivers assistant system that prevents accidents and saves pedestrian lives when drivers attention is absent, needs a head pose estimator. A head pose estimator cannot function without face detector.There has been a considerable amount of literature to address the problem. The most significant results obtained on uptight frontal face detection which is a sub-problem of a larger problem of face detection. There are other types of sub-problems that has been studied with least significant advancements that the upright frontal face detection had accomplished. The problem of multi-pose detection is still under study and it remains hard.A solution to this large scale of the problem (multi-pose face detection) is critical in head pose accuracy. This thesis suggests a multi-pose face detection algorithm for uncontrolled environments. The detector is designed to be used in building head pose estimator for a human-computer interaction application. The observed design of the detector has to implement a cascade of classifiers. Each classifier has to address at least one certain area of the problem. The design have to maintain speed and an acceptable detection rate.These requirements can be satisfied by constructing the cascade to implement fast and simple classifiers at first stages of the cascade. A novel use of the integral image as a fast filter was invented to be placed at the start of the detection process. Included in the cascade, classifiers that are trained on special designed features aimed to solve part of the problem. One special unique classifier is a data mining based classifier that uses a modified version of the Maximal Frequent Itemset Algorithm (MAFIA) [2] for feature extraction.Special features classifiers use the extracted facial features information extracted from a new knowledge-based classifier/filter that was created with the capacity to locate to an acceptable ac- curacy the location of eyes, mouth and nose using a suite of approaches including discreet local minima and geometric measures. The extracted facial features were used to estimate head pose and extract classifier features accordingly to enhance detection rates.A cascade of classifiers based on fast and simple contrast features was used to refine and speed up the detection process. To further improve speed some components were parallelized. As an attempt to overcome some of the fundamental challenges of face detection, lighting correction and noise reduction were implemented based on the information extracted from images.Results are reported on the FDDB [12] benchmark showed 5.22% detection rate with 2000 false positives while OpenCV implementation of Viola-Jones [19] face detector showed 65.92 detection rate with 2010 false positives. This comparison is flawed; because Viola-Jones is an upright face detector and even though FDDB [12] includes a number on non-frontal faces and profiles the majority of the faces are frontal. The two solutions address two different problems that reflect large differences in difficulty.A standard benchmark testset and evaluation system as FDDB [12] benchmark and com- parable results from the same class of the problem at the time of writing this document was not available. The key points to building good face detector in general are; (1) resolving speed issues using fast techniques (e.g. integral image) at the start of the cascade and a powerful design, (2) using a huge number of different strong and weak features, and (3) eliminating variations (i.e. pose , noise and lighting variations). The algorithm was also tested on MIT+CMU upfront faces testset and reported 43.56% detection rate with 504 false positives.
机译:人脸检测已成为相机和视频处理的重要且有用的工具。有用的人机交互(HCI)应用程序(例如,驾驶员辅助系统)可以防止事故发生,并在驾驶员缺乏注意力时挽救行人的生命,因此需要头部姿势估计器。没有面部检测器,头部姿势估计器将无法运行。已有大量文献解决了这个问题。在正面挺直的面部检测中获得的最显着结果是较大的面部检测问题的一个子问题。已经研究了其他类型的子问题,这些问题的最显着进展是直立的额头面部检测已经完成。多姿势检测的问题仍在研究中,并且仍然很困难。解决这一大规模问题(多姿势面部检测)对于头部姿势精度至关重要。本文提出了一种用于不受控制的环境的多姿势人脸检测算法。该检测器设计用于人机交互应用中的建筑物头部姿态估计器。检测器的观察设计必须实现级联的分类器。每个分类器必须解决问题的至少一个特定区域。设计必须保持速度和可接受的检测率。通过构建级联以在级联的第一阶段实现快速简单的分类器,可以满足这些要求。发明了将积分图像用作快速滤波器的新颖方法,以将其放置在检测过程的开始。级联中包括分类器,这些分类器经过特殊设计,旨在解决部分问题。一个特殊的独特分类器是基于数据挖掘的分类器,它使用最大频繁项集算法(MAFIA)[2]的修改版进行特征提取。特殊特征分类器使用从新的基于知识的分类器/过滤器中提取的提取的面部特征信息通过使用包括谨慎的局部最小值和几何度量在内的一系列方法,可以将眼睛,嘴巴和鼻子的位置定位到可接受的精度。提取的面部特征被用于估计头部姿势,并相应地提取分类器特征以提高检测率。基于快速和简单的对比度特征的分类器级联被用于完善和加速检测过程。为了进一步提高速度,一些组件被并行化。为了克服人脸检测的一些基本挑战,基于从图像中提取的信息实施了灯光校正和降噪。FDDB[12]基准报告的结果显示,在使用OpenCV时,检测率为5.22%,错误率为2000。 Viola-Jones [19]面部检测器的实现显示65.92的检测率与2010年的假阳性。这种比较是有缺陷的。因为Viola-Jones是一个直立的面部检测器,并且即使FDDB [12]在非正面的面部和轮廓上都包含一个数字,大多数面部还是正面的。这两个解决方案解决了反映难度差异很大的两个不同问题。在编写本文档时,尚无标准的基准测试集和评估系统(如FDDB [12]基准)和相同问题类别的可比较结果。通常,构建好的人脸检测器的关键在于: (1)在级联开始时使用快速技术(例如,积分图像)和强大的设计来解决速度问题,(2)使用大量不同的强项和弱项,以及(3)消除变化(例如,姿势,噪声和灯光变化)。该算法还在MIT + CMU前脸测试集上进行了测试,并报告了43.56%的检测率和504个假阳性。

著录项

  • 作者

    Hammuda Ayman Omar;

  • 作者单位
  • 年度 2012
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号