首页> 外文期刊>Industrial Informatics, IEEE Transactions on >Design and Implementation of a Pipelined Datapath for High-Speed Face Detection Using FPGA
【24h】

Design and Implementation of a Pipelined Datapath for High-Speed Face Detection Using FPGA

机译:使用FPGA的高速人脸检测流水线数据路径的设计与实现

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents design and implementation of a pipelined datapath for real-time face detection using cascades of boosted classifiers. We propose following methods: symmetric image downscaling, classifier sharing, and cascade merging, to achieve the desired processing speed and area efficiency. First, an image pyramid with 16 levels is generated from the input image to simultaneously detect faces with different scales. The downscaled images are then transferred to the first stage of the cascade that is shared between the corresponding image pairs based on the pixel validity of the symmetric image pyramid. The last method exploits the different hit ratios of the cascade stages. We use a tree-structured cascade of classifiers since most of the nonface elements are eliminated during the early stages of the classifier. The use of a synthesis tool confirms that the proposed design reduces resource utilization by one-eighth without accuracy loss, compared to the fully parallelized implementation of the same algorithm. We implemented the proposed hardware architecture on a Xilinx Virtex-5 LX330 FPGA. The indicative throughput is 307 frames/s irrespective of the number of faces in the scene for standard VGA (640 $times$ 480) images with an operating frequency of 125.59 MHz. We may ensure that face detection results are generated at each clock cycle after the initial pipeline delay, using this fully pipelined datapath for tree-structured cascade classifiers.
机译:本文介绍了使用增强分类器级联进行实时人脸检测的流水线数据路径的设计和实现。我们提出以下方法:对称图像缩小,分类器共享和级联合并,以实现所需的处理速度和面积效率。首先,从输入图像生成具有16个级别的图像金字塔,以同时检测具有不同比例的面部。然后,根据对称图像金字塔的像素有效性,将缩小后的图像传输到级联的第一级,该级在相应的图像对之间共享。最后一种方法利用了级联阶段的不同命中率。我们使用树结构的分类器级联,因为在分类器的早期就消除了大多数非面部元素。与相同算法的完全并行实现相比,使用综合工具证实了所提出的设计将资源利用率降低了八分之一,而没有精度损失。我们在Xilinx Virtex-5 LX330 FPGA上实现了建议的硬件架构。对于工作频率为125.59 MHz的标准VGA(640 x 480)图像,无论场景中的面孔数量如何,指示吞吐量为307帧/ s。我们可以使用针对树结构级联分类器的全流水线数据路径,确保在初始流水线延迟后的每个时钟周期生成人脸检测结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号