首页> 外文会议>IEEE International Workshop on Signal Processing Systems >Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform
【24h】

Architecture Design of Convolutional Neural Networks for Face Detection on an FPGA Platform

机译:FPGA平台上用于面部检测的卷积神经网络架构设计

获取原文
获取外文期刊封面目录资料

摘要

Convolutional neural networks (CNNs) have emerged to provide powerful discriminative capability, especially in the field of image recognition and object detection. However, their massive computation requirements, storage and memory accesses make them hard to be deployed on mobile or embedded systems. In this paper, a few optimizations for a CNN cascade face detection algorithm are proposed to increase throughput while minimizing computation, storage and bandwidth requirement under power constraints. First, the first net of the CNN cascade is converted to a fully convolutional network to reduce 83% of the computation. Second, network retraining is applied to quantize the model parameters from 32-bit floating-point to 2-bit fixed-point, resulting in 93.75% less parameter memory size. Finally, a CNN accelerator is implemented on a Xilinx ZYNQ FPGA board.
机译:卷积神经网络(CNN)的出现提供了强大的判别能力,尤其是在图像识别和目标检测领域。但是,它们庞大的计算需求,存储和内存访问使其难以部署在移动或嵌入式系统上。本文针对CNN级联人脸检测算法提出了一些优化措施,以提高吞吐量,同时在功率限制下将计算,存储和带宽需求降至最低。首先,将CNN级联的第一个网络转换为完全卷积网络,以减少83%的计算。其次,应用网络再训练对模型参数从32位浮点到2位定点进行量化,从而减少了93.75%的参数存储大小。最后,在Xilinx ZYNQ FPGA板上实现了CNN加速器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号