首页> 外文会议>Design, Automation Test in Europe Conference Exhibition >Structural design optimization for deep convolutional neural networks using stochastic computing
【24h】

Structural design optimization for deep convolutional neural networks using stochastic computing

机译:基于随机计算的深度卷积神经网络的结构设计优化

获取原文

摘要

Deep Convolutional Neural Networks (DCNNs) have been demonstrated as effective models for understanding image content. The computation behind DCNNs highly relies on the capability of hardware resources due to the deep structure. DCNNs have been implemented on different large-scale computing platforms. However, there is a trend that DCNNs have been embedded into light-weight local systems, which requires low power/energy consumptions and small hardware footprints. Stochastic Computing (SC) radically simplifies the hardware implementation of arithmetic units and has the potential to satisfy the small low-power needs of DCNNs. Local connectivities and down-sampling operations have made DCNNs more complex to be implemented using SC. In this paper, eight feature extraction designs for DCNNs using SC in two groups are explored and optimized in detail from the perspective of calculation precision, where we permute two SC implementations for inner-product calculation, two down-sampling schemes, and two structures of DCNN neurons. We evaluate the network in aspects of network accuracy and hardware performance for each DCNN using one feature extraction design out of eight. Through exploration and optimization, the accuracies of SC-based DCNNs are guaranteed compared with software implementations on CPU/GPU/binary-based ASIC synthesis, while area, power, and energy are significantly reduced by up to 776x, 190x, and 32835x.
机译:深度卷积神经网络(DCNN)已被证明是理解图像内容的有效模型。由于深度的结构,DCNN背后的计算高度依赖于硬件资源的能力。 DCNN已在不同的大型计算平台上实现。但是,趋势是DCNN已嵌入到轻量级本地系统中,这要求低功耗/能耗和较小的硬件占用空间。随机计算(SC)从根本上简化了算术单元的硬件实现,并有可能满足DCNN的低功耗需求。本地连接和下采样操作使DCNN变得更加复杂,无法使用SC来实现。本文从计算精度的角度详细探讨和优化了两组使用SC的DCNN的八种特征提取设计,其中我们对用于内积计算的两种SC实现,两种下采样方案以及两种结构进行了优化。 DCNN神经元。我们使用八个特征提取设计中的每个DCNN,在网络准确性和硬件性能方面评估网络。通过探索和优化,与基于CPU / GPU /基于二进制ASIC合成的软件实现相比,可确保基于SC的DCNN的准确性,而面积,功耗和能源则显着减少了776x,190x和32835x。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号