首页> 外文会议>Asia and South Pacific Design Automation Conference >Towards Design Methodology of Efficient Fast Algorithms for Accelerating Generative Adversarial Networks on FPGAs
【24h】

Towards Design Methodology of Efficient Fast Algorithms for Accelerating Generative Adversarial Networks on FPGAs

机译:致力于在FPGA上加速生成对抗网络的高效快速算法的设计方法论

获取原文

摘要

Generative adversarial networks (GANs) have shown excellent performance in image and speech applications. GANs create impressive data primarily through a new type of operator called deconvolution (DeConv) or transposed convolution (Conv). To implement the DeConv layer in hardware, the state-of-the-art accelerator reduces the high computational complexity via the DeConv-to-Conv conversion and achieves the same results. However, there is a problem that the number of filters increases due to this conversion. Recently, Winograd minimal filtering has been recognized as an effective solution to improve the arithmetic complexity and resource efficiency of the Conv layer. In this paper, we propose an efficient Winograd DeConv accelerator that combines these two orthogonal approaches on FPGAs. Firstly, we introduce a new class of fast algorithm for DeConv layers using Winograd minimal filtering. Since there are regular sparse patterns in Winograd filters, we further amortize the computational complexity by skipping zero weights. Secondly, we propose a new dataflow to prevent resource underutilization by reorganizing the filter layout in the Winograd domain. Finally, we propose an efficient architecture for implementing Winograd DeConv by designing the line buffer and exploring the design space. Experimental results on various GANs show that our accelerator achieves up to 1.78×~8.38× speedup over the state-of-the-art DeConv accelerators.
机译:生成对抗网络(GAN)在图像和语音应用中表现出出色的性能。 GAN主要通过一种称为反卷积(DeConv)或转置卷积(Conv)的新型运算符来创建令人印象深刻的数据。为了在硬件中实现DeConv层,最新的加速器通过DeConv到Conv的转换降低了高计算复杂度,并获得了相同的结果。但是,由于该转换,存在滤波器数量增加的问题。最近,Winograd最小过滤已被视为提高Conv层的算术复杂性和资源效率的有效解决方案。在本文中,我们提出了一种高效的Winograd DeConv加速器,该加速器在FPGA上结合了这两种正交方法。首先,我们介绍了一种使用Winograd最小滤波的DeConv图层的新型快速算法。由于Winograd过滤器中存在规则的稀疏模式,因此我们通过跳过零权重来进一步摊销计算复杂性。其次,我们提出了一个新的数据流,以通过在Winograd域中重新组织过滤器布局来防止资源利用不足。最后,我们通过设计行缓冲区并探索设计空间,提出了一种用于实现Winograd DeConv的有效架构。在各种GAN上的实验结果表明,与最先进的DeConv加速器相比,我们的加速器可实现高达1.78×〜8.38×的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号