首页> 外文会议>Annual IEEE/ACM International Symposium on Microarchitecture >LerGAN: A Zero-Free, Low Data Movement and PIM-Based GAN Architecture
【24h】

LerGAN: A Zero-Free, Low Data Movement and PIM-Based GAN Architecture

机译:LerGAN:无零,低数据移动和基于PIM的GAN架构

获取原文

摘要

As a powerful unsupervised learning method, Generative Adversarial Network (GAN) plays an important role in many domains such as video prediction and autonomous driving. It is one of the ten breakthrough technologies in 2018 reported in MIT Technology Review. However, training a GAN imposes three more challenges: (1) intensive communication caused by complex train phases of GAN, (2) much more ineffectual computations caused by special convolutions, and (3) more frequent off-chip memory accesses for exchanging inter-mediate data between the generator and the discriminator. In this paper, we propose LerGAN, a PIM-based GAN accelerator to address the challenges of training GAN. We first propose a zero-free data reshaping scheme for ReRAM-based PIM, which removes the zero-related computations. We then propose a 3D-connected PIM, which can reconfigure connections inside PIM dynamically according to dataflows of propagation and updating. Our proposed techniques reduce data movement to a great extent, avoiding I/O to become a bottleneck of training GANs. Finally, we propose LerGAN based on these two techniques, providing different levels of accelerating GAN for programmers. Experiments shows that LerGAN achieves 47.2×, 21.42× and 7.46× speedup over FPGA-based GAN accelerator, GPU platform, and ReRAM-based neural network accelerator respectively. Moreover, LerGAN achieves 9.75×, 7.68× energy saving on average over GPU platform, ReRAM-based neural network accelerator respectively, and has 1.04× energy consuming over FPGA-based GAN accelerator.
机译:作为一种强大的无监督学习方法,Generative Adversarial Network(GAN)在视频预测和自动驾驶等许多领域中发挥着重要作用。它是《麻省理工学院技术评论》(MIT Technology Review)报告的2018年十大突破性技术之一。但是,训练GAN会带来另外三个挑战:(1)由GAN的复杂训练阶段引起的密集通信;(2)由特殊卷积引起的无效计算;(3)交换芯片间内存的更频繁的片外内存访问在生成器和鉴别器之间中介数据。在本文中,我们提出了基于PIM的GAN加速器LerGAN,以解决训练GAN的挑战。我们首先为基于ReRAM的PIM提出零零数据重塑方案,该方案消除了零相关的计算。然后,我们提出了3D连接的PIM,它可以根据传播和更新的数据流动态地重新配置PIM内部的连接。我们提出的技术在很大程度上减少了数据移动,避免了I / O成为训练GAN的瓶颈。最后,我们基于这两种技术提出了LerGAN,为程序员提供了不同级别的加速GAN。实验表明,LerGAN的速度分别比基于FPGA的GAN加速器,GPU平台和基于ReRAM的神经网络加速器高47.2倍,21.42倍和7.46倍。此外,LerGAN分别在GPU平台和基于ReRAM的神经网络加速器上分别实现了9.75倍,7.68倍的节能,并且在基于FPGA的GAN加速器上的能耗为1.04倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号