首页> 外文会议>International Symposium on Advanced Parallel Processing Technologies >Compiling Optimization for Neural Network Accelerators
【24h】

Compiling Optimization for Neural Network Accelerators

机译:神经网络加速器的编译优化

获取原文
获取外文期刊封面目录资料

摘要

Nowadays artificial neural networks are one of the most common computational models among all the intelligent methods. To cope with the evergrowing scales of neural networks and the restrictions of system energy consumption, there comes out a bunch of neural network (NN) accelerators. However, owing to their dedicated architecture, programming on NN accelerators is different from general processors. In order to improve performance, it is necessary to use global structure information of NN model to optimize the compilation. In this paper, we introduce a series of layer-based compile optimizations for NN accelerators. From top to bottom, we define a type of computational graph, carrying necessary information such as relationship between layer nodes and data nodes. Then according to the pattern of a NN layer computation process, we apply an intra layer loop unrolling and pipelining, including fine-grained and coarse-grained two levels. Similarly, we apply layer fusion optimization based on our computational graph and abstract pipelining stage. After expanding pipelining stages of layers, we can reduce some redundant IO operations, which we call it layer elimination optimization. The experiment results show that with our proposed optimizations the inference process can achieve up to 1.34x speedup than not using fusion optimization.
机译:如今,人工神经网络已成为所有智能方法中最常见的计算模型之一。为了应对不断增长的神经网络规模和系统能耗的限制,出现了许多神经网络(NN)加速器。但是,由于其专用的体系结构,因此在NN加速器上进行编程不同于常规处理器。为了提高性能,有必要使用NN模型的全局结构信息来优化编译。在本文中,我们介绍了一系列针对NN加速器的基于层的编译优化。从上到下,我们定义了一种计算图,其中包含必要的信息,例如层节点与数据节点之间的关系。然后根据NN层计算过程的模式,应用层内循环展开和流水线处理,包括细粒度和粗粒度两个级别。同样,我们根据计算图和抽象流水线阶段应用图层融合优化。在扩展了层的流水线阶段之后,我们可以减少一些多余的IO操作,我们将其称为层消除优化。实验结果表明,采用我们提出的优化方法,与不使用融合优化方法相比,推理过程可以实现高达1.34倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号