首页> 外文期刊>IEEE computer architecture letters >Flexion: A Quantitative Metric for Flexibility in DNN Accelerators
【24h】

Flexion: A Quantitative Metric for Flexibility in DNN Accelerators

机译:屈曲:DNN加速器灵活性的定量度量

获取原文
获取原文并翻译 | 示例
           

摘要

Dataflow and tile size choices, which we collectively refer to as mappings, dictate the efficiency (i.e., latency and energy) of DNN accelerators. Rapidly evolving DNN models is one of the major challenges for DNN accelerators since the optimal mapping heavily depends on the layer shape and size. To maintain high efficiency across multiple DNN models, flexible accelerators that can support multiple mappings have emerged. However, we currently lack a metric to evaluate accelerator flexibility and quantitatively compare their capability to run different mappings. In this letter, we formally define the concept of flexibility in DNN accelerators and propose flexion (flexibility fraction), flexion, which is a quantitative metric of mapping flexibility on DNN accelerators. We codify the formalism we construct and evaluate the flexibility of accelerators based on Eyeriss, NVDLA, and TPUv1. We show that Eyeriss-like accelerator is 2.2x and 17.0x more flexible (i.e., capable of running more mappings) than NVDLA and TPUv1-based accelerators on selected ResNet-50 and MobileNetV2 layers. This work is the first work to enable such a quantitative comparison of the flexibility of accelerators.
机译:DataFlow和瓷砖大小选择,我们统称为映射,决定了DNN加速器的效率(即延迟和能量)。快速发展的DNN模型是DNN加速器的主要挑战之一,因为最佳映射大量取决于层形状和尺寸。为了在多个DNN型号中保持高效率,可以出现柔性加速器,可以支持多个映射。但是,我们目前缺乏评估加速器的灵活性,并定量比较它们运行不同映射的能力。在这封信中,我们正式定义了DNN加速器的灵活性概念,并提出了屈曲(灵活性分数),屈曲,这是DNN加速器上映射灵活性的定量度量。我们编纂了我们构建的形式主义,并根据Eyeriss,NVDLA和TPUV1评估加速器的灵活性。我们表明Eyeriss的加速器是2.2x和17.0x更灵活(即,能够运行更多映射),而不是所选Reset-50和MobileNetv2层上的基于NVDLA和基于TPUV1的加速器。这项工作是第一个能够实现加速器灵活性的定量比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号