Flexion: A Quantitative Metric for Flexibility in DNN Accelerators

Kwon Hyoukjun; Pellauer Michael; Parashar Angshuman; Krishna Tushar

Abstract

Dataflow and tile size choices, which we collectively refer to as mappings, dictate the efficiency (i.e., latency and energy) of DNN accelerators. Rapidly evolving DNN models are one of the major challenges for DNN accelerators, since the optimal mapping depends heavily on the layer shape and size. To maintain high efficiency across multiple DNN models, flexible accelerators that can support multiple mappings have emerged. However, we currently lack a metric to evaluate accelerator flexibility and quantitatively compare accelerators' capability to run different mappings. In this letter, we formally define the concept of flexibility in DNN accelerators and propose flexion (flexibility fraction), a quantitative metric of mapping flexibility on DNN accelerators. We codify the formalism we construct and evaluate the flexibility of accelerators based on Eyeriss, NVDLA, and TPUv1. We show that an Eyeriss-like accelerator is 2.2x and 17.0x more flexible (i.e., capable of running more mappings) than NVDLA- and TPUv1-based accelerators on selected ResNet-50 and MobileNetV2 layers. This is the first work to enable such a quantitative comparison of the flexibility of accelerators.
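This record does not reproduce the letter's formal definition of flexion, so the following is only a toy illustration of the general idea of a "fraction of runnable mappings". Everything in it is an assumption made for illustration: the mapping space (one tile size per dimension, restricted to divisors), the two hypothetical accelerators `rigid` and `flexible`, and their support rules. It is not the authors' formalism and does not model Eyeriss, NVDLA, or TPUv1.

```python
from itertools import product


def divisors(n):
    """Return all positive divisors of n (candidate tile sizes for one dimension)."""
    return [d for d in range(1, n + 1) if n % d == 0]


def mapping_space(layer_dims):
    """Toy mapping space: every combination of one divisor per layer dimension."""
    return list(product(*(divisors(n) for n in layer_dims)))


def supported_fraction(layer_dims, supports):
    """Fraction of the toy mapping space that a given accelerator can run."""
    space = mapping_space(layer_dims)
    supported = [tile for tile in space if supports(tile)]
    return len(supported) / len(space)


# Hypothetical accelerator that can only run one fixed 4x4 tile.
def rigid(tile):
    return tile == (4, 4)


# Hypothetical accelerator that can run any tile fitting a 16-element buffer.
def flexible(tile):
    return tile[0] * tile[1] <= 16


layer = (8, 8)  # toy two-dimensional layer

rigid_fraction = supported_fraction(layer, rigid)
flexible_fraction = supported_fraction(layer, flexible)
print(rigid_fraction, flexible_fraction, flexible_fraction / rigid_fraction)
```

In this toy setting the ratio of the two fractions plays the role of an "N times more flexible" comparison; the figures reported in the abstract come from the letter's own definition and evaluation, not from a model like this one.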

Bibliographic record

Source
IEEE Computer Architecture Letters, 2021, Issue 1, pp. 1-4 (4 pages)
Authors
Kwon Hyoukjun; Pellauer Michael; Parashar Angshuman; Krishna Tushar
Author affiliations

Georgia Inst Technol Atlanta GA 30332 USA;

NVIDIA Westford MA 01886 USA;

NVIDIA Westford MA 01886 USA;

Georgia Inst Technol Atlanta GA 30332 USA;

Indexing information
Original format: PDF
Language: English
Keywords
Hardware; Shape; Measurement; Two dimensional displays; Parallel processing; Adders; Tensors; DNN accelerator; dataflow


