31.3 A Compute-Adaptive Elastic Clock-Chain Technique with Dynamic Timing Enhancement for 2D PE-Array-Based Accelerators

机译：31.3一种基于2D PE阵列的加速器的具有动态时序增强功能的计算自适应弹性时钟链技术

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Dynamic timing error detection and correction techniques, e.g. razor flops, have been previously applied to microprocessors to exploit the dynamic timing margin within pipelines [1]. Adaptive clock techniques have also been adopted to enhance microprocessor performance, such as schemes to reduce the timing guardband for on-chip supply droops [2]–[3] or to exploit instruction-level dynamic timing slack [4]. Recently, 2D PE array-based accelerators have been developed for machine learning (ML) applications. Many efforts have been dedicated to improve the energy efficiency of such accelerators, e.g. DVFS management for the DNN under various bit precision [5]. A razor technique was also applied to a 1D 8-MAC pipelined accelerator to explore timing error tolerance [6]. Despite the above efforts, a fine-grained dynamic-timing-based technique has not been implemented within a large 2D array based ML accelerator. One main challenge comes from the large amount of compute-timing bottlenecks within the 2D array, which will continuously trigger critical path adaptation or pipeline stalls, nullifying the benefits of previous dynamic-timing techniques [4], [6]. To deal with the difficulty, we propose the following solutions. A local in-situ compute-detection scheme was applied to anticipate upcoming timing variations within the PE unit and guide both instruction-based and operand-based adaptive clock management. To loosen the stringent timing requirements in a large 2D PE array, an “elastic” clock-chain technique using multiple loosely synchronized clock domains was developed enabling dynamic-timing enhancement through clusters of PE units.

机译：动态定时错误检测和纠正技术，例如剃须刀触发器曾被应用于微处理器，以利用流水线内的动态时序余量[1]。自适应时钟技术也已被采用来增强微处理器性能，例如减少片上电源下降[2]-[3]的时序保护带或利用指令级动态时序松弛[4]的方案。最近，已经开发了基于2D PE阵列的加速器，用于机器学习（ML）应用程序。为了提高这种加速器的能量效率，已经付出了许多努力，例如，在美国，在各种比特精度下，DNN的DVFS管理[5]。剃刀技术还应用于一维8-MAC流水线加速器，以探索时序误差容限[6]。尽管做出了上述努力，但尚未在基于大型2D数组的ML加速器中实现基于细粒度动态定时的技术。一个主要挑战来自2D阵列中大量的计算时序瓶颈，这些瓶颈将持续触发关键路径自适应或流水线停顿，从而使先前的动态时序技术的优势无效[4]，[6]。为了解决这个困难，我们提出以下解决方案。应用本地原位计算检测方案来预测PE单元内即将出现的时序变化，并指导基于指令和基于操作数的自适应时钟管理。为了放宽大型2D PE阵列中严格的时序要求，开发了使用多个松散同步时钟域的“弹性”时钟链技术，从而可以通过PE单元集群来增强动态时序。

著录项

来源
《IEEE International Solid- State Circuits Conference》|2020年|482-484|共3页
会议地点
作者
Tianyu Jia; Yuhao Ju; Jie Gu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clocks; Two dimensional displays; Synchronization; Pipelines; Voltage measurement; Adaptive arrays;

机译：时钟;二维显示;同步;管道;电压测量;自适应阵列;
入库时间 2022-08-26 14:35:49

相似文献

外文文献
中文文献
专利

1. A Dynamic Timing Enhanced DNN Accelerator With Compute-Adaptive Elastic Clock Chain Technique [J] . Jia Tianyu, Ju Yuhao, Gu Jie IEEE Journal of Solid-State Circuits . 2021,第1期

机译：具有计算自适应弹性时钟链技术的动态定时增强DNN加速器
2. Comparison of the Timing of Hepatic Arterial Phase and Image Quality Using Test-Bolus and Bolus-Tracking Techniques in Gadolinium-Ethoxybenzyl-Diethylenetriamine Pentaacetic Acid-Enhanced Hepatic Dynamic Magnetic Resonance Imaging [J] . Iyama Yuji, Nakaura Takeshi, Yokoyama Koichi, Journal of computer assisted tomography . 2017,第4期

机译：使用试验推注和推注 - 乙氧基 - 二亚乙基三胺五乙酸戊酸 - 增强肝动力磁共振成像的试验推注和图像质量定时比较肝动脉相和图像质量的定时
3. Wide dynamic range FPGA-based TDC for monitoring a trigger timing distribution system in linear accelerators [J] . T. Suwada, F. Miyahara, K. Furukawa, Nuclear Instruments & Methods in Physics Research. Section A, Accelerators, Spectrometers, Detectors and Associated Equipment . 2015,第juna21期

机译：基于FPGA的宽动态范围TDC，用于监视线性加速器中的触发定时分配系统
4. Asymptotic 2D-modelling for dynamics of linear elastic thick shells [C] . R. Nzengwa Shell Structures: Theory and Applications Conference . 2006

机译：线性弹性厚壳动力学的渐近2D建模
5. Ultra-dynamic Fine-grained Power and Clock Management Techniques for Microprocessors and Machine Learning Accelerators [D] . Jia, Tianyu . 2019

机译：用于微处理器和机器学习加速器的超动态细粒型电力和时钟管理技术
6. Comparison of the Timing of Hepatic Arterial Phase and Image Quality Using Test-Bolus and Bolus-Tracking Techniques in Gadolinium–Ethoxybenzyl–Diethylenetriamine Pentaacetic Acid–Enhanced Hepatic Dynamic Magnetic Resonance Imaging [O] . Yuji Iyama, Takeshi Nakaura, Koichi Yokoyama, -1

机译：使用Test-乙氧基苄基-二亚乙基三胺五乙酸-增强肝动态磁共振成像技术中的测试团和团追踪技术比较肝动脉期的时间和图像质量
7. Comparison of 3D Volumetric Subtraction Technique and 2D Dynamic Contrast Enhancement Technique in the Evaluation of Contrast Enhancement for Diagnosing Cushing's Disease [O] . Yae Won Park, Ha Yan Kim, Ho-Joon Lee, 2018

机译：3D体积减法技术与2D动态对比增强技术在诊断缓冲疾病中对比增强中的评价中的比较

31.3 A Compute-Adaptive Elastic Clock-Chain Technique with Dynamic Timing Enhancement for 2D PE-Array-Based Accelerators

摘要

著录项

相似文献

相关主题

期刊订阅