首页> 外文会议>Design, Automation and Test in Europe Conference and Exhibition >WavePro: Clock-less Wave-Propagated Pipeline Compiler for Low-Power and High-Throughput Computation
【24h】

WavePro: Clock-less Wave-Propagated Pipeline Compiler for Low-Power and High-Throughput Computation

机译:WavePro:用于低功耗和高通量计算的无时钟波传播管道编译器

获取原文

摘要

Clock-less Wave-Propagated Pipelining is a long-known approach to achieve high-throughput without the over-head of costly sampling registers. However, due to many design challenges, which have only increased with technology scaling, this approach has never been widely accepted and has generally been limited to small and very specific demonstrations. This paper addresses this barrier by presenting WavePro, a generic and scalable algorithm, capable of skew balancing any combinatorial logic netlist for the application of wave-pipelining. The algorithm was implemented in the WavePro Compiler automation utility, which interfaces with industry delays extraction and standard timing analysis tools to produce a sign-off quality result. The utility is demonstrated upon a dot-product accelerator in a 65 nm CMOS technology, using a vendor-provided standard cell library and commercial timing analysis tools. By reducing the worst-case output skew by over 70%, the test case example was able to achieve equivalent throughput of an 8-staged sequentially pipelined implementation with power savings of almost 3×.
机译:较小的波浪传播的流水线是一种长期以来的方法,可以在没有昂贵的采样寄存器的过头的情况下实现高吞吐量。然而,由于许多设计挑战,这才能与技术缩放增加,因此从未被广泛接受并且通常仅限于小而非常具体的演示。本文通过呈现WavePro,通用和可扩展算法,能够偏斜任何组合逻辑网列表来解决该障碍,以偏斜任何组合逻辑网,以应用波管线。该算法在WavePro编译器自动化实用程序中实现,该算法与行业延迟提取和标准时序分析工具的接口,以产生签字质量结果。使用供应商提供的标准单元库和商业时序分析工具,在65nm CMOS技术中的DOT-Product Accelerator上证明了该实用程序。通过减少70%以上的最坏情况输出偏斜,测试壳体示例能够实现8分阶段顺序流水线实现的等效吞吐量,其功率节省差不多3倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号