首页> 外文会议>International Symposium on Computer Architecture >Synchroscalar: A Multiple Clock Domain, Power-Aware, Tile-Based Embedded Processor
【24h】

Synchroscalar: A Multiple Clock Domain, Power-Aware, Tile-Based Embedded Processor

机译:Synchroscalar:一个多时钟域,电源感知,基于Tile的嵌入式处理器

获取原文

摘要

We present Synchroscalar, a tile-based architecture for embedded processing that is designed to provide the flexibility of DSPs while approaching the power efficiency of ASICs. We achieve this goal by providing high parallelism and voltage scaling while minimizing control and communication costs. Specifically, Synchroscalar uses columns of processor tiles organized into statically-assigned frequency-voltage domains to minimize power consumption. Furthermore, while columns use SIMD control to minimize overhead, data-dependent computations can be supported by extremely flexible statically-scheduled communication between columns. We provide a detailed evaluation of Synchroscalar including SPICE simulation, wire and device models, synthesis of key components, cycle-level simulation, and compiler- and hand-optimized signal processing applications. We find that the goal of meeting, not exceeding, performance targets with data-parallel applications leads to designs that depart significantly from our intuitions derived from general-purpose microprocessor design. In particular, synchronous design and substantial global interconnect are desirable in the low-frequency, low-power domain. This global interconnect supports parallelization and reduces processor idle time, which are critical to energy efficient implementations of high bandwidth signal processing. Overall, Synchroscalar provides programmability while achieving power efficiencies within 8-30X of known ASIC implementations, which is 10-60X better than conventional DSPs. In addition, frequency-voltage scaling in Synchroscalar provides between 3-32% power savings in our application suite.
机译:我们呈现Synchroscalar,一种基于嵌入式处理的嵌入式处理,旨在提供DSP的灵活性,同时接近ASIC的功率效率。我们通过提供高行平行和电压缩放来实现这一目标,同时最小化控制和通信成本。具体地,Synchroscalar使用处理器块的列组织成静态分配的频率电压域,以最大限度地减少功耗。此外,当列使用SIMD控件以最小化开销时,可以通过极其灵活的列之间的静态预定通信来支持数据相关计算。我们提供了Synchroscalar的详细评估,包括Spice仿真,电线和设备模型,合成关键组件,循环级模拟和编译器和手动优化信号处理应用。我们发现会议的目标,不超过,具有数据并行应用的性能目标导致设计从我们从通用微处理器设计中得出的直觉中显着偏离。特别地,在低频,低功率域中是期望同步设计和实质的全局互连。此全局互连支持并行化并减少处理器空闲时间,这对于高带宽信号处理的节能实现至关重要。总体而言,同步roscalar提供可编程性,同时在已知的ASIC实现中实现电力效率,比传统DSP更好地为10-60倍。此外,Synchroscalar中的频率电压缩放提供了我们的应用程序套件中的3-32%的节能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号