首页> 外文期刊>Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on >A Reconfigurable Source-Synchronous On-Chip Network for GALS Many-Core Platforms
【24h】

A Reconfigurable Source-Synchronous On-Chip Network for GALS Many-Core Platforms

机译:适用于GALS多核平台的可重构源同步片上网络

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a globally-asynchronous locally-synchronous (GALS)-compatible circuit-switched on-chip network that is well suited for use in many-core platforms targeting streaming digital signal processing and embedded applications which typically have a high degree of task-level parallelism among computational kernels. Inter-processor communication is achieved through a simple yet effective reconfigurable source-synchronous network. Interconnect paths between processors can sustain a peak throughput of one word per cycle. A theoretical model is developed for analyzing the performance of the network. A 65 nm complementary metal-oxide–semiconductor GALS chip utilizing this network was fabricated which contains 164 programmable processors, three accelerators and three shared memory modules. For evaluating the efficiency of this platform, a complete 802.11a wireless local area network baseband receiver was implemented. It has a real-time throughput of 54 Mb/s with all processors running at 594 MHz and 0.95-V, and consumes an average of 174.8 mW with 12.2 mW (or 7.0%) dissipated by its interconnect links and switches. With the chip's dual supply voltages set at 0.95-V and 0.75-V, and individual processors' oscillators operating at workload-based optimal frequencies, the receiver consumes 123.2 mW, which is a 29.5% reduction in power. Measured power consumption values from the chip are within 2–5% of the estimated values.
机译:本文介绍了一种与全局异步本地同步(GALS)兼容的电路交换片上网络,该网络非常适合用于针对流数字信号处理和嵌入式应用(通常具有高度任务-计算内核之间的级别并行性。处理器间的通信是通过简单但有效的可重新配置源同步网络实现的。处理器之间的互连路径可以维持每个周期一个字的峰值吞吐量。建立了用于分析网络性能的理论模型。利用该网络制造了65 nm互补金属氧化物半导体GALS芯片,其中包含164个可编程处理器,三个加速器和三个共享存储模块。为了评估该平台的效率,实施了完整的802.11a无线局域网基带接收器。它的实时吞吐量为54 Mb / s,所有处理器均以594 MHz和0.95-V的频率运行,平均消耗174.8 mW,其互连链路和交换机耗散了12.2 mW(或7.0%)。芯片的双电源电压分别设置为0.95V和0.75V,并且各个处理器的振荡器以基于工作负载的最佳频率工作,接收器功耗为123.2 mW,功耗降低了29.5%。芯片测得的功耗值在估计值的2%到5%之内。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号