...
首页> 外文期刊>Journal of Low Power Electronics >Efficient Evaluation of Power/Area/Latency Design Trade-Offs for Coarse-Grained Reconfigurable Processor Arrays
【24h】

Efficient Evaluation of Power/Area/Latency Design Trade-Offs for Coarse-Grained Reconfigurable Processor Arrays

机译:粗粒度可重构处理器阵列的功率/面积/延迟设计折衷方案的高效评估

获取原文
获取原文并翻译 | 示例

摘要

The presented evaluation framework allows extremely fast but still accurate power, area, and latency characterization of different design alternatives in a multidimensional design space of highly parameterized coarse-grained reconfigurable processor arrays. For the first time, we propose to use a relational database system, managing table-based, probabilistic macro-models, constructed with the help of a new non-uniform parameter sampling technique for the average power estimation of corresponding processor arrays on the architectural level. This leads to power estimation speeds in the milliseconds range within 10% estimation error compared to a state-of-the-art commercial gate-level post-layout power estimator. Furthermore, our approach fully accounts for such important power reduction techniques, like clock gating and operand isolation, which are commonly ignored otherwise. The feasibility and accuracy were tested in several case study implementations in a commercial 90 nm standard cell library. Experimental results show a superior scalability of the proposed technique: heterogeneous 100-core coarse-grained processor array with ≈0.5×10~6 logic gates circuit complexity, implementing a signal processing algorithm, can be analyzed for power and area within less than a minute on a standard consumer PC. Since currently there exists no published architecture-level power/area estimation framework for coarse-grained, software-programmable architectures, our work tries to address this shortcoming.
机译:所提出的评估框架允许在高度参数化的粗粒度可重构处理器阵列的多维设计空间中,以极快速但仍然准确的方式描述不同设计方案的功耗,面积和等待时间。首次,我们建议使用关系数据库系统,管理基于表的概率宏模型,该模型是借助新的非均匀参数采样技术构建的,用于在体系结构级别上对相应处理器阵列的平均功率进行估计。与最先进的商用门级布局后功率估算器相比,这导致功率估算速度在毫秒范围内,误差在10%以内。此外,我们的方法充分考虑了重要的功耗降低技术,例如时钟门控和操作数隔离,否则通常会忽略这些技术。在商业90 nm标准单元库中的几个案例研究中测试了可行性和准确性。实验结果表明,该技术具有出色的可扩展性:具有≈0.5×10〜6逻辑门电路复杂度的异构100核粗粒度处理器阵列,实现了信号处理算法,可以在不到一分钟的时间内分析功率和面积在标准家用PC上。由于当前没有针对粗粒度的软件可编程体系结构的已发布体系结构级功耗/面积估计框架,因此我们的工作试图解决这一缺陷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号