首页> 外文学位 >Architectural optimizations and synthesis tools for improved energy efficiency and faster design closure for FPGAs.
【24h】

Architectural optimizations and synthesis tools for improved energy efficiency and faster design closure for FPGAs.

机译:架构优化和综合工具可提高能效并更快地完成FPGA的设计封闭。

获取原文
获取原文并翻译 | 示例

摘要

FPGAs are evolving at a rapid pace with improved performance and logic density. However, power efficiency of FPGAs has continuously lagged behind, and hence power optimization of FPGAs is crucial. Trends in technology scaling makes leakage power a serious concern for designers; on the other hand, routing power is the dominant component of total power consumption in FPGAs. We propose a hierarchical look-up table (LUT) structure for FPGAs to improve leakage power consumption. We present an analysis on the number of inputs actually used by LUTs, and depending on the number of inputs used by the LUTs, we shut down SRAM cells, transistors, and multiplexers associated with the unused LUT inputs. Based on this technique, for 180nm technology, we report an average savings of 22.94% (as high as 64.22%) in leakage power for logic blocks. The savings will be even greater for technologies 90nm or below that are currently in use. We also propose a Dual-Vdd-dual-Vt interconnect architecture, where voltage scaling is applied within the programmable interconnect structure of the FPGA to reduce routing power consumption. Our experiments reveal that an average reduction of 23.45% (as high as 47%) in total interconnect power is achievable with 11.75% worst-case delay penalty.; Another major challenge in FPGA-based design is to comply with the resource and storage capacity of the target device. We propose an early estimation framework for the hardware cost before actually attempting the synthesis of a streaming accelerator on reconfigurable logic. Specifically, our proposed framework tackles the problem of pre-synthesis estimation of functional unit area cost, while incorporating the potential impact of resource constraints and different operator bitwidths on the final implementation. We evaluated our estimation technique by comparing the estimated area with the area of the synthesized design and the average estimation error is 9.3%. We also present a global resource sharing technique for pipelined CDFG synthesis and optimization techniques to reduce the area requirement of the stream queue buffers in reconfigurable accelerators.
机译:FPGA的发展日新月异,性能和逻辑密度得到了提高。但是,FPGA的功率效率一直落后于后面,因此FPGA的功率优化至关重要。技术扩展的趋势使泄漏功率成为设计人员的严重关注点。另一方面,路由功率是FPGA总功耗的主要组成部分。我们提出了一种用于FPGA的分层查找表(LUT)结构,以改善泄漏功耗。我们对LUT实际使用的输入数量进行了分析,并根据LUT使用的输入数量,关闭了与未使用的LUT输入相关的SRAM单元,晶体管和多路复用器。基于这种技术,对于180nm技术,我们报告逻辑块的泄漏功率平均节省22.94%(高达64.22%)。对于目前正在使用的90nm或以下的技术,节省的费用将更大。我们还提出了Dual-Vdd-dual-Vt互连架构,其中在FPGA的可编程互连结构内应用了电压缩放功能,以减少路由功耗。我们的实验表明,总互连功率平均降低23.45%(高达47%),最坏情况下的延迟损失为11.75%。基于FPGA的设计的另一个主要挑战是遵守目标设备的资源和存储容量。在实际尝试在可重配置逻辑上综合流式加速器之前,我们提出了一个硬件成本的早期估算框架。具体而言,我们提出的框架解决了功能单元面积成本的预综合估算问题,同时将资源约束和不同的操作员位宽对最终实现的潜在影响纳入了考虑范围。我们通过将估计面积与综合设计的面积进行比较来评估我们的估计技术,平均估计误差为9.3%。我们还为流水线CDFG合成和优化技术提出了一种全局资源共享技术,以减少可重配置加速器中流队列缓冲区的面积要求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号