...
【24h】

Improvements to Technology Mapping for LUT-Based FPGAs

机译:基于LUT的FPGA技术映射的改进

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents several orthogonal improvements to the state-of-the-art lookup table (LUT)-based field-programmable gate array (FPGA) technology mapping. The improvements target the delay and area of technology mapping as well as the runtime and memory requirements. 1) Improved cut enumeration computes all K-feasible cuts, without pruning, for up to seven inputs for the largest Microelectronics Center of North Carolina benchmarks. A new technique for on-the-fly cut dropping reduces, by orders of magnitude, the memory needed to represent cuts for large designs. 2) The notion of cut factorization is introduced, in which one computes a subset of cuts for a node and generates other cuts from that subset as needed. Two cut factorization schemes are presented, and a new algorithm that uses cut factorization for delay-oriented mapping for FPGAs with large LUTs is proposed. 3) Improved area recovery leads to mappings with the area, on average, 6% smaller than the previous best work while preserving the delay optimality when starting from the same optimized netlists. 4) Lossless synthesis accumulates alternative circuit structures seen during logic optimization. Extending the mapper to use structural choices reduces the delay, on average, by 6% and the area by 12%, compared with the previous work, while increasing the runtime 1.6 times. Performing five iterations of mapping with choices reduces the delay by 10% and the area by 19% while increasing the runtime eight times. These improvements, on top of the state-of-the-art methods for LUT mapping, are available in the package ABC
机译:本文提出了对基于最新查找表(LUT)的现场可编程门阵列(FPGA)技术映射的几种正交改进。这些改进针对技术映射的延迟和区域以及运行时和内存需求。 1)改进的切割枚举可计算北卡罗来纳州最大的微电子中心最大7个输入的所有K可行切割,而无需修剪。即时剪切掉落的新技术将代表大型设计的剪切所需的内存减少了几个数量级。 2)引入了割因子分解的概念,其中一个计算节点切割的子集,并根据需要从该子集生成其他割。提出了两种割因子分解方案,并提出了一种将割因子分解用于具有大LUT的FPGA的面向延迟的映射的新算法。 3)改进的区域恢复能力使区域映射比以前的最佳工作平均减少了6%,同时从相同的优化网表开始保留了延迟最优性。 4)无损综合会积累逻辑优化过程中看到的替代电路结构。与以前的工作相比,扩展映射器以使用结构选择可以平均减少6%的延迟和12%的延迟,同时将运行时间增加1.6倍。通过选择执行五次映射迭代,可将延迟减少10%,将面积减少19%,同时将运行时间增加八倍。除了用于LUT映射的最新方法外,这些改进还包含在ABC软件包中

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号