首页> 外文期刊>IEEE transactions on circuits and systems . I , Regular papers >Low-Latency, Low-Area, and Scalable Systolic-Like Modular Multipliers for GF(2m) Based on Irreducible All-One Polynomials
【24h】

Low-Latency, Low-Area, and Scalable Systolic-Like Modular Multipliers for GF(2m) Based on Irreducible All-One Polynomials

机译:基于不可约的全一多项式的GF(2m)低延迟,低面积和可伸缩的类似脉动的模数乘法器

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, an efficient recursive formulation is suggested for systolic implementation of canonical basis finite field multiplication over GF(2m) based on irreducible AOP. We have derived a recursive algorithm for the multiplication, and used that to design a regular and localized bit-level dependence graph (DG) for systolic computation. The bit-level regular DG is converted into a fine-grained DG by node-splitting, and mapped that into a parallel systolic architecture. Unlike most of the existing structures, it does not involve any global communications for modular reduction. The proposed bit-parallel systolic structure has the same cycle time as that of the best existing bit-parallel systolic structure [1], but involves significantly less number of registers. The proposed bit-parallel design has a scalable latency of l+⌈log2s⌉+1 cycles which is considerably low compared with those of existing systolic designs. Moreover, the proposed time-multiplexed structure is designed specifically for scalability of throughput and hardware-complexity to meet the area-time trade-off in resource-constrained applications while maintaining or reducing the overall latency. The ASIC synthesis report shows that the proposed bit-parallel structures offers nearly 30% saving of area and nearly 38% saving of power consumption over the best of the existing AOP-based systolic finite field multiplier.
机译:本文提出了一种有效的递归公式,用于基于不可约AOP的GF(2m)上规范基础有限域乘法的收缩实现。我们已经得出了用于乘法的递归算法,并使用该算法来设计用于脉动计算的规则和局部位级依赖图(DG)。通过节点拆分将位级别的常规DG转换为细粒度DG,然后将其映射到并行脉动体系结构中。与大多数现有结构不同,它不涉及任何用于模块化缩减的全局通信。所提出的位并行脉动结构与最佳的现有位并行脉动结构[1]的周期时间相同,但是所涉及的寄存器数量却明显减少。所提出的比特并行设计的可扩展延迟为l +⌈log2s⌉+ 1周期,与现有的脉动设计相比,该延迟相当低。此外,提出的时分复用结构专为吞吐量的可伸缩性和硬件复杂性而设计,以满足资源受限应用程序中的时空折衷,同时保持或减少了总体延迟。 ASIC综合报告显示,与现有的基于AOP的脉动有限域乘法器相比,所提出的位并行结构可节省近30%的面积,并节省近38%的功耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号