首页> 外文期刊>Very Large Scale Integration (VLSI) Systems, IEEE Transactions on >A New VLSI Architecture of Parallel Multiplier–Accumulator Based on Radix-2 Modified Booth Algorithm
【24h】

A New VLSI Architecture of Parallel Multiplier–Accumulator Based on Radix-2 Modified Booth Algorithm

机译:基于Radix-2改进Booth算法的并行乘法器-累加器VLSI新架构

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we proposed a new architecture of multiplier-and-accumulator (MAC) for high-speed arithmetic. By combining multiplication with accumulation and devising a hybrid type of carry save adder (CSA), the performance was improved. Since the accumulator that has the largest delay in MAC was merged into CSA, the overall performance was elevated. The proposed CSA tree uses 1's-complement-based radix-2 modified Booth's algorithm (MBA) and has the modified array for the sign extension in order to increase the bit density of the operands. The CSA propagates the carries to the least significant bits of the partial products and generates the least significant bits in advance to decrease the number of the input bits of the final adder. Also, the proposed MAC accumulates the intermediate results in the type of sum and carry bits instead of the output of the final adder, which made it possible to optimize the pipeline scheme to improve the performance. The proposed architecture was synthesized with 250, 180 and 130 $ mu$m, and 90 nm standard CMOS library. Based on the theoretical and experimental estimation, we analyzed the results such as the amount of hardware resources, delay, and pipelining scheme. We used Sakurai's alpha power law for the delay modeling. The proposed MAC showed the superior properties to the standard design in many ways and performance twice as much as the previous research in the similar clock frequency. We expect that the proposed MAC can be adapted to various fields requiring high performance such as the signal processing areas.
机译:在本文中,我们提出了一种用于高速算术的乘法器和累加器(MAC)的新体系结构。通过将乘法与累加相结合并设计出一种混合类型的进位保存加法器(CSA),可以提高性能。由于在MAC中具有最大延迟的累加器已合并到CSA中,因此整体性能得以提高。提出的CSA树使用基于1的补码基数2修改的Booth算法(MBA),并具有用于符号扩展的修改数组,以增加操作数的位密度。 CSA将进位传播到部分乘积的最低有效位,并预先生成最低有效位,以减少最终加法器的输入位数。而且,所提出的MAC以求和和进位的类型而不是最终加法器的输出来累积中间结果,这使得有可能优化流水线方案以提高性能。所提出的体系结构是用250、180和130μm以及90 nm标准CMOS库合成的。基于理论和实验估计,我们分析了诸如硬件资源量,延迟和流水线方案之类的结果。我们将樱井的α幂定律用于延迟建模。拟议的MAC在许多方面都显示出优于标准设计的性能,并且在类似的时钟频率下的性能是以前研究的两倍。我们期望所提出的MAC可以适应需要高性能的各种领域,例如信号处理领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号