首页> 外文会议>IEEE Annual International Symposium on Field-Programmable Custom Computing Machines >Exploring Writeback Designs for Efficiently Leveraging Parallel-Execution Units in FPGA-Based Soft-Processors
【24h】

Exploring Writeback Designs for Efficiently Leveraging Parallel-Execution Units in FPGA-Based Soft-Processors

机译:探索写回设计,以有效利用基于FPGA的软处理器中的并行执行单元

获取原文

摘要

Maximizing processor performance depends on maximizing the product of instruction throughput and clock frequency. Writeback mechanisms and forwarding networks heavily impact both of these properties along with the resource usage and scalability of the processor design. Furthermore, these mechanisms are typically multiplexer heavy which can make their implementation resource inefficient on FPGAs. In this paper, we explore multiple different writeback and result storage mechanisms using an FPGA-based RISC-V soft-processor (Taiga), exploring both exception-safe and non-exception-safe designs. Writeback mechanisms based on per-unit result storage and centralized storage are explored while leveraging FPGA specific resources such as LUTRAMs. We evaluate the designs based on their impact on instruction throughput, processor frequency, and scalability of both simultaneous instructions in-flight and the number of execution units. As each design has different characteristics, we focus on comparing and contrasting the designs. We find that across all designs, average IPC can vary by up to 11%, with a few designs reaching the maximum IPC of one for some benchmarks. Clock frequency is found to vary by up to 20% across the designs, but is not significantly impacted when increasing the number of execution units. Scaling up the instructions in-flight is found to have the greatest variability, with LUT usage increasing by 3% to 93% across the different designs. Overall, we find that under current constraints, a commit-buffer design provides the highest combination of performance and performance per LUT.
机译:最大化处理器性能取决于最大化指令吞吐量和时钟频率的乘积。回写机制和转发网络会严重影响这两个属性以及处理器设计的资源使用率和可伸缩性。此外,这些机制通常占用大量的多路复用器,这会使它们的实现资源在FPGA上效率低下。在本文中,我们使用基于FPGA的RISC-V软处理器(Taiga)探索了多种不同的写回和结果存储机制,并探讨了异常安全和非异常安全设计。在利用FPGA特定资源(例如LUTRAM)的同时,探索了基于单位结果存储和集中存储的回写机制。我们基于设计对指令吞吐量,处理器频率以及同时进行中的指令的可扩展性和执行单元数量的影响来评估设计。由于每个设计都有不同的特征,因此我们专注于比较和对比设计。我们发现,在所有设计中,平均IPC可能相差11%,有些设计达到某些基准的最大IPC。发现时钟频率在整个设计中的差异高达20%,但在增加执行单元数量时不会受到明显影响。发现在飞行中扩大指令的可变性最大,在不同设计中,LUT的使用率增加了3%至93%。总的来说,我们发现在当前的约束下,提交缓冲区设计可提供性能和每个LUT性能的最高组合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号