首页> 外文会议>IEEE Annual International Symposium on Field-Programmable Custom Computing Machines >Evaluating the Performance Efficiency of a Soft-Processor, Variable-Length, Parallel-Execution-Unit Architecture for FPGAs Using the RISC-V ISA
【24h】

Evaluating the Performance Efficiency of a Soft-Processor, Variable-Length, Parallel-Execution-Unit Architecture for FPGAs Using the RISC-V ISA

机译:使用RISC-V ISA评估FPGA的软处理器,可变长度,并行执行单元架构的性能效率

获取原文

摘要

FPGA-based soft-processors have traditionally focused on fixed-pipeline designs. These designs have limited Instruction Level Parallelism (ILP) and constrain the integration of tightly-coupled accelerators, potentially limiting the speedup they can provide. Recently, it has been proposed that replacing the fixed-pipeline datapath in these soft processors with variable-latency parallel-execution functional units could facilitate the integration of custom instructions. In this paper, we discuss and analyze the architectural impact and requirements for decoupling the pipeline stages and supporting parallel execution units. We find that, relative to a fixed pipeline architecture, our variable-latency, parallel-execution architecture: increases resource usage by 8% LUTs and 9% FlipFlops but results in up to a 42% increase in Instruction Per Cycle (IPC), with an overall improvement of 28% MIPS/LUT. Finally, we analyze the performance tradeoffs of tightly integrating custom instructions into a fixed pipeline versus parallel execution units architecture.
机译:传统上,基于FPGA的软处理器专注于固定流水线设计。这些设计具有有限的指令级并行性(ILP),并限制了紧密耦合的加速器的集成,从而有可能限制它们可以提供的加速。最近,已经提出用可变等待时间并行执行功能单元代替这些软处理器中的固定流水线数据路径可以促进定制指令的集成。在本文中,我们讨论并分析了架构影响以及将流水线阶段分离并支持并行执行单元的要求。我们发现,相对于固定的流水线体系结构,我们的可变延迟,并行执行体系结构:将资源使用量增加了8 \%的LUT和9 \%的触发器,但导致每周期指令(IPC)的增加高达42% ),整体提升了28%MIPS / LUT。最后,我们分析了将自定义指令紧密集成到固定流水线与并行执行单元体系结构之间的性能折衷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号